NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-0.95 Text Generation • 8B • Updated Feb 18, 2025 • 6
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-decrease_linear-1.0to0.95 Text Generation • 8B • Updated Feb 18, 2025 • 5
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-increase_linear_0.95to1.0 Text Generation • 8B • Updated Feb 18, 2025 • 6
NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0 Text Generation • 8B • Updated Feb 18, 2025 • 7
NoManDeRY/DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-0.95 Text Generation • 8B • Updated Feb 18, 2025 • 8
NoManDeRY/DPO-Shift-Llama-3-8B-Ultrafeedback-fixed-1.0 Text Generation • 8B • Updated Feb 18, 2025 • 12