clembench-playpen/Qwen2-7B-DPO_dialogue
Updated
clembench-playpen/Qwen2-7B-DPO_turn
Updated
clembench-playpen/Qwen2-7B-SFT_merged
Text Generation
•
8B
•
Updated
•
11
clembench-playpen/Llama8B_DPO_turn_solved
Updated
clembench-playpen/Qwen2-7B-Instruct
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_turn
Updated
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_dialogue
Updated
clembench-playpen/Qwen2.5-7B-Instruct_dialogue
Updated
clembench-playpen/Mistral-Small-24B-Instruct-less-steps_playpen_SFT-e3_DFINAL_0.35K-steps
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_copy_turn
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_dialogue
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_copy
Text Generation
•
8B
•
Updated
•
4
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_turn
Updated
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_full_precision
Updated
clembench-playpen/Llama-3.1-8B-Instruct_dialogue
Updated
clembench-playpen/Llama-3.1-70B-Instruct_dialogue
Updated
clembench-playpen/meta-llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_turn_training
Updated
clembench-playpen/meta-llama-3.1-8B-Instruct_turn_training
Updated
clembench-playpen/llama3.1_8B_DPO_dialogue______Player1
Updated
clembench-playpen/llama3.1_8B_DPO_turn-level_10Klimit
Updated
clembench-playpen/llama3.1_8B_DPO_turn-level_10Klimit_backup
Updated
clembench-playpen/llama3.1_8B_DPO_from_fp_merged_full_precision
Text Generation
•
8B
•
Updated
•
4
clembench-playpen/llama3.1_8B_DPO_from_fp
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision
Text Generation
•
8B
•
Updated
•
9
clembench-playpen/llama3.1-70B_DPO_noSFT
Updated
clembench-playpen/Mistral-Small-24B-Instruct-2501-unsloth-bnb-4bit_KTO_Final_KTO_noSFT
Updated
clembench-playpen/Mistral_DPO_noSFT
Updated
clembench-playpen/Mistral-Small-24B-Instruct-2501_KTO_Final_KTO_noSFT
Updated
clembench-playpen/Mistral-Small-24B-Instruct-rehearsal_playpen_SFT-e3_DABL02_0.82K-steps
Updated
clembench-playpen/Mistral-Small-24B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.6K-steps
Updated