AI & ML interests
None yet
Organizations
mlxha/Qwen2.5-7B-Instruct-grpo-medmcqa-medi70
Text Generation
•
Updated
•
4
mlxha/Llama-3.1-8B-Instruct-GRPO-medmcqa
Text Generation
•
Updated
•
13
mlxha/llama8b-sft-grpo-medmcqa
Text Generation
•
Updated
•
6
mlxha/medicouenne7b-grpo-medmcqa
Text Generation
•
Updated
•
6
mlxha/Qwen3-8B-grpo-medmcqa-medi70
Text Generation
•
Updated
•
9
•
1
mlxha/Qwen3-8B-grpo-medmcqa-v2
Text Generation
•
Updated
•
30
•
1
mlxha/Qwen3-32B-grpo-medmcqa
mlxha/Qwen3-4B-grpo-medmcqa
Text Generation
•
Updated
•
81
•
2
mlxha/Qwen3-8B-grpo-medmcqa
Text Generation
•
Updated
•
8
•
2
mlxha/DeepSeek-R1-Distill-Llama-8B-GRPO-medmcqa-notemplate
mlxha/Qwen-2.5-3B-grpo-medmcqa
Text Generation
•
Updated
•
8
mlxha/Qwen-2.5-3B-grpo-code
Text Generation
•
Updated
•
9
mlxha/DeepSeek-R1-Distill-Llama-8B-GRPO-code-2
mlxha/DeepSeek-R1-Distill-Llama-8B-GRPO-code
mlxha/DeepSeek-R1-Distill-Llama-8B-notemplate
Text Generation
•
Updated
•
9
mlxha/DeepSeek-R1-Distill-Llama-8B-GRPO-medmcqa
Text Generation
•
Updated
•
5
mlxha/Qwen2.5-1.5B-Open-R1-Code-GRPO
mlxha/Qwen-2.5-7B-GRPO-test2
Updated
mlxha/Qwen-2.5-7B-GRPO-test
Text Generation
•
Updated
•
8
mlxha/Qwen-2.5-7B-Simple-RL
Updated
mlxha/Qwen2.5-1.5B-Open-R1-Distill
Updated
mlxha/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-sft-dpo-final
Text Generation
•
Updated
•
5
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-final-v2
Text Generation
•
Updated
•
5
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-sft-final-v2
Text Generation
•
Updated
•
5
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-final
Text Generation
•
Updated
•
7
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-sft-final
Text Generation
•
Updated
•
8
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-v2
Text Generation
•
Updated
•
2
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-sft-mmlu
Text Generation
•
Updated
•
8
mlxha/mnlp-openaint-phi3-mini-mcq-dpo-sft
Text Generation
•
Updated
•
7