Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Lunzima 
posted an update Mar 12, 2025
Post
1391
I'm currently experimenting with the SFT dataset Lunzima/alpaca_like_dataset to further boost the performance of NQLSG-Qwen2.5-14B-MegaFusion-v9.x. This includes data sourced from DeepSeek-R1 or other cleaned results (excluding CoTs). Additionally, datasets that could potentially enhance the model's performance in math and programming/code, as well as those dedicated to specific uses like Swahili, are part of the mix.
@sometimesanotion @sthenno @wanlige

I don't know if the performance of Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v9.2 has improved or regressed because https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/ is stuck.

In this post