-
LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
Paper • 2106.13914 • Published • 1 -
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
Paper • 2506.15196 • Published • 3 -
Ascend HiFloat8 Format for Deep Learning
Paper • 2409.16626 • Published • 1 -
Recipes for Pre-training LLMs with MXFP8
Paper • 2506.08027 • Published • 1
zhangwenbin
ExceedZhang
AI & ML interests
None yet
Recent Activity
liked
a model
about 10 hours ago
Qwen/Qwen3-Reranker-0.6B
liked
a model
about 11 hours ago
zai-org/GLM-4.5-Air
upvoted
an
article
2 days ago
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
Organizations
None yet