FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference (arXiv:2502.20766, published Feb 28, 2025)