On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models
Paper
•
2510.09008
•
Published
•
16
None defined yet.
LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
torch.backends.cuda.enable_cudnn_sdp(False)bash
git clone https://github.com/Beomi/InfiniTransformer
bash
pip install -r requirements.txt
pip install -e git+https://github.com/huggingface/transformers.git@b109257f4f#egg=transformers
./train.gemma.infini.noclm.1Mseq.sh.