Active filters: awq
EliasOenal/MiniMax-M2.5-Hybrid-AWQ-W4A16G128-Attn-fp8_e4m3-KV-fp8_e4m3
Text Generation
• 34B • Updated
• 242
• 10
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated
• 862k
• 13
mratsim/MiniMax-M2.5-BF16-INT4-AWQ
Text Generation
• 39B • Updated
• 43.9k
• 27
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated
• 755
• 6
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated
• 3.73k
• 5
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated
• 43.4k
• 10
mratsim/MiniMax-M2.5-FP8-INT4-AWQ
Text Generation
• 39B • Updated
• 4.91k
• 8
QuantTrio/Qwen3.5-397B-A17B-AWQ
Image-Text-to-Text
• Updated
• 3
Text Generation
• 33B • Updated
• 449k
• 129
TheHouseOfTheDude/GLM-4.7-Flash_AWQ
Text Generation
• Updated
• 774
• 3
MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-AWQ
Text Generation
• 141B • Updated
• 40.2k
• 13
stelterlab/SauerkrautLM-v2-14b-SFT-AWQ
15B • Updated
• 1
Qwen/Qwen2.5-Coder-14B-Instruct-AWQ
Text Generation
• 15B • Updated
• 159k
• 16
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
Text Generation
• 24B • Updated
• 109k
• 26
gaunernst/gemma-3-4b-it-int4-awq
Image-Text-to-Text
• Updated
• 56.2k
• 6
gaunernst/gemma-3-27b-it-int4-awq
Image-Text-to-Text
• 6B • Updated
• 15.8k
• 38
Text Generation
• 33B • Updated
• 3.05k
• 1
QuixiAI/Qwen3-235B-A22B-AWQ
Text Generation
• 235B • Updated
• 670
• 14
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
Text Generation
• 8B • Updated
• 483
• 5
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation
• 31B • Updated
• 4.93k
• 4
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated
• 327k
• 40
QuantTrio/Qwen3-VL-30B-A3B-Thinking-AWQ
Text Generation
• 31B • Updated
• 3.51k
• 12
QuantTrio/Qwen3-VL-32B-Thinking-AWQ
Image-Text-to-Text
• 33B • Updated
• 1.3k
• 7
abhishekchohan/maesar-VL-32B-AWQ
33B • Updated
• 36
• 1
ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16
Text Generation
• 33B • Updated
• 9
• 2
ModelCloud/opt-125m-llm-awq
0.2B • Updated
• 115
• 1
Text Generation
• Updated
• 9.54k
• 27
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated
• 124k
• 7
sasa2000/Qwen3-4B-Instruct-2507-heretic-AWQ-4bit
4B • Updated
• 16
• 1
groxaxo/Qwen3-4B-Instruct-2507-heretic-W8A16
Text Generation
• 1B • Updated
• 6
• 1