Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

7,294

Full-text search

Active filters: awq

EliasOenal/MiniMax-M2.5-Hybrid-AWQ-W4A16G128-Attn-fp8_e4m3-KV-fp8_e4m3

Text Generation • 34B • Updated 9 days ago • 242 • 10

bullpoint/Qwen3-Coder-Next-AWQ-4bit

Text Generation • 14B • Updated 23 days ago • 862k • 13

mratsim/MiniMax-M2.5-BF16-INT4-AWQ

Text Generation • 39B • Updated 9 days ago • 43.9k • 27

QuantTrio/Qwen3.5-122B-A10B-AWQ

Image-Text-to-Text • 125B • Updated about 8 hours ago • 755 • 6

QuantTrio/Qwen3.5-35B-A3B-AWQ

Image-Text-to-Text • 36B • Updated about 8 hours ago • 3.73k • 5

QuantTrio/MiniMax-M2.5-AWQ

Text Generation • 229B • Updated 10 days ago • 43.4k • 10

mratsim/MiniMax-M2.5-FP8-INT4-AWQ

Text Generation • 39B • Updated 9 days ago • 4.91k • 8

QuantTrio/Qwen3.5-397B-A17B-AWQ

Image-Text-to-Text • Updated about 8 hours ago • 3

Qwen/Qwen3-32B-AWQ

Text Generation • 33B • Updated May 21, 2025 • 449k • 129

TheHouseOfTheDude/GLM-4.7-Flash_AWQ

Text Generation • Updated 28 days ago • 774 • 3

MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-AWQ

Text Generation • 141B • Updated Apr 18, 2024 • 40.2k • 13

stelterlab/SauerkrautLM-v2-14b-SFT-AWQ

15B • Updated Nov 5, 2024 • 1

Qwen/Qwen2.5-Coder-14B-Instruct-AWQ

Text Generation • 15B • Updated Jan 12, 2025 • 159k • 16

stelterlab/Mistral-Small-24B-Instruct-2501-AWQ

Text Generation • 24B • Updated Mar 30, 2025 • 109k • 26

gaunernst/gemma-3-4b-it-int4-awq

Image-Text-to-Text • Updated Apr 6, 2025 • 56.2k • 6

gaunernst/gemma-3-27b-it-int4-awq

Image-Text-to-Text • 6B • Updated Apr 6, 2025 • 15.8k • 38

CobraMamba/Qwen3-32B-AWQ

Text Generation • 33B • Updated Apr 30, 2025 • 3.05k • 1

QuixiAI/Qwen3-235B-A22B-AWQ

Text Generation • 235B • Updated May 4, 2025 • 670 • 14

stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ

Text Generation • 8B • Updated Jun 4, 2025 • 483 • 5

QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ

Text Generation • 31B • Updated Sep 5, 2025 • 4.93k • 4

QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ

Text Generation • 31B • Updated Oct 8, 2025 • 327k • 40

QuantTrio/Qwen3-VL-30B-A3B-Thinking-AWQ

Text Generation • 31B • Updated Oct 8, 2025 • 3.51k • 12

QuantTrio/Qwen3-VL-32B-Thinking-AWQ

Image-Text-to-Text • 33B • Updated Dec 3, 2025 • 1.3k • 7

abhishekchohan/maesar-VL-32B-AWQ

33B • Updated Oct 27, 2025 • 36 • 1

ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16

Text Generation • 33B • Updated Oct 30, 2025 • 9 • 2

ModelCloud/opt-125m-llm-awq

0.2B • Updated Dec 12, 2025 • 115 • 1

QuantTrio/GLM-4.7-AWQ

Text Generation • Updated Dec 29, 2025 • 9.54k • 27

QuantTrio/GLM-4.7-Flash-AWQ

Text Generation • 31B • Updated Jan 21 • 124k • 7

sasa2000/Qwen3-4B-Instruct-2507-heretic-AWQ-4bit

4B • Updated 26 days ago • 16 • 1

groxaxo/Qwen3-4B-Instruct-2507-heretic-W8A16

Text Generation • 1B • Updated 9 days ago • 6 • 1