AI Model Rankings

Best models for every use case, ranked by quality and sorted by minimum VRAM. Find the best model your hardware can run.

💬

Chat & General

General-purpose language models for conversation, writing, and reasoning

#ModelAuthorParamsMin VRAMAction
1SmolLM2 135MHuggingFace0.135B0.64GBCheck
2SmolLM2 360MHuggingFace0.36B0.75GBCheck
3Danube 3 500MH2O.ai0.5B0.8GBCheck
4Qwen 2.5 0.5BAlibaba0.5B0.96GBCheck
5TinyLlama 1.1BTinyLlama1.1B1.12GBCheck
6Llama 3.2 1B InstructMeta1.24B1.25GBCheck
7Gemma 3 1BGoogle1B1.25GBCheck
8SmolLM2 1.7BHuggingFace1.7B1.48GBCheck
9Falcon 3 1BTII1B1.48GBCheck
10Qwen 2.5 1.5BAlibaba1.5B1.54GBCheck
11DeepSeek R1 Distill 1.5BDeepSeek1.5B1.54GBCheck
12Granite 3.3 2BIBM2B1.94GBCheck
13EXAONE 3.5 2.4BLG AI2.4B2.03GBCheck
14StableLM Zephyr 3BStability AI3B2.09GBCheck
15Rocket 3BPansophic3B2.09GBCheck
16Gemma 2 2BGoogle2.6B2.09GBCheck
17Falcon 3 3BTII3B2.37GBCheck
18Llama 3.2 3B InstructMeta3.2B2.38GBCheck
19Qwen 2.5 3BAlibaba3B2.46GBCheck
20Danube 3 4BH2O.ai4B2.73GBCheck
21Phi-3.5 Mini 3.8BMicrosoft3.8B2.73GBCheck
22Gemma 3 4BGoogle4B2.82GBCheck
23Phi-4 Mini 3.8BMicrosoft3.8B2.82GBCheck
24Nemotron Mini 4BNVIDIA4B3.01GBCheck
25Yi 1.5 6B Chat01.AI6B3.92GBCheck
26Mistral 7B Instruct v0.3Mistral AI7.3B4.57GBCheck
27OpenChat 3.5 7BOpenChat7B4.57GBCheck
28OLMo 2 7BAllen AI7B4.67GBCheck
29InternLM 2.5 7BShanghai AI Lab7.7B4.89GBCheck
30EXAONE 3.5 7.8BLG AI7.8B4.94GBCheck
31Falcon 3 7BTII7B5GBCheck
32DeepSeek R1 Distill 8BDeepSeek8B5.08GBCheck
33Llama 3.1 8B InstructMeta8B5.08GBCheck
34Granite 3.3 8BIBM8B5.1GBCheck
35Qwen 2.5 7B InstructAlibaba7.6B5.3GBCheck
36Yi 1.5 9B Chat01.AI9B5.46GBCheck
37Gemma 2 9B InstructGoogle9.2B5.87GBCheck
38Falcon 3 10BTII10B6.36GBCheck
39Solar 10.7BUpstage10.7B6.52GBCheck
40Gemma 3 12BGoogle12B7.3GBCheck
41Mistral Nemo 12BMistral AI12B7.46GBCheck
42Qwen 2.5 14BAlibaba14B8.87GBCheck
43Phi-4Microsoft14B8.93GBCheck
44Mistral Small 22BMistral AI22B12.93GBCheck
45Gemma 3 27BGoogle27B15.91GBCheck
46Qwen 2.5 32BAlibaba32B18.99GBCheck
47Llama 3.1 70B InstructMeta70B40.1GBCheck
💻

Coding

Specialized models for code generation, completion, and debugging

#ModelAuthorParamsMin VRAMAction
1Qwen 2.5 Coder 0.5BAlibaba0.5B1.13GBCheck
2DeepSeek Coder 1.3BDeepSeek1.3B1.31GBCheck
3Yi Coder 1.5B01.AI1.5B1.4GBCheck
4Qwen 2.5 Coder 1.5BAlibaba1.5B1.54GBCheck
5CodeGemma 2BGoogle2B2.02GBCheck
6Stable Code 3BStability AI3B2.09GBCheck
7StarCoder2 3BBigCode3B2.26GBCheck
8Qwen 2.5 Coder 3BAlibaba3B2.46GBCheck
9Code Llama 7BMeta7B4.3GBCheck
10DeepSeek Coder 6.7BDeepSeek6.7B4.3GBCheck
11StarCoder2 7BBigCode7B4.66GBCheck
12Qwen 2.5 Coder 7BAlibaba7.6B4.86GBCheck
13Yi Coder 9B01.AI9B5.46GBCheck
14CodeGemma 7BGoogle8.5B5.46GBCheck
15Code Llama 13B InstructMeta13B7.83GBCheck
16Qwen 2.5 Coder 14BAlibaba14B8.87GBCheck
🎨

Image Generation

Text-to-image models for art, photos, and design

#ModelAuthorParamsMin VRAMAction
1Stable Diffusion 2.1 Base (CoreML)Stability AI / Apple0.86B1.56GBCheck
2Stable Diffusion 1.5 (GGUF)Runway / GPUStack0.86B2.13GBCheck
3Stable Diffusion 1.5 (CoreML)Runway0.86B2.5GBCheck
4Stable Diffusion 2.1 (GGUF)Stability AI0.86B2.66GBCheck
5Stable Diffusion XL (CoreML)Stability AI3.5B3.34GBCheck
6SDXL Turbo (GGUF)Stability AI3.5B5GBCheck
7Stable Diffusion 3 Medium (GGUF)Stability AI2.5B9.15GBCheck
8FLUX.1 Schnell (GGUF)Black Forest Labs12B14GBCheck
9FLUX.1 Dev (GGUF)Black Forest Labs12B14GBCheck
🎤

Speech-to-Text

Transcription and speech recognition models

#ModelAuthorParamsMin VRAMAction
1Whisper Tiny English (Quantized)OpenAI0.039B0.1GBCheck
2Whisper TinyOpenAI0.039B0.2GBCheck
3Whisper BaseOpenAI0.074B0.3GBCheck
4Whisper Base EnglishOpenAI0.074B0.3GBCheck
5Whisper SmallOpenAI0.24B0.95GBCheck
6Distil-Whisper Large v3HuggingFace0.76B1.92GBCheck
7Whisper MediumOpenAI0.77B1.93GBCheck
8Whisper Large v3 TurboOpenAI0.81B2.01GBCheck
9Whisper Large v3OpenAI1.55B3.38GBCheck
🔊

Text-to-Speech

Voice synthesis and text-to-speech models

#ModelAuthorParamsMin VRAMAction
1Piper TTS - Amy (English)Rhasspy0.02B0.15GBCheck
2Piper TTS - Lessac (English)Rhasspy0.02B0.15GBCheck
3Piper TTS - Spanish (MLS)Rhasspy0.02B0.15GBCheck
4Piper TTS - German (Thorsten)Rhasspy0.02B0.15GBCheck
5Piper TTS - Chinese (Huayan)Rhasspy0.02B0.15GBCheck
6Piper TTS - Japanese (Kokoro)Rhasspy0.02B0.15GBCheck
7Piper TTS - KoreanRhasspy0.02B0.15GBCheck
8Piper TTS - Russian (Irina)Rhasspy0.02B0.15GBCheck
9Piper TTS - Portuguese (Faber)Rhasspy0.02B0.15GBCheck
10Piper TTS - Arabic (Kareem)Rhasspy0.02B0.15GBCheck
11Piper TTS - French (Siwis)Rhasspy0.02B0.53GBCheck
12Piper TTS - Italian (Riccardo)Rhasspy0.02B0.53GBCheck
13Piper TTS - LibriTTS-R (English)Rhasspy0.02B0.57GBCheck
14Kokoro 82M TTSKokoro0.082B0.58GBCheck
🎵

Audio Generation

AI music and audio creation

#ModelAuthorParamsMin VRAMAction
1MusicGen SmallMeta0.3B0.78GBCheck
👁️

Multimodal / Vision

Models that understand both images and text

#ModelAuthorParamsMin VRAMAction
1Qwen2-VL 2BAlibaba2.2B1.42GBCheck
2Moondream 2Moondream1.8B1.5GBCheck
3MiniCPM-V 2.6OpenBMB2B2.1GBCheck
4PaliGemma 3BGoogle3B2.5GBCheck
5Phi-3.5 VisionMicrosoft4.2B3.2GBCheck
6LLaVA 1.6 7BLLaVA7B5GBCheck
🔗

Embedding

Text embedding models for search and retrieval

#ModelAuthorParamsMin VRAMAction
1BGE Small EN v1.5BAAI0.033B0.1GBCheck
2Snowflake Arctic Embed SSnowflake0.033B0.1GBCheck
3all-MiniLM-L6-v2Sentence Transformers0.023B0.1GBCheck
4Nomic Embed Text v1.5Nomic AI0.137B0.3GBCheck
5BGE Large EN v1.5BAAI0.335B0.83GBCheck

Can't Run the Model You Want?

Cloud GPUs give you instant access to any model, any size.