Large Language Models
General-purpose language models for text generation, reasoning, and conversation
60 models available
LLaMA 3.1 405B
405B9.4MetaMeta's largest and most capable open-source language model with 405 billion parameters, offering state-of-the-art performance across reasoning, coding, and multilingual tasks.
LLaMA 3.1 70B
70B9.1MetaA powerful 70B parameter model that balances performance and efficiency, ideal for production deployments requiring high-quality outputs.
LLaMA 3.1 8B
8B8.2MetaA compact yet capable model perfect for edge deployment and resource-constrained environments while maintaining strong performance.
Mixtral 8x7B
46.7B (8x7B MoE)8.7Mistral AIA sparse mixture-of-experts model that matches or outperforms LLaMA 2 70B while being faster and more efficient through its innovative architecture.
Mixtral 8x22B
141B (8x22B MoE)9Mistral AIMistral's largest open model with 141B total parameters, offering exceptional performance across all tasks with efficient sparse activation.
Qwen 2.5 72B
72B8.9Alibaba CloudAlibaba's flagship open model with exceptional multilingual capabilities, particularly strong in Chinese and Asian languages.
Gemma 2 27B
27B8.5GoogleGoogle's open model built on Gemini research, offering strong performance with efficient architecture and safety features.
Phi-3 Medium
14B8.3MicrosoftMicrosoft's efficient small language model that punches above its weight class with strong reasoning and coding abilities.
Falcon 180B
180B8.8TIIOne of the largest open-source models with 180B parameters, trained on 3.5 trillion tokens.
Yi 34B
34B8.401.AIHigh-performance bilingual model excelling in both English and Chinese tasks.
Mistral 7B
7B8.1Mistral AIEfficient 7B model that outperforms larger models through superior architecture.
Vicuna 33B
33B7.8LMSYSFine-tuned LLaMA model trained on user conversations, excelling at dialogue.
MPT 30B
30B7.9MosaicMLCommercially-usable model with strong performance and flexible licensing.
Nous Hermes 2 Mixtral
46.7B (8x7B MoE)8.5Nous ResearchFine-tuned Mixtral model optimized for instruction following and reasoning.
OpenHermes 2.5
7B7.7TekniumHigh-quality fine-tune focused on instruction following and helpfulness.
Zephyr 7B Beta
7B7.8Hugging FaceAligned Mistral 7B model optimized for helpful, harmless responses.
Starling 7B Alpha
7B7.9BerkeleyRLAIF-trained model achieving strong performance through reinforcement learning.
Solar 10.7B
10.7B7.8UpstageDepth-upscaled model achieving strong performance through innovative architecture.
Orca 2
13B8.3MicrosoftReasoning-focused model trained with progressive learning techniques.
Platypus 2
70B8.4Platypus TeamFine-tuned model optimized for STEM and logical reasoning.
Airoboros
70B8.2Jon DurbinInstruction-tuned model with strong creative writing abilities.
Dolphin 2.5
70B8.1Eric HartfordUncensored model trained for helpfulness without alignment restrictions.
Samantha
70B7.8Eric HartfordCompanion-focused model trained for empathetic conversation.
Nous Capybara
34B8.5Nous ResearchLong-context model optimized for extended conversations and documents.
OpenChat 3.5
7B8OpenChatC-RLFT trained model achieving strong performance with efficient training.
Neural Chat
7B7.9IntelIntel-optimized model for efficient deployment on Intel hardware.
StableLM 2
12B7.8Stability AIEfficient language model with strong performance for its size.
Persimmon 8B
8B7.9AdeptEfficient model with strong performance from Adept AI.
Llama 2 70B
70B8.3MetaPrevious generation Meta model, still widely used and capable.
Llama 2 13B
13B7.8MetaMid-size LLaMA 2 model balancing performance and efficiency.
Llama 2 7B
7B7.3MetaCompact LLaMA 2 model for efficient deployment.
Qwen 1.5 110B
110B8.8Alibaba CloudLarge Qwen model with exceptional multilingual capabilities.
Qwen 1.5 72B
72B8.4Alibaba CloudPrevious generation Qwen model, still highly capable.
Gemma 2 9B
9B7.9GoogleCompact Gemma 2 model with strong performance for its size.
Gemma 2 2B
2B7.2GoogleUltra-compact Gemma 2 for extreme edge deployment.
Phi-3 Mini
3.8B7.5MicrosoftSmallest Phi-3 model optimized for mobile and edge deployment.
Phi-3 Small
7B7.9MicrosoftBalanced Phi-3 model with good performance and efficiency.
Falcon 40B
40B8.1TIIMid-size Falcon model with strong performance.
Falcon 7B
7B7.6TIICompact Falcon model for efficient deployment.
Yi 6B
6B7.701.AICompact Yi model with strong bilingual capabilities.
Baichuan 2 13B
13B7.8BaichuanChinese-focused model with strong performance in Chinese tasks.
InternLM 2
20B8.3Shanghai AI LabAdvanced Chinese model with strong reasoning capabilities.
ChatGLM 3
6B7.9Tsinghua UniversityBilingual conversational model optimized for Chinese and English.
BLOOM 176B
176B8.2BigScienceMassively multilingual model trained on 46 languages.
BLOOM 7B
7B7.4BigScienceCompact BLOOM model for efficient multilingual deployment.
OLMo 7B
7B7.7AI2Fully open language model with complete training data and code.
Pythia 12B
12B7.6EleutherAISuite of models for studying training dynamics and scaling laws.
GPT-J 6B
6B7.1EleutherAIEarly open-source GPT-style model, historically significant.
GPT-NeoX 20B
20B7.7EleutherAILarge-scale open-source model from EleutherAI.
RedPajama 7B
7B7.3TogetherOpen reproduction of LLaMA with fully open training data.
OpenLLaMA 13B
13B7.4OpenLM ResearchOpen reproduction of LLaMA with permissive licensing.
StableLM Zephyr 3B
3B7.3Stability AICompact aligned model for helpful, harmless responses.
TinyLlama 1.1B
1.1B6.8TinyLlama TeamUltra-compact model for extreme edge deployment.
H2O-Danube 1.8B
1.8B7.1H2O.aiEfficient small model optimized for enterprise deployment.
Amber
7B7.2LLM360Fully open model with complete training process transparency.
BGE Large
335M9BAAIHigh-quality embedding model for semantic search and retrieval.
E5 Large
335M8.7MicrosoftText embedding model with strong performance on retrieval tasks.
BGE M3
568M9.1BAAIMulti-lingual, multi-functionality, multi-granularity embedding model.
GTE Large
335M8.6AlibabaGeneral text embedding model with strong performance.
Instructor XL
335M8.5HKUNLPInstruction-based embedding model for customizable representations.