Large Language Models

General-purpose language models for text generation, reasoning, and conversation

60 models available

LLaMA 3.1 405B

Meta's largest and most capable open-source language model with 405 billion parameters, offering state-of-the-art performance across reasoning, coding, and multilingual tasks.

Multilingual supportFunction callingLong contextReasoningCode generation

LLaMA 3.1 70B

A powerful 70B parameter model that balances performance and efficiency, ideal for production deployments requiring high-quality outputs.

Multilingual supportFunction callingLong contextEfficient inference

LLaMA 3.1 8B

A compact yet capable model perfect for edge deployment and resource-constrained environments while maintaining strong performance.

Fast inferenceLow resource usageLong contextEdge deployment

Mixtral 8x7B

46.7B (8x7B MoE)8.7Mistral AI

A sparse mixture-of-experts model that matches or outperforms LLaMA 2 70B while being faster and more efficient through its innovative architecture.

Mixture of ExpertsFast inferenceMultilingualFunction calling

Mixtral 8x22B

141B (8x22B MoE)9Mistral AI

Mistral's largest open model with 141B total parameters, offering exceptional performance across all tasks with efficient sparse activation.

Large MoE architectureExtended contextMultilingualAdvanced reasoning

Qwen 2.5 72B

72B8.9Alibaba Cloud

Alibaba's flagship open model with exceptional multilingual capabilities, particularly strong in Chinese and Asian languages.

Exceptional multilingualLong contextStrong reasoningCode generation

Gemma 2 27B

Google's open model built on Gemini research, offering strong performance with efficient architecture and safety features.

Efficient architectureSafety featuresFast inferenceInstruction following

Phi-3 Medium

14B8.3Microsoft

Microsoft's efficient small language model that punches above its weight class with strong reasoning and coding abilities.

Efficient architectureLong contextStrong reasoningFast inference

Falcon 180B

One of the largest open-source models with 180B parameters, trained on 3.5 trillion tokens.

Large scaleMultilingualStrong reasoningApache 2.0

Yi 34B

High-performance bilingual model excelling in both English and Chinese tasks.

BilingualLong contextFast inferenceStrong reasoning

Mistral 7B

7B8.1Mistral AI

Efficient 7B model that outperforms larger models through superior architecture.

EfficientFast inferenceLong contextSliding window attention

Vicuna 33B

Fine-tuned LLaMA model trained on user conversations, excelling at dialogue.

ConversationalInstruction followingDialogue optimization

MPT 30B

Commercially-usable model with strong performance and flexible licensing.

Commercial useALiBi positional encodingEfficient training

Nous Hermes 2 Mixtral

46.7B (8x7B MoE)8.5Nous Research

Fine-tuned Mixtral model optimized for instruction following and reasoning.

Instruction followingReasoningFunction callingLong context

OpenHermes 2.5

High-quality fine-tune focused on instruction following and helpfulness.

Instruction followingHelpful responsesFast inference

Zephyr 7B Beta

7B7.8Hugging Face

Aligned Mistral 7B model optimized for helpful, harmless responses.

AlignedHelpfulSafe responsesFast

Starling 7B Alpha

RLAIF-trained model achieving strong performance through reinforcement learning.

RLAIF trainingStrong reasoningHelpful responses

Solar 10.7B

10.7B7.8Upstage

Depth-upscaled model achieving strong performance through innovative architecture.

Depth upscalingEfficientStrong performance

Orca 2

13B8.3Microsoft

Reasoning-focused model trained with progressive learning techniques.

Strong reasoningProgressive learningInstruction following

Platypus 2

70B8.4Platypus Team

Fine-tuned model optimized for STEM and logical reasoning.

STEM focusLogical reasoningMathScience

Airoboros

70B8.2Jon Durbin

Instruction-tuned model with strong creative writing abilities.

Creative writingInstruction followingRoleplayVersatile

Dolphin 2.5

70B8.1Eric Hartford

Uncensored model trained for helpfulness without alignment restrictions.

UncensoredHelpfulVersatileInstruction following

Samantha

70B7.8Eric Hartford

Companion-focused model trained for empathetic conversation.

EmpatheticConversationalCompanion-likeHelpful

Nous Capybara

34B8.5Nous Research

Long-context model optimized for extended conversations and documents.

Extremely long contextConversationDocument analysisReasoning

OpenChat 3.5

C-RLFT trained model achieving strong performance with efficient training.

C-RLFT trainingEfficientFastGood performance

Neural Chat

Intel-optimized model for efficient deployment on Intel hardware.

Intel optimizedFast inferenceEfficientGood performance

StableLM 2

12B7.8Stability AI

Efficient language model with strong performance for its size.

EfficientGood performanceFast inferenceVersatile

Persimmon 8B

Efficient model with strong performance from Adept AI.

EfficientLong contextFastGood performance

Llama 2 70B

Previous generation Meta model, still widely used and capable.

Strong performanceWell-testedWide supportReliable

Llama 2 13B

Mid-size LLaMA 2 model balancing performance and efficiency.

BalancedEfficientReliableWell-supported

Llama 2 7B

Compact LLaMA 2 model for efficient deployment.

FastEfficientLow resourceReliable

Qwen 1.5 110B

110B8.8Alibaba Cloud

Large Qwen model with exceptional multilingual capabilities.

MultilingualLong contextStrong reasoningVersatile

Qwen 1.5 72B

72B8.4Alibaba Cloud

Previous generation Qwen model, still highly capable.

MultilingualLong contextGood performanceApache 2.0

Gemma 2 9B

Compact Gemma 2 model with strong performance for its size.

EfficientFastGood performanceGoogle research

Gemma 2 2B

Ultra-compact Gemma 2 for extreme edge deployment.

Ultra-compactVery fastLow resourceEdge deployment

Phi-3 Mini

3.8B7.5Microsoft

Smallest Phi-3 model optimized for mobile and edge deployment.

Ultra-efficientLong contextFastMIT license

Phi-3 Small

Balanced Phi-3 model with good performance and efficiency.

EfficientLong contextFastMIT license

Falcon 40B

Mid-size Falcon model with strong performance.

Strong performanceApache 2.0Well-trainedReliable

Falcon 7B

Compact Falcon model for efficient deployment.

EfficientFastApache 2.0Reliable

Yi 6B

Compact Yi model with strong bilingual capabilities.

BilingualEfficientFastApache 2.0

Baichuan 2 13B

Chinese-focused model with strong performance in Chinese tasks.

Chinese focusBilingualGood performanceEfficient

InternLM 2

20B8.3Shanghai AI Lab

Advanced Chinese model with strong reasoning capabilities.

Long contextChinese focusStrong reasoningApache 2.0

ChatGLM 3

6B7.9Tsinghua University

Bilingual conversational model optimized for Chinese and English.

BilingualLong contextConversationalEfficient

BLOOM 176B

176B8.2BigScience

Massively multilingual model trained on 46 languages.

46 languagesMultilingualLarge scaleCommunity-driven

BLOOM 7B

7B7.4BigScience

Compact BLOOM model for efficient multilingual deployment.

46 languagesMultilingualEfficientCommunity-driven

OLMo 7B

Fully open language model with complete training data and code.

Fully openTraining data availableReproducibleApache 2.0

Pythia 12B

12B7.6EleutherAI

Suite of models for studying training dynamics and scaling laws.

Research-focusedTraining checkpointsReproducibleApache 2.0

GPT-J 6B

6B7.1EleutherAI

Early open-source GPT-style model, historically significant.

Historic significanceApache 2.0EfficientWell-supported

GPT-NeoX 20B

20B7.7EleutherAI

Large-scale open-source model from EleutherAI.

Large scaleApache 2.0Open sourceResearch-friendly

RedPajama 7B

Open reproduction of LLaMA with fully open training data.

Fully openReproducibleApache 2.0Open data

OpenLLaMA 13B

13B7.4OpenLM Research

Open reproduction of LLaMA with permissive licensing.

Open reproductionApache 2.0PermissiveResearch-friendly

StableLM Zephyr 3B

3B7.3Stability AI

Compact aligned model for helpful, harmless responses.

AlignedCompactFastHelpful

TinyLlama 1.1B

1.1B6.8TinyLlama Team

Ultra-compact model for extreme edge deployment.

Ultra-compactVery fastLow resourceApache 2.0

H2O-Danube 1.8B

Efficient small model optimized for enterprise deployment.

EfficientEnterprise-focusedFastApache 2.0

Amber

Fully open model with complete training process transparency.

Fully transparentTraining data availableReproducibleApache 2.0

BGE Large

High-quality embedding model for semantic search and retrieval.

Semantic searchHigh quality embeddingsMultilingualFast

E5 Large

335M8.7Microsoft

Text embedding model with strong performance on retrieval tasks.

Text embeddingsRetrievalSemantic searchEfficient

BGE M3

Multi-lingual, multi-functionality, multi-granularity embedding model.

Multi-lingualDense + sparse + multi-vectorVersatileHigh quality

GTE Large

General text embedding model with strong performance.

High qualityFastGeneral purposeEfficient

Instructor XL

Instruction-based embedding model for customizable representations.

Instruction-basedCustomizableVersatileHigh quality