Back to Models

Llama 3 (70B)

Meta
Released: 2024-04-18
Type: Open Source
Context
8k
Max Out
8.2k
Cutoff
2024-03
Params
70B

Evolution: What Changed?

  • Massive reasoning jump
  • Vocabulary size increased
  • Reduced refusal rate
Compared to llama-2-70b

Evolution: What Changed?

  • Massive reasoning jump
  • Vocabulary size increased
  • Reduced refusal rate
compared to previous generation

The Breakdown

The Llama 3 (70B) represents a significant leap forward in Meta's lineup. Released in 2024-04-18, it targets developers and enterprise use cases with a specific focus on reasoning capabilities over raw conversational speed. While previous iterations in the Open Source category often struggled with complex multi-step instruction following, this model introduces a refined architecture that dramatically improves adherence to system prompts and reduces hallucination rates in technical domains. It competes directly with top-tier frontier models but carves out a distinct niche for workflows where precision and context retention matter more than creative flair. For businesses looking to integrate reliable AI agents, Llama 3 (70B) offers a compelling balance of performance and cost-efficiency.

Overall Score
9.3
/10
Pricing (per 1k tokens)
Input$0
Output$0
Currency: USD

The Good

  • GPT-4 class performance for free
  • Extremely dense knowledge
  • improved tokenizer

The Bad

  • 8k context limit at launch

The Verdict

The current open source champion. Runs on dual 3090s.

Performance Benchmarks

mmlu
82
human eval
60
math
50