GPT-4

Definition

GPT-4, released in March 2023, is a large language model developed by OpenAI. It substantially outperforms its predecessors, such as GPT-3.5, in reasoning, coding, and complex problem-solving.

Why It Matters

GPT-4 is widely used as a reference point for how quickly AI capabilities are advancing. Some researchers have argued that its emergent behaviors hint at an early form of artificial general intelligence, and its performance is often cited as support for scaling laws.

Core Concepts

  • Transformer Engine: Built upon the self-attention mechanism of the Transformer Architecture.
  • Sparks of AGI: GPT-4 showed abilities that appear to go beyond simple pattern matching, such as solving novel logical puzzles. OpenAI also reported that it scored around the top 10% of test takers on a simulated bar exam (Sparks of AGI).
  • Scaling Laws: Its success is often cited as strong evidence for the “scaling laws”: the observation that model performance improves predictably as compute, data, and parameter count grow, sometimes accompanied by qualitatively new abilities (Scaling Laws (AI)).
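
The self-attention mechanism named under Transformer Engine can be sketched in a few lines. This is a minimal, illustrative version of scaled dot-product attention in pure Python, not GPT-4's implementation: OpenAI has not published GPT-4's architecture details, and a real Transformer layer also applies learned query, key, and value projections and uses multiple heads. The names `softmax`, `self_attention`, and `tokens` are assumptions made for this example.

```python
import math


def softmax(xs):
    # Subtract the max before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Each output is the attention-weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


# Hypothetical toy input: three "token" vectors attending to each other.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens, tokens, tokens)
```

Each row of `attended` is a blend of all three input vectors, weighted by how strongly that token's query matches every key. This is how attention lets each position draw on context from the whole sequence.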

Connected Concepts