AI Term

What is a Large Language Model (LLM)?

An AI model trained on massive amounts of text data that can understand, generate, and work with human language at a sophisticated level.

A Large Language Model (LLM) is an AI system trained on vast amounts of text data to understand and generate human language. These models power tools like ChatGPT, Claude, and Gemini.

What Makes Them "Large"

LLMs are characterized by:

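  • Training data: Hundreds of billions to trillions of words from books, websites, and other text
  • Parameters: Billions of numerical values the model adjusts during learning (OpenAI has not disclosed GPT-4's size; unofficial estimates put it at over 1 trillion parameters)
  • Compute: Training requires massive computing resources, often thousands of specialized chips (GPUs or similar) running for weeks or months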
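
To give a rough sense of what parameter counts mean in practice, the sketch below estimates the memory needed just to store a model's weights: parameter count times bytes per parameter. It assumes 2 bytes per parameter (16-bit numbers), which is a common choice but not a universal one. The function name and the example sizes are illustrative, not figures for any specific model.

```python
def weight_memory_gb(num_parameters: int, bytes_per_parameter: int = 2) -> float:
    """Approximate memory to hold the weights alone, in gigabytes (1 GB = 10**9 bytes)."""
    return num_parameters * bytes_per_parameter / 1e9


# Illustrative sizes only (assumption: 16-bit storage, 2 bytes per parameter).
example_sizes = {
    "7 billion": 7_000_000_000,
    "70 billion": 70_000_000_000,
    "1 trillion": 1_000_000_000_000,
}

for label, count in example_sizes.items():
    print(f"{label} parameters: about {weight_memory_gb(count):,.0f} GB")
```

Under that assumption, a 1-trillion-parameter model would need about 2,000 GB for its weights alone, which is one reason training and running such models takes clusters of specialized hardware.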

How LLMs Work

Training Phase

The model reads massive amounts of text and learns patterns: which words tend to follow which, how sentences are structured, and which topics are related.
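
The idea of learning "which words tend to follow which" can be shown with a toy word-pair counter. This is only a stand-in: real LLMs learn these patterns as parameters of a neural network rather than as a table of counts. The corpus and the function name `train_word_pair_counts` are made up for illustration.

```python
from collections import Counter, defaultdict


def train_word_pair_counts(text: str) -> dict:
    """Count, for every word, how often each other word comes right after it."""
    words = text.lower().split()
    follows = defaultdict(Counter)
    for current_word, next_word in zip(words, words[1:]):
        follows[current_word][next_word] += 1
    return follows


# Tiny made-up corpus; a real model reads billions of words.
corpus = "the cat sat on the mat . the cat ate the fish ."
counts = train_word_pair_counts(corpus)

# Words seen after "the", with how many times each was seen.
print(dict(counts["the"]))
```

In this corpus "cat" follows "the" twice, while "mat" and "fish" each follow it once, so the table already captures a simple pattern from the text.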

Prediction

When given text, the model predicts what should come next based on learned patterns, one token (a word or piece of a word) at a time. It's essentially very sophisticated autocomplete.
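
The autocomplete comparison can be sketched as a loop that keeps appending the most likely next word. This toy version looks only at the previous word and always picks the most frequent follower. Real LLMs consider much more context and usually sample from a probability distribution instead. The corpus and the helper names are hypothetical.

```python
from collections import Counter, defaultdict


def build_counts(text):
    """Count how often each word follows each other word in the text."""
    words = text.lower().split()
    counts = defaultdict(Counter)
    for current_word, next_word in zip(words, words[1:]):
        counts[current_word][next_word] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def autocomplete(counts, prompt, max_new_words=3):
    """Extend the prompt one word at a time, feeding each prediction back in."""
    words = prompt.lower().split()
    if not words:
        return ""
    for _ in range(max_new_words):
        next_word = predict_next(counts, words[-1])
        if next_word is None:
            break  # nothing learned about this word, so stop early
        words.append(next_word)
    return " ".join(words)


# Tiny made-up corpus standing in for the training text.
counts = build_counts("the cat sat on the mat . the cat sat on the sofa .")
print(autocomplete(counts, "the cat"))  # prints "the cat sat on the"
```

Each predicted word becomes part of the input for the next prediction, which is the same basic loop an LLM runs when it generates a reply.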

Fine-Tuning

After initial training, models are often trained further on smaller, curated datasets or on human feedback, either to adapt them to specific tasks or to reduce harmful outputs.
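
As a loose analogy only, fine-tuning can be pictured as continuing to train on a smaller, targeted batch of text so that the model's predictions shift. The toy below uses word-pair counts as a stand-in for parameters. Actual fine-tuning updates neural network weights, and the corpora and function names here are invented for illustration.

```python
from collections import Counter, defaultdict


def update_counts(counts, text):
    """Add the word-pair counts from `text` to an existing count table."""
    words = text.lower().split()
    for current_word, next_word in zip(words, words[1:]):
        counts[current_word][next_word] += 1


def most_likely_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word)
    return followers.most_common(1)[0][0] if followers else None


counts = defaultdict(Counter)

# "Initial training" on general text (toy corpus, made up for illustration).
update_counts(counts, "the cat sat . the cat ran . the dog sat .")
before = most_likely_next(counts, "the")

# "Fine-tuning": a smaller, targeted batch of text shifts the same counts.
update_counts(counts, "the dog barked . the dog ran .")
after = most_likely_next(counts, "the")

print(before, "->", after)  # prints "cat -> dog"
```

The model is not rebuilt from scratch: the same table is adjusted, and a small amount of targeted text is enough to change what it predicts after "the".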

Capabilities

LLMs can:

  • Answer questions on almost any topic
  • Write in many styles and formats
  • Translate between languages
  • Summarize long documents
  • Write and explain code
  • Engage in multi-turn conversations

Key LLMs

| Model | Company | Notes |
|-------|---------|-------|
| GPT-4 | OpenAI | Powers ChatGPT Plus |
| Claude | Anthropic | Known for nuanced responses |
| Gemini | Google | Integrated with Google services |
| LLaMA | Meta | Open-source family |

Limitations

  • Don't truly "understand" language; they predict likely text
  • Can generate false information confidently
  • Training data has cutoff dates
  • Can reflect biases in training data

The Revolution

LLMs represent a step-change in AI capability. For the first time, AI can engage with human language in a way that feels natural and useful for a wide range of tasks.

Examples

GPT-4, Claude, Gemini, LLaMA
