Large Language Model (LLM)

Also known as: LLM, Foundation model, Frontier model

A neural network trained on massive text corpora to predict the next token, used for chat, coding, reasoning and as the brain inside AI agents.

A large language model (LLM) is a neural network — almost always a Transformer — trained on terabytes of text and code to predict the next token in a sequence. After pre-training, models are fine-tuned with supervised data and reinforcement learning from human feedback (RLHF) to follow instructions, refuse harmful requests, and behave like an assistant.

Modern LLMs scale to hundreds of billions or trillions of parameters. Capabilities emerge with scale: small models do autocomplete, mid-size models hold a conversation, and the largest frontier models (GPT-5, Claude Opus 4, Gemini 2.5 Pro) can reason across long documents, use tools, and act as the planning layer inside an AI agent.

LLMs are priced per million tokens — a token is roughly 0.75 of an English word. Input tokens (your prompt) are typically 4–8× cheaper than output tokens (the model's reply).

See also on SoftPerceptron

Related terms

More to explore

Other wiki entries that touch on Large Language Model (LLM).