What is a large language model (LLM)?
A large language model (LLM) is a type of artificial intelligence model trained on massive volumes of text data that can understand and generate human language with remarkable fluency and versatility. LLMs are the foundation of most modern generative AI applications: they power chatbots, AI copilots, coding assistants, document analysis tools, and natural language interfaces to data and systems.
The “large” in large language model refers to two things: the volume of training data (hundreds of billions to trillions of tokens of text) and the number of parameters in the model (billions to hundreds of billions of learned weights that encode the patterns the model has extracted from that data). Scale, combined with the transformer architecture introduced in 2017, is what produces the emergent capabilities that distinguish LLMs from earlier natural language processing systems.
How large language models work
LLMs are trained using a process called self-supervised learning on large text corpora. The core training objective is next-token prediction: given a sequence of text, the model learns to predict the next word (or more precisely, the next token, where a token is roughly a word or word-fragment). This apparently simple task, applied at enormous scale to a sufficiently diverse text corpus, produces models that develop sophisticated internal representations of language structure, world knowledge, reasoning patterns, and domain-specific terminology.
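As a toy illustration of this objective, the sketch below uses word-level "tokens" and a bigram count model in place of the subword tokenizer and transformer a real LLM uses; the training signal, cross-entropy on the actual next token, is the same idea at a vastly smaller scale:

```python
import math
from collections import Counter, defaultdict

# Toy corpus with word-level "tokens" (real LLMs use subword tokens).
corpus = "the cat sat on the mat the cat ate".split()

# A bigram count model: the simplest possible next-token predictor.
counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1

def next_token_probs(prev):
    """Probability distribution over the next token, given the previous one."""
    c = counts[prev]
    total = sum(c.values())
    return {tok: n / total for tok, n in c.items()}

# Training minimizes cross-entropy (negative log-likelihood) of the
# actual next token under the model's predicted distribution.
probs = next_token_probs("the")   # {"cat": 2/3, "mat": 1/3}
loss = -math.log(probs["cat"])    # lower loss = better prediction
```

A real LLM replaces the bigram table with a neural network conditioned on the entire preceding sequence, but it is trained against exactly this kind of loss.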
The transformer architecture that underlies modern LLMs processes text through a mechanism called attention, which allows the model to weight the relevance of different parts of the input sequence when generating each output token. Attention enables LLMs to handle long-range dependencies in text — understanding that a pronoun in sentence ten refers to a noun in sentence two, for example — something earlier recurrent architectures could not do reliably.
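A minimal sketch of the scaled dot-product attention at the heart of this mechanism, using toy 2-D vectors (real models use high-dimensional learned projections and many attention heads in parallel):

```python
import math

def softmax(xs):
    """Turn raw scores into a probability distribution."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention: each query scores every key,
    and the output is the score-weighted average of the values."""
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, values))
                    for i in range(len(values[0]))])
    return out

# One query attends over three positions; it is most similar to the
# first key, so the output leans toward the first value.
q = [[1.0, 0.0]]
k = [[1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
v = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
out = attention(q, k, v)
```

Because every position can attend to every other position in a single step, long-range references are no harder for the model than adjacent ones, which is the property the paragraph above describes.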
After pre-training on large text corpora, LLMs are typically further refined through instruction tuning and reinforcement learning from human feedback (RLHF), which aligns the model’s outputs with human preferences for helpfulness, accuracy, and appropriate behavior. This fine-tuning stage is what transforms a raw language model into a useful assistant.
At inference time, LLMs generate text autoregressively: they produce one token at a time, with each token conditioned on all previous tokens in the sequence. The result is text that flows naturally, responds appropriately to context, and can sustain coherent reasoning across multi-paragraph outputs.
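The autoregressive loop can be sketched as follows; `toy_model` here is a hypothetical stand-in for the transformer, which in a real system computes a next-token distribution conditioned on the full sequence so far:

```python
import random

# Stand-in for a real model: maps the sequence so far to a next-token
# distribution. A real LLM computes this with a transformer over all
# preceding tokens, not just the last one.
def toy_model(tokens):
    table = {
        "<s>": {"the": 1.0},
        "the": {"cat": 0.7, "dog": 0.3},
        "cat": {"sat": 0.6, "ran": 0.4},
        "dog": {"sat": 0.5, "ran": 0.5},
        "sat": {"<eos>": 1.0},
        "ran": {"<eos>": 1.0},
    }
    return table[tokens[-1]]

def generate(max_tokens=10, seed=0):
    """Autoregressive decoding: sample one token at a time, each
    conditioned on everything generated so far, until end-of-sequence."""
    rng = random.Random(seed)
    tokens = ["<s>"]
    for _ in range(max_tokens):
        probs = toy_model(tokens)
        choices, weights = zip(*probs.items())
        tok = rng.choices(choices, weights=weights)[0]
        if tok == "<eos>":
            break
        tokens.append(tok)
    return tokens[1:]
```

Sampling from the distribution (rather than always taking the most likely token) is what makes outputs varied; production systems control this trade-off with parameters such as temperature.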
LLMs vs. foundation models vs. generative AI
These three terms are related but not interchangeable.
A foundation model is any large AI model trained on broad data at scale that can be adapted to a wide range of downstream tasks. Foundation models include LLMs (trained on text), vision-language models (trained on text and images), and multimodal models (trained on multiple data types). LLMs are a type of foundation model.
A large language model is specifically a foundation model trained primarily on text data, capable of understanding and generating natural language. All LLMs are foundation models, but not all foundation models are LLMs.
Generative AI is the broader category of AI models that produce new content. LLMs are the most widely deployed type of generative AI for text, but generative AI also includes image generation models (often diffusion models), audio generation models, and video generation models. See also: generative AI.
Key capabilities of large language models
Text generation and completion. LLMs generate coherent, contextually appropriate text from a prompt: writing emails, drafting reports, creating code, producing creative content, and completing partial documents.
Question answering and information retrieval. LLMs answer questions based on information in their training data or provided in context. Combined with retrieval systems (RAG architecture), they can answer questions grounded in current, organization-specific documents and data.
Summarization. LLMs condense long documents, conversation histories, or data sets into concise summaries, extracting key points and presenting them in structured or narrative form.
Classification and extraction. LLMs classify text into categories, extract structured information from unstructured documents, and identify named entities, relationships, and key data points in natural language text.
Reasoning and analysis. Modern LLMs can reason through multi-step problems, evaluate arguments, identify inconsistencies, and produce structured analytical outputs from complex inputs. This capability underpins the use of LLMs as the reasoning core of AI agents.
Code generation. LLMs trained on code can generate, explain, debug, and translate code across programming languages, accelerating software development and enabling non-technical users to automate tasks.
LLMs in enterprise applications
LLMs are the language and reasoning layer of most enterprise AI copilots and agents. They provide the natural language interface through which practitioners interact with AI systems, and the reasoning capability that allows those systems to handle complex, variable inputs that rule-based automation cannot.
In enterprise settings, LLMs are rarely deployed as unmodified general-purpose models. They are adapted through several techniques to make them more useful and reliable in specific domains.
Fine-tuning further trains a pre-trained LLM on domain-specific data, adjusting the model’s weights to improve performance on the target domain’s language, entities, and tasks. A financial services LLM fine-tuned on compliance documents and investigation records will outperform a general LLM on compliance-specific tasks.
Retrieval-augmented generation (RAG) connects an LLM to a retrieval system that fetches relevant documents, records, or data at inference time and provides them as context. This grounds the LLM’s outputs in current, accurate, organization-specific information rather than relying on training data alone, significantly reducing hallucination risk.
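A minimal sketch of the RAG pattern, with keyword-overlap scoring standing in for the embedding-based vector search a production system would use (the function names, document texts, and prompt wording are invented for illustration):

```python
# Keyword-overlap retrieval: score each document by how many query
# words it shares, and keep the top k. Production RAG systems use
# embeddings and vector search instead, but the pattern is the same.
def retrieve(query, documents, k=2):
    q_words = set(query.lower().split())
    scored = sorted(documents,
                    key=lambda d: len(q_words & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def build_prompt(query, documents):
    """Prepend retrieved documents so the model answers from them,
    not from its training data alone."""
    context = "\n".join(retrieve(query, documents))
    return (f"Answer using only the context below.\n\n"
            f"Context:\n{context}\n\n"
            f"Question: {query}")

docs = [
    "Policy 14 requires two approvals for wire transfers over 10000 USD.",
    "The cafeteria closes at 15:00 on Fridays.",
    "Wire transfers are logged in the payments audit system.",
]
prompt = build_prompt("What approvals do wire transfers need?", docs)
```

The grounding comes from the prompt construction: the relevant policy documents travel to the model at inference time, so its answer can cite current, organization-specific facts.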
Industrial LLMs go further than fine-tuning: they are trained specifically on industrial domain data from the ground up. SymphonyAI’s Industrial LLM is purpose-built for industrial operations, trained on industrial text, sensor data documentation, and domain knowledge to provide a language model that genuinely understands the industrial domain rather than approximating it from general text training.
Limitations of large language models
LLMs are powerful but have well-documented limitations that enterprise teams need to manage.
Hallucination. LLMs can generate plausible-sounding text that is factually incorrect. This is a fundamental property of probabilistic text generation, not a bug that can be fully eliminated. Mitigations include RAG (grounding outputs in verified sources), output verification workflows, and human review for high-stakes outputs.
Knowledge cutoff. LLMs are trained on data up to a cutoff date. They have no knowledge of events after that date unless provided with current information in context. RAG and tool use (allowing the model to query current databases) address this limitation.
Context window limits. LLMs process a finite amount of text at once (the context window). Very long documents or very long conversation histories may exceed this limit, requiring chunking strategies. Context windows have grown substantially with each model generation, making this a diminishing constraint.
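One common chunking strategy is a sliding window with overlap, so content near a chunk boundary still appears with some surrounding context. A sketch, with sizes chosen for illustration rather than tied to any particular model:

```python
def chunk_tokens(tokens, chunk_size=512, overlap=64):
    """Split a long token sequence into overlapping windows so each
    chunk fits within a model's context limit."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break
    return chunks

# A 1,200-token document split into 512-token chunks with 64 tokens
# of overlap between consecutive chunks.
chunks = chunk_tokens(list(range(1200)), chunk_size=512, overlap=64)
```

Each chunk is then processed (or retrieved) independently, and the results are merged, which is how long documents are handled when they exceed the window.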
General-purpose limitations in specialized domains. A general LLM applied to a highly specialized domain without domain adaptation will produce outputs that are generic, sometimes incorrect, and often miss the precise terminology and reasoning patterns of the domain. This is the core argument for vertical AI: domain-specific models built on or adapted from LLMs outperform general-purpose LLMs on domain-specific tasks.
Frequently asked questions
What is a large language model in simple terms?
A large language model is an AI system trained on enormous amounts of text that can read, understand, and generate human language. LLMs power applications like ChatGPT, Claude, and Gemini, and are the underlying technology in AI copilots, coding assistants, document analysis tools, and natural language interfaces to data systems.
What is the difference between an LLM and a chatbot?
A large language model is the underlying AI technology; a chatbot is an application built on top of that technology. An LLM is a general-purpose language model capable of many tasks. A chatbot is an interface specifically designed for conversational interaction, typically built using an LLM as its core but with additional layers for managing conversation flow, connecting to external systems, and shaping the model’s behavior for a specific purpose.
What is the difference between an LLM and a foundation model?
A foundation model is any large AI model trained on broad data at scale that can be adapted to many downstream tasks. LLMs are a type of foundation model — specifically, foundation models trained on text. Other foundation models include vision models, multimodal models, and audio models. All LLMs are foundation models, but not all foundation models are LLMs.
How are LLMs used in enterprise AI?
In enterprise AI, LLMs serve as the language and reasoning layer for copilots, agents, and natural language interfaces. They are typically adapted for specific domains through fine-tuning or retrieval-augmented generation (RAG), which grounds their outputs in organization-specific data. Enterprise LLM applications include compliance report drafting, knowledge management, service desk automation, code generation, document analysis, and natural language access to operational data.
What is an industrial LLM?
An industrial LLM is a large language model trained specifically on industrial domain data: technical documentation, maintenance records, sensor data annotations, engineering specifications, and industrial process knowledge. Unlike a general LLM adapted to industrial tasks, an industrial LLM is built from the ground up with industrial domain understanding. SymphonyAI’s Industrial LLM is purpose-built for industrial AI applications within the IRIS Foundry platform.
Last updated: May 2026