AI Glossary

What is Large Language Model?

A Large Language Model (LLM) is a type of artificial intelligence trained on massive amounts of text data to understand, generate, and reason about human language. LLMs use transformer architectures with billions of parameters, enabling them to perform tasks like writing, translation, coding, summarization, and question answering. Examples include GPT-4, Claude, Gemini, and Llama. LLMs work by predicting the next token in a sequence, learning statistical patterns across language during pre-training, then being fine-tuned for specific tasks or aligned with human preferences through techniques like RLHF.

Related Terms

Applied Computing launches Orbital foundation model for full oil and gas facility operations TechCrunch · Jul 16 What Anthropic’s latest AI discovery does—and doesn’t—show MIT Tech Review · Jul 13 Open Source AI Matters More Than Ever, Says Hugging Face CEO Clem Delangue TechCrunch · Jul 10 Anthropic uncovers hidden J-space in Claude that reveals what it will say next MIT Tech Review · Jul 9 SkyPilot enables zero-egress AI workloads with Hugging Face storage across any cloud Hugging Face · Jul 7

Frequently Asked Questions

What is an LLM?

An LLM (Large Language Model) is an AI system trained on vast text datasets to understand and generate human-like text. They power tools like ChatGPT, Claude, and Gemini.

How do LLMs work?

LLMs predict the next word (token) in a sequence using transformer neural networks. They learn language patterns from billions of text examples during training.