AI LLMs

Large Language Model (LLM)

Definition

A large language model is a neural network trained on massive text datasets to understand and generate human language. LLMs power chatbots, code assistants, and content generation tools by predicting the most likely next token in a sequence. Models like GPT-4, Claude, and Gemini are prominent examples with hundreds of billions of parameters.

How It Works

LLMs are trained using self-supervised learning on internet-scale text corpora, learning statistical patterns between words and concepts. During inference, the model generates text token by token, using attention mechanisms to maintain context across long passages. Fine-tuning and RLHF (reinforcement learning from human feedback) align the model's outputs with user expectations.