Large Language Model (LLM)
Definition
A large language model is a neural network trained on massive text datasets to understand and generate human language. LLMs power chatbots, code assistants, and content generation tools by predicting the most likely next token in a sequence. Models like GPT-4, Claude, and Gemini are prominent examples with hundreds of billions of parameters.
How It Works
LLMs are trained using self-supervised learning on internet-scale text corpora, learning statistical patterns between words and concepts. During inference, the model generates text token by token, using attention mechanisms to maintain context across long passages. Fine-tuning and RLHF (reinforcement learning from human feedback) align the model's outputs with user expectations.
Key Tools
GPT (OpenAI)Industry-leading large language models powering ChatGPT
$20/mo (ChatGPT Plus)Claude (Anthropic)Safe, helpful AI assistant with extended context and reasoning
$20/mo (Pro)Gemini (Google)Google's multimodal AI model family
$19.99/mo (Advanced)Llama (Meta)Open-source large language models from Meta
Free (open source)MistralEuropean AI lab building efficient open and commercial LLMs
Usage-based API