What is Tokenizer? — AI Glossary 2026

Definition

The component that converts raw text into tokens (integer IDs) that the model processes. Most modern LLMs use Byte-Pair Encoding (BPE) or similar subword algorithms. Token count determines cost and fits within the context window limit.

Related Terms

Context Window

The maximum number of tokens a model can process in a single call — including both the input (prompt) and output (completion). Larger windows allow processing entire codebases, books, or long conversations. Measured in tokens, not characters.

Embedding

A dense numerical vector that represents text, images, or other data in a high-dimensional space where semantic similarity maps to geometric closeness. Foundation of semantic search, RAG systems, and recommendation engines.

LLM (Large Language Model)

A neural network trained on massive text corpora to predict the next token, resulting in emergent abilities like reasoning, coding, and language understanding. Examples include GPT-4, Claude, Gemini, and Llama. Scale in parameters ranges from billions to trillions.

Articles about Tokenizer

llama-benchy brings llama-bench tests to APIs

How to Fine-Tune LLMs with SFT, LoRA, and RLHF

How to Build AI Research Foundations with DeepMind

GitHub repo turns AI theory into 435 build lessons

ConvexTok Reframes Tokenization as Optimization

Definition

Related Terms

Articles about Tokenizer

All Terms