← Glossary

Multimodal

Concept

Definition

A model capable of processing and generating multiple types of data — text, images, audio, and video — within a single unified architecture. Examples include GPT-4o, Gemini, Claude (vision), and Sora.