Chunking is the process of breaking a large document into smaller, discrete segments before storing them in a vector database or retrieval system. Each chunk is embedded independently and retrieved as a unit when the chunk’s meaning matches a query.
How a document is chunked determines what gets retrieved. A chunk that contains a complete, self-contained answer is more likely to be retrieved and cited than a chunk that cuts a sentence in half or buries an answer in the middle of a paragraph. For content strategists, chunking is the retrieval-layer explanation for why self-contained paragraphs and clear heading structure matter — the structural choices that make content readable for humans also make it chunk-friendly for AI systems.