Catch up on keynotes, technical sessions, and announcements from Microsoft Build 2026.
Overview
Accelerate multimodal AI app and agent development
Azure Content Understanding helps enterprises transform unstructured multimodal data into insights.
- Derive meaningful insights from diverse types of input data, ranging from text, documents, audio, images, video and AI agents. Leverage powerful document AI engine that brings together industry- leading Azure Document Intelligence models and the newest generative AI capabilities.
- Achieve precise, high-quality data for downstream applications with sophisticated AI methods such as scheme extraction and grounding.
- Streamline and unify pipelines of varied data types into a single streamlined workflow, reducing overall costs and accelerating time to value.
- See how businesses and call center operators generate valuable insights from call recordings to track essential KPIs, enhance product experiences, and respond to customer inquiries more swiftly and accurately.
Features
Multimodal AI: Transforming data into insights
Multimodal data ingestion
Ingest various modalities—such as documents, images, audio, or video—and use a range of AI models available in Microsoft Foundry to transform input data into structured output that can be easily processed and analyzed by downstream applications.
Customizable output schemas
Customize the schemas of extracted results to meet your specific needs. Tailor the format and structure of summaries, insights, or features to include only the most relevant details—such as key points or timestamps—from video or audio files.
Confidence scores
Utilize confidence scores to reduce human intervention and increase accuracy with continuous improvement through user feedback.
Output ready for downstream applications
Automate business processes by building enterprise AI apps or agentic workflows. Use outputs that downstream applications can consume for reasoning with retrieval-augmented generation (RAG).
Grounding
Ensure the information extracted, inferred, or abstracted is represented in the underlying content.
Bring your own model
Empower your enterprise with the new bring your own (BYO) option in Azure Content Understanding—designed for flexibility, control, and cost transparency. Seamlessly integrate your own Azure subscription and models while preserving multimodal performance across documents, images, audio, and video.
Complete Document AI
Brings together traditional document intelligence and advanced LLM-based understanding capabilities, enabling highest quality information extraction across structured and unstructured content.
SECURITY
Built-in security and compliance
Pricing
Flexible pricing to meet your needs
Pay for only what you use. There are no upfront costs with Azure Content Understanding pay-as-you-go pricing. Pricing is based on input (content extraction) and output (field extraction) costs.
Use cases
Apply Content Understanding to a variety of use cases
Discover how others are leveraging Content Understanding.
CUSTOMER STORIES
See how customers are innovating with Azure
FAQ
Frequently asked questions
- Azure Content Understanding in Foundry Tools is Microsoft's comprehensive content AI service, bringing together multiple approaches to document and content processing under one product. Content Understanding takes diverse types of input data—text, audio, images, documents, and video—and enables organizations to build generative AI solutions seamlessly with the latest models available. Azure Content Understanding brings together our document intelligence and multimodal content understanding capabilities, enabling both traditional and LLM-based approaches to information extraction across structured and unstructured content.
- Azure Content Understanding—formerly part of Azure AI services—is now Azure Content Understanding in Foundry Tools. This change is part of a broader platform unification under Microsoft Foundry, aligning these services with how today’s developers build intelligent, agentic applications.
Content Understanding still offers the same powerful capabilities, like extracting structured insights from multimodal data (text, audio, images, video), grounding information, and automating schema generation. But it’s now a core tool within the Foundry ecosystem, designed to work seamlessly with other tools like Speech, Translator, and Document Intelligence.
This renaming helps clarify how Content Understanding fits into the Foundry platform, making it easier to discover, orchestrate, and integrate into modern AI workflows for document automation, customer insights, and agentic reasoning. - Content Understanding enables developers to incorporate data types across modalities simultaneously into their existing apps and deploy custom models for their enterprise. It significantly simplifies generative AI solution development for multimodal scenarios and removes the manual effort to switch to the latest model upon release. It accelerates time-to-value with multiple modalities simultaneously analyzed in a unified workflow.
- Check out Content Understanding in the Microsoft Foundry.
- Pay for only what you use. There are no upfront costs with Content Understanding pay-as-you-go pricing. Pricing is based on input (content extraction) and output (field extraction) costs.
- Azure Document Intelligence (now part of Azure Content Understanding) is a Foundry Tool that applies specialized AI neural models to extract text, key-value pairs, tables, and structures from documents automatically and accurately. Azure Document Intelligence provides high-accuracy and reliable deterministic extraction of structured and templated documents while Azure Content Understanding also offers LLM-powered analyzers for complex, unstructured, and multimodal content. Together, they make it easier to prepare data for intelligent agents and applications that can read, analyze, and respond to real-world content with precision and speed. For detailed guidance, see Choose the right Azure AI tool for document processing.
Next Steps
Choose the Azure account that’s right for you
Pay as you go or try Azure free for up to 30 days.
AI development tools
Design and manage AI applications
Create, customize, and scale AI apps and agents efficiently.
Products
Find the right AI products for your needs
Create the next generation of applications using artificial intelligence capabilities for any developer and any scenario.