This is the Trace Id: e8c984f991af0e700fe2051485c5aa81
Skip to main content
Azure

Catch up on keynotes, technical sessions, and announcements from Microsoft Build 2026.

Explore event highlights Bg image

Azure Content Understanding in Foundry Tools

Accelerate multimodal AI agent development.
Overview

Accelerate multimodal AI app and agent development

Azure Content Understanding helps enterprises transform unstructured multimodal data into insights.
  • Derive meaningful insights from diverse types of input data, ranging from text, documents, audio, images, video and AI agents. Leverage powerful document AI engine that brings together industry- leading Azure Document Intelligence models and the newest generative AI capabilities.
  • Achieve precise, high-quality data for downstream applications with sophisticated AI methods such as scheme extraction and grounding.
  • Streamline and unify pipelines of varied data types into a single streamlined workflow, reducing overall costs and accelerating time to value.
  • See how businesses and call center operators generate valuable insights from call recordings to track essential KPIs, enhance product experiences, and respond to customer inquiries more swiftly and accurately.
Features

Multimodal AI: Transforming data into insights

Multimodal data ingestion

Ingest various modalities—such as documents, images, audio, or video—and use a range of AI models available in Microsoft Foundry to transform input data into structured output that can be easily processed and analyzed by downstream applications.

Customizable output schemas

Customize the schemas of extracted results to meet your specific needs. Tailor the format and structure of summaries, insights, or features to include only the most relevant details—such as key points or timestamps—from video or audio files.

Confidence scores

Utilize confidence scores to reduce human intervention and increase accuracy with continuous improvement through user feedback.

Output ready for downstream applications

Automate business processes by building enterprise AI apps or agentic workflows. Use outputs that downstream applications can consume for reasoning with retrieval-augmented generation (RAG).

Grounding

Ensure the information extracted, inferred, or abstracted is represented in the underlying content.

Bring your own model

Empower your enterprise with the new bring your own (BYO) option in Azure Content Understanding—designed for flexibility, control, and cost transparency. Seamlessly integrate your own Azure subscription and models while preserving multimodal performance across documents, images, audio, and video.

Complete Document AI

Brings together traditional document intelligence and advanced LLM-based understanding capabilities, enabling highest quality information extraction across structured and unstructured content.
SECURITY

Built-in security and compliance

80K Foundry is used by developers at more than 80,000 enterprises and digital natives, including 80% of Fortune 500 companies. 3B daily enterprise search queries Watch the video 11K+ Foundry Models to choose from—see why Microsoft Phi on Foundry Models has over 60 million downloads. Learn more
A woman standing and holding tablet in her hand
Pricing

Flexible pricing to meet your needs

Pay for only what you use. There are no upfront costs with Azure Content Understanding pay-as-you-go pricing. Pricing is based on input (content extraction) and output (field extraction) costs.
FAQ

Frequently asked questions

  • Azure Content Understanding in Foundry Tools is Microsoft's comprehensive content AI service, bringing together multiple approaches to document and content processing under one product. Content Understanding takes diverse types of input data—text, audio, images, documents, and video—and enables organizations to build generative AI solutions seamlessly with the latest models available. Azure Content Understanding brings together our document intelligence and multimodal content understanding capabilities, enabling both traditional and LLM-based approaches to information extraction across structured and unstructured content.
  • Azure Content Understanding—formerly part of Azure AI services—is now Azure Content Understanding in Foundry Tools. This change is part of a broader platform unification under Microsoft Foundry, aligning these services with how today’s developers build intelligent, agentic applications.

    Content Understanding still offers the same powerful capabilities, like extracting structured insights from multimodal data (text, audio, images, video), grounding information, and automating schema generation. But it’s now a core tool within the Foundry ecosystem, designed to work seamlessly with other tools like Speech, Translator, and Document Intelligence.

    This renaming helps clarify how Content Understanding fits into the Foundry platform, making it easier to discover, orchestrate, and integrate into modern AI workflows for document automation, customer insights, and agentic reasoning.
  • Content Understanding enables developers to incorporate data types across modalities simultaneously into their existing apps and deploy custom models for their enterprise. It significantly simplifies generative AI solution development for multimodal scenarios and removes the manual effort to switch to the latest model upon release. It accelerates time-to-value with multiple modalities simultaneously analyzed in a unified workflow.
  • Check out Content Understanding in the Microsoft Foundry.
  • Pay for only what you use. There are no upfront costs with Content Understanding pay-as-you-go pricing. Pricing is based on input (content extraction) and output (field extraction) costs.
  • Azure Document Intelligence (now part of Azure Content Understanding) is a Foundry Tool that applies specialized AI neural models to extract text, key-value pairs, tables, and structures from documents automatically and accurately. Azure Document Intelligence provides high-accuracy and reliable deterministic extraction of structured and templated documents while Azure Content Understanding also offers LLM-powered analyzers for complex, unstructured, and multimodal content. Together, they make it easier to prepare data for intelligent agents and applications that can read, analyze, and respond to real-world content with precision and speed. For detailed guidance, see Choose the right Azure AI tool for document processing.
A woman sitting at a table using a laptop.
Next Steps

Choose the Azure account that’s right for you

Pay as you go or try Azure free for up to 30 days.
Two people are discussing, and a woman in a green shirt with short curly hair is smiling.
AI development tools

Design and manage AI applications

Create, customize, and scale AI apps and agents efficiently.
Three people are gathered around a laptop in a modern office setting, engaged in discussion.
Products

Find the right AI products for your needs

Create the next generation of applications using artificial intelligence capabilities for any developer and any scenario.