Speech to Text

A Speech service feature that accurately transcribes spoken audio to text

Make spoken audio actionable

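Quickly and accurately transcribe audio to text in more than 85 languages and variants. Customize models to improve accuracy for domain-specific terminology. Get more value from spoken audio by enabling search or analytics on transcribed text, or by facilitating action, all in your preferred programming language.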

High-quality transcription

Get accurate audio to text transcriptions with state-of-the-art speech recognition.

Customizable models

Add specific words to your base vocabulary or build your own speech-to-text models.

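Flexible deployment

Run Speech to Text anywhere, in the cloud or at the edge, using containers.

Production-ready

Access the same robust technology that powers speech recognition across Microsoft products.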

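Try Speech to Text with this demo app, built on the JavaScript SDK

To try the demo with your own voice using a microphone, switch to a browser with WebRTC support, such as the latest version of Microsoft Edge, Firefox, or Chrome.

Your speech data will not be stored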

Accurately transcribe speech from various sources

Convert audio to text from a range of sources, including microphones, audio files, and blob storage. Use speaker diarization to determine who said what and when. Get readable transcripts with automatic formatting and punctuation.
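As a rough illustration of what "who said what and when" can look like once transcription is done, here is a minimal Python sketch that turns diarized segments into a readable transcript. The `Segment` type, its field names, and the speaker labels are assumptions made for this example; they are not the Speech service's actual response format, and the sketch does not call the service.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """Hypothetical diarized segment; not the Speech service's real schema."""
    speaker: str          # label assigned by diarization (assumed format)
    start_seconds: float  # offset of the segment from the start of the audio
    text: str             # recognized, punctuated text


def format_timestamp(seconds: float) -> str:
    """Render an offset in seconds as MM:SS, truncating fractions."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def format_transcript(segments: List[Segment]) -> List[str]:
    """Order segments by time and merge consecutive turns by the same speaker."""
    lines: List[str] = []
    previous_speaker = None
    for seg in sorted(segments, key=lambda s: s.start_seconds):
        if seg.speaker == previous_speaker:
            # Same speaker is still talking: extend the current line.
            lines[-1] += " " + seg.text
        else:
            stamp = format_timestamp(seg.start_seconds)
            lines.append(f"[{stamp}] {seg.speaker}: {seg.text}")
        previous_speaker = seg.speaker
    return lines


if __name__ == "__main__":
    demo = [
        Segment("Guest-2", 4.2, "Hi, thanks."),
        Segment("Guest-1", 0.0, "Hello."),
        Segment("Guest-2", 6.0, "Glad to be here."),
    ]
    for line in format_transcript(demo):
        print(line)
```

Merging consecutive turns keeps the transcript compact; a caller that needs per-utterance timestamps would skip the merge step.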

Customize speech models to fit your needs

Tailor your speech models to understand organization- and industry-specific terminology. Overcome speech recognition barriers such as background noise, accents, or unique vocabulary. Customize your models by uploading audio data and transcripts. Automatically generate custom models using Office 365 data to optimize speech recognition accuracy for your organization.

Deploy anywhere

Run Speech to Text wherever your data resides. Build speech applications that take advantage of robust cloud capabilities, or run them on-premises using containers.

Comprehensive privacy and security

  • Speech is part of Azure Cognitive Services and is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO.
  • Your data remains yours. Your audio input and transcription data aren't logged during audio processing.
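  • You can view and delete your custom speech data and models at any time. Your data is encrypted while it's in storage.
  • Speech is backed by Azure infrastructure, which offers enterprise-grade security, availability, compliance, and manageability.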

Flexible pricing gives you the control you need

With Speech to Text, pay as you go based on the number of hours of audio you transcribe, with no upfront costs.
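To make the pay-as-you-go arithmetic concrete, the sketch below estimates a bill from a list of audio durations. The hourly rate is a placeholder chosen for the example, not an actual Azure price, and the sketch assumes billing is simply proportional to total audio time; check the pricing page for real rates and billing granularity.

```python
from decimal import Decimal, ROUND_HALF_UP
from typing import Iterable

SECONDS_PER_HOUR = Decimal(3600)


def estimate_cost(audio_seconds: Iterable[int], rate_per_hour: Decimal) -> Decimal:
    """Estimate cost as (total audio hours) x (hourly rate), rounded to cents.

    rate_per_hour is a hypothetical figure supplied by the caller.
    """
    total_seconds = Decimal(sum(audio_seconds))
    hours = total_seconds / SECONDS_PER_HOUR
    return (hours * rate_per_hour).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # Two recordings: 30 minutes and 90 minutes, at a placeholder rate.
    recordings = [1800, 5400]
    placeholder_rate = Decimal("1.00")  # NOT a real price
    print(estimate_cost(recordings, placeholder_rate))
```

Using `Decimal` rather than floats avoids rounding surprises when amounts are summed across many recordings.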

Documentation and resources

Get started

Browse the documentation

Create a speech service with the Microsoft Learn course

Explore code samples

Check out our sample code

Check out customization resources

Customize your voice-to-text solution with Speech Studio. No code required.

Businesses that trust Speech to Text

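KPMG streamlines call transcription

KPMG uses Speech to Text to transcribe and catalog thousands of hours of calls, reducing compliance costs for its clients by 80 percent.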

KPMG

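Motorola uses speech technology to help first responders get critical data

Motorola Solutions helps police officers and other first responders get critical information faster through a voice-enabled virtual assistant.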

Motorola Solutions

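Universal Electronics delivers voice-enabled smart home experiences

Universal Electronics helps brands offer voice-enabled navigation and control for everyday devices in the home, giving consumers a truly unique experience.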

Universal Electronics

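Hochtief uses speech to document construction defects

Hochtief helps project managers identify and document construction defects on project sites through a voice-enabled virtual assistant.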

Hochtief

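NTT DATA accelerates decision making with meeting insights

NTT DATA uncovers insights in speech data through real-time meeting transcription. With Custom Speech, staff can tailor speech recognition models to understand organization-specific vocabulary.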

NTT DATA

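Conversational banking experiences powered by Insight

Insight Enterprises uses a banking solution driven by conversational AI to help banks bring the speed and convenience of digital services to their branches. Speech to Text converts what customers say into data that can be processed and analyzed, so customers get relevant responses in real time.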

Insight Enterprises, Inc.

Frequently asked questions about Speech to Text

  • Speech to Text is a feature within the Speech service that accurately and quickly transcribes audio to text.
  • Cognitive Services are a collection of customizable, prebuilt AI models that can be used to add AI to applications. They span a variety of domains, including Speech, Decision, Language, and Vision. Speech to Text is one feature within the Speech service. Other Speech-related features include Text to Speech, Speech Translation, and Speaker Recognition. An example of a Decision service is Personalizer, which allows you to deliver personalized, relevant experiences. Examples of Language services include Language Understanding, Text Analytics for natural language processing, QnA Maker for FAQ experiences, and Translator for language translation.

Get started with Speech