{"id":7571,"date":"2022-11-17T00:00:00","date_gmt":"2022-11-17T00:00:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english"},"modified":"2025-06-10T02:15:54","modified_gmt":"2025-06-10T09:15:54","slug":"expanding-ai-technology-for-unstructured-text-beyond-english","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/","title":{"rendered":"Expanding AI technology for unstructured biomedical text beyond English"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">The health industry is embracing the power of big data, cloud computing, and clinical analytics, harnessing data to deliver insights that can improve care and efficiency. Still, unstructured text remains a challenge\u2014made even more complex by barriers of language. Doctors\u2019 notes and other unstructured text are often left unreferenced, are hard to parse and learn from, and are difficult to extract insights from, which leads to missed opportunities for diagnosis and better care.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Microsoft recognizes the need to enable healthcare organizations worldwide to gather insights from this data\u2014for better, faster, and more personalized care, and to improve health equity. 
With <a href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/cognitive-services\/language-service\/text-analytics-for-health\/overview?tabs=ner\" target=\"_blank\" rel=\"noopener\">Text Analytics for Health<\/a>, a part of <a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/cognitive-services\/?OCID=AIDcmm5edswduu_SEM_bcc6d3e354001b1501bab086ce3ab821:G:s&amp;ef_id=bcc6d3e354001b1501bab086ce3ab821:G:s&amp;msclkid=bcc6d3e354001b1501bab086ce3ab821\" target=\"_blank\" rel=\"noopener\">Azure Cognitive Services<\/a>, healthcare organizations around the world can now extract meaningful insights from unstructured text in seven languages and process it in a way that enables clinical decision support like never before. Moving beyond English, Text Analytics for Health now supports six additional languages in preview\u2014Spanish, French, German, Italian, Portuguese, and Hebrew\u2014making this groundbreaking technology that helps extract insights from multilingual unstructured clinical notes accessible to more health organizations globally. This marks the first-of-its-kind natural language processing (NLP) service that holistically supports analysis of unstructured biomedical data in multiple languages and was developed with a federated learning approach. Most health technology is limited to the English language, making it inaccessible to millions of people in countries where English is not the primary language. 
Releasing NLP technology in multiple languages is a huge step forward in bridging the gaps in health equity created by language barriers and ensuring that access and quality of health care is not determined by one\u2019s ability to speak and understand English.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Text Analytics for Health uses powerful NLP to detect and identify medical terms in text, classify them and associate them with standard clinical coding systems, as well as infer semantic relationships and assertions in the data, enabling deeper contextual understanding. This opens a world of possibilities for providers, payors, life sciences, and pharmaceutical companies, allowing them to unify data points from unstructured text with structured data, and enabling them to&nbsp;surface key insights, identify risks, automate form-filling, or match clinical trials to patients for better sourcing of candidates\u2014based on comprehensive data including unstructured clinical text.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"623\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp\" alt=\"Desk with doctors stethoscope, medical reports and a tablet showing graphs\" class=\"wp-image-41126\" srcset=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp 1024w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1-300x183.webp 300w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1-768x467.webp 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"training-the-nlp-model-for-different-languages\">Training the NLP model for different languages<\/h2>\n\n\n\n<p 
class=\"wp-block-paragraph\">One of the challenges for an NLP service is moving past English to analyze text in other languages. This is what Microsoft\u2019s team set out to do: empower all health organizations, no matter the language their text is in. The unique challenges come from the need to train AI models for multiple languages, as well as adjust to country-specific needs. Syntax differs between languages, especially for languages that do not use the Latin alphabet. Languages have different semantics and word boundaries, especially those with rich morphology or compound words. Vocabularies are different, jargon is country-specific, and even coding systems differ by country. Words are often borrowed from other languages, leading to text that contains a mixture of multiple languages. Written text is a mixture of colloquialisms, local medical terms, and shorthand that is country-specific. Training models to understand these differences and then evaluating those models required significant amounts of clinical data and working with subject matter experts in different languages.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/www.leumit.co.il\/eng\/home\/\" target=\"_blank\" rel=\"noopener\">Leumit Health Services<\/a>, one of the four national health funds in Israel, worked closely with Microsoft&#8217;s R&amp;D team to train the Text Analytics for Health model for the Hebrew language. Israel has a unique and robust healthcare&nbsp;system where every individual\u2019s records are stored in electronic medical records (EMR) and all citizen residents are required by law to join one of the four designated HMOs. 
The available health data is rich and diverse, and provides a great starting point for research and analysis.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/www.leumit.co.il\/eng\/home\/\" target=\"_blank\" rel=\"noopener\">Leumit Health Services<\/a> had over 130 million patient records in their EMR that could be used for training the Text Analytics for Health multilingual model for Hebrew. The challenge was how to give Microsoft access to de-identified data for training purposes in a manner that protected the privacy and security of the customer\u2019s health information. The answer was a federated learning approach: data never left Leumit\u2019s trust boundary and Microsoft was never exposed to patients\u2019 health information. Leumit created a separate subscription in Azure with strict access permissions where Microsoft installed its federated learning infrastructure and tools. Leumit then uploaded the de-identified data needed for the research, and Microsoft developers triggered the model training in a federated learning setup on that de-identified data. Throughout, the data never left Leumit\u2019s subscription, and the developers were never able to see any identifying details of the data.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Leumit then became one of the first customers to test the Text Analytics for Health model for clinical Hebrew, which is challenging since it often includes Hebrew and English words in the same sentence. The use case tested whether the Text Analytics for Health model could analyze free text from medical visits to identify predictors of strokes in patients. Preliminary results are encouraging: they show that the model can parse both the Hebrew and English clinical statements and analyze them in a way that could help identify various potential indicators of stroke. 
This could help care providers set up early warning mechanisms and provide more personalized care for a variety of acute conditions.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-large-font-size wp-block-paragraph\"><em>Using Microsoft\u2019s Hebrew NLP, we will be able to analyze our 20 years of EMR data and patient-to-doctor messages to develop tools that will save physicians time and will reduce their burnout in a post-Covid-19 world<\/em> &#8211; Izhar Laufer, Head of <a href=\"https:\/\/www.innovation.leumit.co.il\/\" target=\"_blank\" rel=\"noopener\">Leumit Start<\/a>.<\/p>\n<\/blockquote>\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/f0623f80-a1e3-4528-8b4a-3e98630401bb.webp\" alt=\"analysis of Hebrew unstructured biomedical text using Text Analytics for Health\" class=\"wp-image-22798 webp-format\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/f0623f80-a1e3-4528-8b4a-3e98630401bb.webp\"><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Figure 1: Analysis of Hebrew unstructured biomedical text using Text Analytics for Health<\/em><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/bdd298f1-f2b4-4cdc-9a47-2ee603618027.webp\" alt=\"analysis of Hebrew unstructured biomedical text using Text Analytics for Health\" title=\"Picture2\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Figure 2: Analysis of Hebrew unstructured biomedical text using Text Analytics for Health<\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"analyzing-unstructured-text-for-real-world-data\">Analyzing unstructured text for Real-World Data<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The challenge of unstructured data is even greater in the research world with 
the use of Real-World Data (RWD). In Brazil, amongst other places, the lack of a standard for interoperability and data collection leads to large volumes of unstructured data\u2014field reports, doctors&#8217; notes, and even laboratory exam results. This slows down the process of research and analysis for providers such as <a href=\"https:\/\/grupooncoclinicas.com\/\" target=\"_blank\" rel=\"noopener\">Grupo Oncocl\u00ednicas<\/a>. Founded in 2010, Grupo Oncocl\u00ednicas is the largest oncology treatment provider in the private sector in Brazil, with 129 units in 33 cities\u2014including clinics, genomics and pathology laboratories, and integrated cancer treatment centers.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">With the help of <a href=\"https:\/\/www.dataside.com.br\/\" target=\"_blank\" rel=\"noopener\">Dataside<\/a>, a Microsoft partner in Brazil, Grupo Oncocl\u00ednicas is using Microsoft\u2019s Text Analytics for Health to extract data from unstructured fields such as medical notes, anatomic pathology, and genomic and imaging reports such as MRIs. 
This data is then used for various use cases such as clinical trial feasibility, a better understanding of the scenarios for pharmacoeconomics, and gaining a deeper understanding of group epidemiology and outcomes of interest.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6d9b6539-90be-401a-9ff9-fd7efe096353.webp\" alt=\"analysis of Portuguese unstructured biomedical text using Text Analytics for Health\" title=\"Picture4\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Figure 3: Analysis of Portuguese unstructured biomedical text using Text Analytics for Health<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201c<em>Text Analytics for Health was a turning point for <a href=\"https:\/\/grupooncoclinicas.com\/\" target=\"_blank\" rel=\"noopener\">Grupo Oncocl\u00ednicas<\/a> to scale our processes and to structure our clinical notes, exam reports and field analysis, which previously only depended on manual curation. Having a solution that works in Portuguese is key\u2014most global solutions tend to only cater to English, thereby neglecting other languages. 
Accuracy in the native Portuguese allowed us to maintain a high level of accuracy while analyzing the unstructured text.<\/em>\u201d\u2014Marcio Guimaraes Souza, Head of Data and AI at Grupo Oncocl\u00ednicas.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"analysis-and-structuring-to-fast-healthcare-interoperability-resources-fhir\">Analysis and structuring to Fast Healthcare Interoperability Resources (FHIR\u00ae)<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The <a href=\"https:\/\/www.unisr.it\/en\/homepage\" target=\"_blank\" rel=\"noopener\">Italian Vita-Salute San Raffaele University<\/a> and <a href=\"https:\/\/www.hsr.it\/\" target=\"_blank\" rel=\"noopener\">IRCCS San Raffaele Hospital<\/a> are building the healthcare of the future by leveraging Microsoft\u2019s artificial intelligence (AI) services. With Text Analytics for Health, the hospital can classify, standardize, and analyze the enormous amount of clinical data available at the hospital in order to create an innovative digital platform for data management. Using this platform, the hospital\u2019s physicians can gain important clinical insights about their patients and provide more personalized care. One of the use cases currently being developed on this data platform is the selection of patients eligible for immunotherapy for non-small cell lung cancer. Medical staff can use the AI-driven analysis to increase the success rate of therapy by matching the relevant treatment to the most eligible patients.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201c<em>Text Analytics for Health has played a key role in analyzing the enormous amount of unstructured clinical data that we have at the hospital. We are also using the FHIR structuring capability, which allows greater interoperability with other hospital systems. 
Having Text Analytics for Health available in Italian now allows us to expand our capabilities even further to offer our patients the best possible care.<\/em>\u201d\u2014Professor Carlo Tacchetti, Professor of Human Anatomy, Vita-Salute San Raffaele University, and coordinator of the project.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/fbfb8521-58e7-423f-a78e-ead42ceabdf4.webp\" alt=\"analysis of Italian unstructured biomedical text using Text Analytics for Health\" title=\"Picture5\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Figure 4: Analysis of Italian unstructured biomedical text using Text Analytics for Health<\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"do-more-with-your-data-with-microsoft-cloud-for-healthcare\">Do more with your data with Microsoft Cloud for Healthcare<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">With Text Analytics for Health, health organizations can transform their patient care, discover new insights and harness the power of machine learning and AI by leveraging unstructured text. 
Microsoft is committed to delivering technology that enables your data for the future of healthcare innovation with new features in the Microsoft Cloud for Healthcare.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We look forward to being your partner as you build the future of health.<br>\n\u2022&nbsp;&nbsp;&nbsp; Learn more about <a href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/cognitive-services\/language-service\/text-analytics-for-health\/overview?tabs=ner\" target=\"_blank\" rel=\"noopener\">Text Analytics for Health<\/a>.<br>\n\u2022&nbsp;&nbsp;&nbsp; Learn more about <a href=\"https:\/\/aka.ms\/cloudforhealthcare\" target=\"_blank\" rel=\"noopener\">Microsoft Cloud for Healthcare<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u00aeFHIR is a registered trademark of Health Level Seven International, registered in the U.S. Trademark Office, and is used with their permission.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never 
before.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1454,1556],"tags":[1484],"audience":[3057,3055,3056],"content-type":[1481],"product":[3164],"tech-community":[],"topic":[],"coauthors":[126],"class_list":["post-7571","post","type-post","status-publish","format-standard","hentry","category-ai-machine-learning","category-mobile","tag-microsoft-cloud-for-healthcare","audience-data-professionals","audience-developers","audience-it-implementors","content-type-thought-leadership","product-microsoft-foundry","review-flag-1680286581-56","review-flag-1-1680286581-825","review-flag-2-1680286581-601","review-flag-3-1680286581-173","review-flag-4-1680286581-250","review-flag-artif-1680286586-345","review-flag-free-1680286579-836","review-flag-lever-1680286579-649","review-flag-machi-1680286585-314","review-flag-microsofts","review-flag-never-1680286580-606","review-flag-new-1680286579-546","review-flag-partn-1680286579-901","review-flag-the-m-1680286586-24"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Expanding AI technology for unstructured biomedical text beyond English | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never before.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" 
href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Expanding AI technology for unstructured biomedical text beyond English | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never before.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2022-11-17T00:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-10T09:15:54+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"623\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Hadas Bitran\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Hadas Bitran\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/hadas-bitran\/\",\"@type\":\"Person\",\"@name\":\"Hadas Bitran\"}],\"headline\":\"Expanding AI technology for unstructured biomedical text beyond English\",\"datePublished\":\"2022-11-17T00:00:00+00:00\",\"dateModified\":\"2025-06-10T09:15:54+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\"},\"wordCount\":1611,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp\",\"keywords\":[\"Microsoft Cloud for Healthcare\"],\"articleSection\":[\"AI + machine learning\",\"Mobile\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\",\"name\":\"Expanding AI technology for 
unstructured biomedical text beyond English | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp\",\"datePublished\":\"2022-11-17T00:00:00+00:00\",\"dateModified\":\"2025-06-10T09:15:54+00:00\",\"description\":\"With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never before.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp\",\"width\":1024,\"height\":623,\"caption\":\"A tablet with a stethoscope and pen on top of 
it\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI + machine learning\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Expanding AI technology for unstructured biomedical text beyond English\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure 
Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117\",\"name\":\"shakir\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"caption\":\"shakir\"},\"sameAs\":[\"https:\/\/azure.microsoft.com\"],\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Expanding AI technology for unstructured biomedical text beyond English | Microsoft Azure Blog","description":"With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never before.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/","og_locale":"en_US","og_type":"article","og_title":"Expanding AI technology for unstructured biomedical text beyond English | Microsoft Azure Blog","og_description":"With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never before.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2022-11-17T00:00:00+00:00","article_modified_time":"2025-06-10T09:15:54+00:00","og_image":[{"width":1024,"height":623,"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp","type":"image\/webp"}],"author":"Hadas Bitran","twitter_card":"summary_large_image","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Hadas Bitran","Est. 
reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/hadas-bitran\/","@type":"Person","@name":"Hadas Bitran"}],"headline":"Expanding AI technology for unstructured biomedical text beyond English","datePublished":"2022-11-17T00:00:00+00:00","dateModified":"2025-06-10T09:15:54+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/"},"wordCount":1611,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp","keywords":["Microsoft Cloud for Healthcare"],"articleSection":["AI + machine learning","Mobile"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/","name":"Expanding AI technology for unstructured biomedical text beyond English | Microsoft Azure 
Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp","datePublished":"2022-11-17T00:00:00+00:00","dateModified":"2025-06-10T09:15:54+00:00","description":"With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in eight languages and process it in a way that enables clinical decision support like never before.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2022\/11\/6f868353-ea86-41b9-ad54-f9d428aaf494-1.webp","width":1024,"height":623,"caption":"A tablet with a stethoscope and pen on top of it"},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/expanding-ai-technology-for-unstructured-text-beyond-english\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog 
home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"AI + machine learning","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/"},{"@type":"ListItem","position":3,"name":"Expanding AI technology for unstructured biomedical text beyond English"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure 
Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117","name":"shakir","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4","url":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","caption":"shakir"},"sameAs":["https:\/\/azure.microsoft.com"],"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure 
Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/7571","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=7571"}],"version-history":[{"count":1,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/7571\/revisions"}],"predecessor-version":[{"id":41128,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/7571\/revisions\/41128"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=7571"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=7571"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=7571"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=7571"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=7571"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=7571"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=7571"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=7571"},{"taxonomy":"author","embeddable":tr
ue,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=7571"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}