Skip to main content

 Subscribe

Kushal Datta

Principal Software Engineer

Latest posts

Showing 1 – 2 of 2 posts found

Published • 4 min read

Azure sets a scale record in large language model training 

By Principal Software Engineer, Technical Program Manager 2, Microsoft, and Senior Software Engineer, Microsoft

Customers need reliable and performant infrastructure to bring the most sophisticated AI use cases to market in record time. Our objective is to build state-of-the-art infrastructure and meet these demands. The latest MLPerf™ 3.1 Training results1 are a testament to our unwavering commitment to building high-quality and high-performance systems in the cloud to achieve unparalleled efficiency in training LLMs at scale.

Image of a Data center operator

Published • 3 min read

Azure empowers easy-to-use, high-performance, and hyperscale model training using DeepSpeed 

By Principal Software Engineer

Large-scale transformer-based deep learning models trained on large amounts of data have shown great results in recent years in several cognitive tasks and are behind new products and features that augment human capabilities. Azure Machine Learning (AzureML) brings large fleets of the latest GPUs powered by the InfiniBand interconnect to tackle large-scale AI training.