Apache Airflow is an open source platform used to author, schedule, and monitor workflows. Airflow overcomes some of the limitations of the cron utility by providing an extensible framework that includes operators, programmable interface to author jobs, scalable distributed architecture, and rich tracking and monitoring capabilities.
Today, we are announcing the ability to make customizations to live running Apache Hadoop clusters in Azure HDInsight.
As we get ready for SQL PASS Summit 2015, the world’s largest gathering of SQL Server and BI professionals, we’re excited to share that Azure Data Lake Analytics and Azure Data Lake Store are now in Public Preview.
As we get ready for AzureCon and to join the thousands of big data enthusiasts at Stata + Hadoop World in NYC this week, we’re excited to share a new and expanded Azure Data Lake that makes big data more accessible.
Today at Build, we announced the Azure Data Lake, Microsoft’s hyperscale repository for big data analytic workloads in the cloud. This offering is built for the cloud, compatible with HDFS, and has unbounded scale with massive throughput and enterprise-grade capabilities.