Apache

Deploying Apache Airflow in Azure to build and run data pipelines

10 декабря 2018 г.

Apache Airflow is an open source platform used to author, schedule, and monitor workflows. Airflow overcomes some of the limitations of the cron utility by providing an extensible framework that includes operators, programmable interface to author jobs, scalable distributed architecture, and rich tracking and monitoring capabilities.

Senior Program Manager, Azure OSS Database service

Introducing Azure Data Lake

29 апреля 2015 г.

Today at Build, we announced the Azure Data Lake, Microsoft’s hyperscale repository for big data analytic workloads in the cloud. This offering is built for the cloud, compatible with HDFS, and has unbounded scale with massive throughput and enterprise-grade capabilities.

Product Marketing, Hadoop/Big Data and Data Warehousing