Customer churn prediction using real-time analytics

Azure Machine Learning

Solution ideas

This article is a solution idea. If you'd like us to expand the content with more information, such as potential use cases, alternative services, implementation considerations, or pricing guidance, let us know by providing GitHub feedback.

Customer Churn Prediction uses Azure AI platform to predict churn probability, and it helps find patterns in existing data that are associated with the predicted churn rate.

Architecture

Architecture diagram: predicting customer churn with machine learning.

Download a Visio file of this architecture.

Dataflow

  1. Use Azure Event Hubs to stream all live data into Azure.

  2. Process real-time data using Azure Stream Analytics. Stream Analytics can output processed data into Azure Synapse . This allows customers to combine existing and historical data to create dashboards and reports in Power BI.

  3. Ingest historical data at scale into Azure Blob Storage using Azure Synapse or another ETL tool.

  4. Use Azure Synapse to combine streaming data with historical data for reporting or experimentation in Azure Machine Learning.

  5. Use Azure Machine Learning to build models for predicting churn probability and identify data patterns to deliver intelligent insights.

  6. Use Power BI to build operational reports and dashboards on top of Azure Synapse. Azure Machine Learning models can be used to further enhance reporting and to assist businesses in decision making processes.

Components

  • Azure Event Hubs is an event ingestion service that can process millions of events per second. Data sent to event hub can be transformed and stored using any real-time analytics provider.
  • Azure Stream Analytics is a real-time analytics engine designed to analyze and process high volume of fast streaming data. Relationships and patterns identified in the data can be used to trigger actions and initiate workflows such as creating alerts, feeding information to a reporting tool, or storing transformed data for later use.
  • Azure Blob Storage is a cloud service for storing large amounts of unstructured data such as text, binary data, audio, and documents more-easily and cost-effectively. Azure Blob Storage allows data scientists quick access to data for experimentation and AI model building.
  • Azure Synapse Analytics is a fast and reliable data warehouse with limitless analytics that brings together data integration, enterprise data warehousing, and big data analytics. It gives you the freedom to query data on your terms, using either serverless or dedicated resources and serve data for immediate BI and machine learning needs.
  • Azure Machine Learning can be used for any supervised and unsupervised machine learning, whether you prefer to write Python of R code. You can build, train, and track machine learning models in an Azure Machine Leaning workspace.
  • Power BI is a suite of tools that delivers powerful insights to organizations. Power BI connects to various data sources, simplify data prep and model creation from disparate sources. Enhance team collaboration across the organization to produce analytical reports and dashboard to support the business decisions and publish them to the web and mobile devices for users to consume.

Scenario details

Keeping existing customers is five times cheaper than the cost of getting new customers. For this reason, marketing executives often find themselves trying to estimate the likelihood of customer churn and finding the necessary actions to minimize the churn rate.

Potential use cases

This solution uses Azure Machine Learning to predict churn probability and helps find patterns in existing data associated with the predicted churn rate. By using both historical and near real-time data, users are able to create predictive models to analyze characteristics and identify predictors of the existing audience. This information provides businesses with actionable intelligence to improve customer retention and profit margins.

This solution is optimized for the retail industry.

Deploy this scenario

For more details on how to build and deploy this solution, visit the solution guide in GitHub.

The objective of this guide is to demonstrate predictive data pipelines for retailers to predict customer churn. Retailers can use these predictions to prevent customer churn by using their domain knowledge and proper marketing strategies to address at-risk customers. The guide also shows how customer churn models can be retrained to use more data as it becomes available.

What's under the hood

The end-to-end solution is implemented in the cloud, using Microsoft Azure. The solution is composed of several Azure components, including data ingest, data storage, data movement, advanced analytics, and visualization. The advanced analytics are implemented in Azure Machine Learning, where you can use Python or R language to build data science models. Or you can reuse existing in-house or third-party libraries. With data ingest, the solution can make predictions based on data transferred to Azure from an on-premises environment.

Solution dashboard

The snapshot below shows an example Power BI dashboard that gives insights into the predicted churn rates across a customer base.

Power BI dashboard that gives insights into the predicted churn rates across a customer base.

Next steps

Architecture guides:

Reference architectures: