{"id":2557,"date":"2018-06-28T00:00:00","date_gmt":"2018-06-28T07:00:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/blog\/the-emerging-big-data-architectural-pattern"},"modified":"2023-08-01T16:53:40","modified_gmt":"2023-08-01T23:53:40","slug":"the-emerging-big-data-architectural-pattern","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/","title":{"rendered":"The emerging big data architectural pattern"},"content":{"rendered":"<h2>Why lambda?<\/p>\n<p>Lambda architecture is a popular pattern in building Big Data pipelines. It is designed to handle massive quantities of data by taking advantage of both a <a href=\"https:\/\/en.wikipedia.org\/wiki\/Batch_processing\">batch<\/a> layer (also called cold layer) and a <a href=\"https:\/\/en.wikipedia.org\/wiki\/Stream_processing\">stream-processing<\/a> layer (also called hot or speed layer).<\/p>\n<p>The following are some of the reasons that have led to the popularity and success of the lambda architecture, particularly in big data processing pipelines.<\/p>\n<h3>Speed and business challenges<\/h3>\n<p>The ability to process data at high speed in a streaming context is necessary for operational needs, such as transaction processing and real-time reporting. Some examples are fault\/fraud detection, connected\/smart cars\/factory\/hospitals\/city, sentiment analysis, inventory control, network\/security monitoring, and many more.<\/p>\n<p>Typically, batch processing, involving massive amounts of data, and related correlation and aggregation is important for business reporting. This is to understand how the business is performing, what the trends are, and what corrective or additive measure can be executed to improve business or customer experience.<\/p>\n<h3>Product challenges<\/h3>\n<p>One of the triggers that lead to the very existence of lambda architecture was to make the most of the technology and tool set available. Existing batch processing systems, such as data warehouse, data lake, Spark\/Hadoop, and more, could deal with petabyte scale data operations easily but couldn\u2019t do it fast enough that was warranted by the operational needs.<\/p>\n<p>Similarly, very fast layers such as cache databases, NoSQL, streaming technology allows fast operational analytics on smaller data sets but cannot do massive scale correlation and aggregation and other analytics operations (such as <a href=\"https:\/\/en.wikipedia.org\/wiki\/Online_analytical_processing\">Online Analytical Processing<\/a>) like a batch system can.<\/p>\n<h3>The skills challenge<\/h3>\n<p>Additionally, in the market you will find people who are highly skilled in batch systems, and often they do not have the same depth of skills in stream processing, and vice versa.<\/p>\n<h3>Lambda on Azure<\/h3>\n<p>The following is one of the many representative Lambda architecture on Azure for building Big Data pipelines.<\/p>\n<p align=\"center\"><img loading=\"lazy\" decoding=\"async\" alt=\"Figure 1\" height=\"713\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp\" title=\"Figure 1\" width=\"1541\"><br \/>\n\u00a0<br \/>\n<em>Figure 1: Lambda architecture for big data processing represented by Azure products and services. Note, other Azure and (or) ISV solutions can be placed in the mix if needed based on specific requirements.<\/em><\/p>\n<h2 align=\"left\">What problems do lambda solve vs. what problems does it introduce?<\/h2>\n<p>As stated in the previous section, lambda architecture resolves some business challenges. Various parts of the business have different needs in terms of speed, level of granularity and mechanism to consume data. It also resolves the challenge of the choice of technology, by using the best of the speed layer and batch layer together, and not stretching one product to do both which it isn\u2019t comfortable in doing. Finally, it ensures people with skills dealing with transaction and speed layer can work in parallel and together with people with skills in batch processing.<\/p>\n<p>Although immensely successful and widely adopted across many industries and a defacto architectural pattern for big data pipelines, it comes with its own challenges. Here are a few:<\/p>\n<ul>\n<li>\n<p><strong>Transient data silos:<\/strong> Lambda pipelines often creates silos that could may cause some challenges in the business. The reporting at the speed layer that the operations team is dealing with, may be different for the aggregate batch layer that the management teams are working with. Such creases may eventually iron out, but it has the potential of causing some inconsistencies.<\/p>\n<\/li>\n<\/ul>\n<ul>\n<li><strong>More management overhead: <\/strong>It also increases the number of subsystems, as a result during maintenance time, many needed to be managed and maintained. This could potentially mean one may need bigger teams with deep and wide skill sets.<\/li>\n<\/ul>\n<h2>The emerging big data design pattern<\/h2>\n<p>If there was a way that utilized the right mix of technologies that didn\u2019t need a separate speed or batch layer, we could build a system that has only a single layer and allows attributes of both the speed layer and batch layer. With the technological breakthrough at Microsoft, particularly in <a href=\"https:\/\/azure.microsoft.com\/en-gb\/services\/cosmos-db\/\">Azure Cosmos DB<\/a>, this is now possible.<\/p>\n<p>Azure Cosmos DB is a globally distributed, multi-model database. With Cosmos DB you can independently scale throughput and storage across any number of Azure&#8217;s geographic regions. It offers throughput, latency, availability, and consistency guarantees with comprehensive <a href=\"https:\/\/aka.ms\/acdbsla\">service level agreements<\/a> (SLAs).<\/p>\n<p>Here are some of the key features that renders Cosmos DB as a suitable candidate for implementing the proposed reference architecture where the speed later and the batch layer merges into a single layer.<\/p>\n<h3>Cosmos DB change feed<\/h3>\n<ul>\n<li>Most importantly, the key feature that is pivotal in building this emerging big data architectural pattern is the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/change-feed\">Cosmos DB change feed<\/a>. Change feed support works by listening to an Azure Cosmos DB collection for any changes. Then, it outputs the sorted list of documents that were changed in the order in which they were modified. The changes are persisted, can be processed asynchronously and incrementally, and the output can be distributed across one or more consumers for parallel processing.<\/li>\n<\/ul>\n<h3>Features for speed\/hot layer<\/h3>\n<ul>\n<li>Azure Cosmos DB\u2019s database engine is fully schema-agnostic \u2013 it automatically indexes all the data it ingests without requiring any schema or indexes and serves blazing fast queries.<\/li>\n<li>Cosmos DB allows you to easily scale database throughput at a <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/request-units\">per-second<\/a> granularity, and change it anytime you want.<\/li>\n<li>You can <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/distribute-data-globally\">distribute your data<\/a> to any number of <a href=\"https:\/\/azure.microsoft.com\/regions\/\">Azure regions<\/a>, with the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/tutorial-global-distribution-sql-api\">click of a button<\/a>. This enables you to put your data where your users are, ensuring the lowest possible latency to your customers.<\/li>\n<li>Azure Cosmos DB guarantees end-to-end low latency at the 99th percentile to its customers. For a typical 1KB item, Cosmos DB guarantees end-to-end latency of reads under 10 ms and indexed writes under 15 ms at the 99th percentile, within the same Azure region. The median latencies are significantly lower (under 5 ms).<\/li>\n<\/ul>\n<h3>Features for batch\/cold layer<\/h3>\n<ul>\n<li>You can access your data by using APIs of your choice, like the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/documentdb-introduction\">SQL<\/a>, <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/mongodb-introduction\">MongoDB<\/a>, <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/cassandra-introduction\">Cassandra API<\/a>, and <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/table-introduction\">Table<\/a> APIs, and graph via the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/graph-introduction\">Gremlin API<\/a>. All APIs are all natively supported.<\/li>\n<li>You can also scale storage size <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/partition-data\">transparently and automatically<\/a> to handle your size requirements now and forever.<\/li>\n<li>Five well-defined, practical, and intuitive <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/consistency-levels\">consistency models<\/a> provide a spectrum of strong SQL-like consistency all the way to the relaxed NoSQL-like eventual consistency, and everything in-between.<\/li>\n<li>Rapidly iterate the schema of your application without worrying about database schema and\/or index management.<\/li>\n<li>Using the features described above, the following will be an implementation of the emerging architectural pattern.<\/li>\n<\/ul>\n<p>The following is a diagrammatic representation of the emerging big data pipeline that we have been discussing in this blog:<\/p>\n<ul>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" alt=\"Figure 2\" height=\"666\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/eb7a5666-1040-454f-a535-84c1d219389a.webp\" title=\"Figure 2\" width=\"1723\"><\/p>\n<p align=\"center\"><em>Figure 2: Emerging architectural pattern implemented using Cosmos DB for Big Data pipelines as an evolution of the traditional lambda architecture.<\/em><\/p>\n<p>Hence, by leveraging Cosmos DB features, particularly the change feed architecture, this emerging pattern can resolve many of the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/use-cases\">common use-cases<\/a>. This in turn, gives all the benefits of the lambda architecture, and resolves some of complexities that lambda introduces. More and more customers adopting this and resulting in a successful community, and success of this new pattern and increased adoption of Azure Cosmos DB.<\/p>\n<h2>What\u2019s next?<\/h2>\n<p>The following is a list of resources that may help you get started quickly:<\/p>\n<ul>\n<li>If you haven&#8217;t already, download the Spark to Azure Cosmos DB connector from the <a href=\"https:\/\/github.com\/Azure\/azure-cosmosdb-spark\">azure-cosmosdb-spark<\/a> GitHub repository.<\/li>\n<li>The <a href=\"https:\/\/github.com\/tknandu\/TwitterCosmosDBFeed\">stream feed from Twitter to CosmosDB<\/a>, which is the mechanism to push new data into Azure Cosmos DB.<\/li>\n<li>As well with the Cosmos DB Time-to-Live (TTL) feature, you can configure your documents to be automatically deleted after a set duration. For more information on the Azure Cosmos DB TTL feature, see <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/time-to-live\">Expire data in Azure Cosmos DB collections automatically with time to live<\/a>.<\/li>\n<li>Azure Cosmos DB <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/bulk-executor-overview\">BulkExecutor library<\/a> overview.<\/li>\n<li>Hands on labs using Azure Services to build <a href=\"https:\/\/github.com\/rangv\/AzureIoTLabs\">Azure IoT End to End Solutions<\/a> connecting real and simulated devices to Azure IoT Hub.<\/li>\n<li>Perform <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/spark-connector-graph\">graph analytics by using Spark and Apache TinkerPop Gremlin<\/a>.<\/li>\n<li>Connect to <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cosmos-db\/odbc-driver\">Azure Cosmos DB using BI analytics tools<\/a> with the ODBC driver.<\/li>\n<\/ul>\n<\/h2>\n","protected":false},"excerpt":{"rendered":"<p>Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1473],"tags":[],"audience":[3057,3055,3056],"content-type":[],"product":[1538],"tech-community":[],"topic":[],"coauthors":[97],"class_list":["post-2557","post","type-post","status-publish","format-standard","hentry","category-databases","audience-data-professionals","audience-developers","audience-it-implementors","product-azure-cosmos-db"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>The emerging big data architectural pattern | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The emerging big data architectural pattern | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2018-06-28T07:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-08-01T23:53:40+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp\" \/>\n<meta name=\"author\" content=\"Microsoft Azure\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Microsoft Azure\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/microsoft-azure\/\",\"@type\":\"Person\",\"@name\":\"Microsoft Azure\"}],\"headline\":\"The emerging big data architectural pattern\",\"datePublished\":\"2018-06-28T07:00:00+00:00\",\"dateModified\":\"2023-08-01T23:53:40+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\"},\"wordCount\":1323,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp\",\"articleSection\":[\"Databases\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\",\"name\":\"The emerging big data architectural pattern | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp\",\"datePublished\":\"2018-06-28T07:00:00+00:00\",\"dateModified\":\"2023-08-01T23:53:40+00:00\",\"description\":\"Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Databases\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/databases\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"The emerging big data architectural pattern\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117\",\"name\":\"shakir\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"caption\":\"shakir\"},\"sameAs\":[\"https:\/\/azure.microsoft.com\"],\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The emerging big data architectural pattern | Microsoft Azure Blog","description":"Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/","og_locale":"en_US","og_type":"article","og_title":"The emerging big data architectural pattern | Microsoft Azure Blog","og_description":"Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2018-06-28T07:00:00+00:00","article_modified_time":"2023-08-01T23:53:40+00:00","og_image":[{"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp","type":"","width":"","height":""}],"author":"Microsoft Azure","twitter_card":"summary_large_image","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Microsoft Azure","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/microsoft-azure\/","@type":"Person","@name":"Microsoft Azure"}],"headline":"The emerging big data architectural pattern","datePublished":"2018-06-28T07:00:00+00:00","dateModified":"2023-08-01T23:53:40+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/"},"wordCount":1323,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp","articleSection":["Databases"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/","name":"The emerging big data architectural pattern | Microsoft Azure Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp","datePublished":"2018-06-28T07:00:00+00:00","dateModified":"2023-08-01T23:53:40+00:00","description":"Lambda architecture is popular is Big Data architecture pipelines and is designed to handle massive quantities of data by taking advantage of both a batch layer (also called cold layer)\u2026","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/06\/38f819e8-b707-442e-9374-97b57c5d5742.webp"},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-emerging-big-data-architectural-pattern\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"Databases","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/databases\/"},{"@type":"ListItem","position":3,"name":"The emerging big data architectural pattern"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117","name":"shakir","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4","url":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","caption":"shakir"},"sameAs":["https:\/\/azure.microsoft.com"],"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/2557","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=2557"}],"version-history":[{"count":0,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/2557\/revisions"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=2557"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=2557"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=2557"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=2557"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=2557"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=2557"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=2557"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=2557"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=2557"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}