KAVE Analytics Platform

게시자: KAVE
Data Analytics

Through complete use of your own data, you generate value; for people, for society, and for customers. KAVE forms the heart of a new 'Big Data ecosystem', removing common obstacles, allowing quick, easy and consistent data analytics.


KAVE anticipates a near future when data analysis plays an increasingly central position in society. To realize this KAVE adopts the best of breed analytics tools around Hadoop in combination with tools for collaboration, continuous integration, management and security.

KAVE.io video


KAVE can be tailored to your needs, is modular, extensible. A few common use-cases include:

  • Data Exploration PoC
  • Data science education
  • Team collaboration environment
  • Monthly report generation
  • Combined real-time and batch processing: Lambda architecture
  • Hadoop-based Data-Lake

For more information see KAVE on Github. If you need help tailoring your KAVE solution we have an experienced team of data scientist, architects and engineers available to assist you. For more information or assistance send us a message: kave@kpmg.com


This KAVE includes a cluster running Hadoop (HDP 2.4), with user management and integration provided by FreeIPA, along with the collaboration, development and analysis tools shown in the diagram below. A desktop environment provided via RDP and VNC is included as well. This is a generic environment suitable for small-medium scale analyses.

Environment size

This KAVE environment uses multiple Azure resources and spreads specific workloads among them in its operation. Among them these are the most noteworthy:

  • 5x - Virtual Machine - Standard_D4_v2 (8 cores, 28 GB memory)
  • 3x - Virtual Machine - Standard_DS4_v2 (8 cores, 28 GB memory)
  • 500 GB - Standard Storage Account
  • 3000 GB - Premium Storage Account
  • Virtual Network

버전: 1.0.3