Imanis Data - Cloud migration, backup for your big data applications on Azure HDInsight

8月 17, 2017 に投稿済み

Program Manager, Azure, Big Data

We are pleased to announce the availability of Imanis Data on Azure.

Azure HDInsight is the industry leading fully-managed cloud Apache Hadoop & Spark offering, which allows customers to run reliable open source analytics with an industry-leading SLA. Imanis Data provides data management software that allows users to migrate data and add backup and restore functionality for their big data applications.

This combined offering of Imanis Data on HDInsight and integration with Azure Blob Storage and Data Lake Store enables customers to migrate to cloud faster, protect critical data assets from application or human error.

Microsoft Azure HDInsight – Reliable open source analytics at enterprise grade and scale

HDInsight is the only fully-managed cloud Hadoop offering that provides optimized open source analytical clusters for Spark, Hive, Interactive Hive, MapReduce, HBase, Storm, Kafka, and R Server, backed by a 99.9% SLA. Each of these big data technologies are easily deployable as managed clusters with enterprise-level security and monitoring.

Imanis Data – Cloud migration, backup and restore for big data applications

The explosive growth of cloud computing in general, and the rise of big data applications, has brought about a need to ensure that workloads previously running on-premises can run at-scale in Azure; as well as keeping the underlying HDInsight data assets protected from disasters, human errors and application corruption.

To that end, we’re excited to highlight Imanis Data (formerly Talena, Inc.) who just launched their software solution on the Azure. Imanis Data provides data management software that covers a wide range of use cases that will benefit HDInsight customers, including:

  • Migration of on-premise or other cloud big data workloads to Azure HDInsight: Imanis Data provides a compelling way for companies to migrate their big data workloads to HDInsight, independent of which Hadoop distribution you’re using. This includes both data and application specific metadata as well.
  • Cloud Disaster Recovery: Imanis Data can easily be used as the basis for moving both data and metadata of your Open Source Workloads such as Hive, HBase, Spark to a secondary region, enabling cross-region DR.
  • Scalable Backup and Rapid Recovery: Imanis Data enables extremely rapid backup and point-in-time recovery of petabyte-scale data used by open source workloads such as Hive, HBase, Spark.
  • Test Data Management: As enterprises move data to the cloud, protecting PII is critical. The native data masking capabilities in Imanis Data enable enterprises to protect sensitive data while migrating data to QA, data analytics or other clusters in the cloud.
  • Archiving for compliance and regulatory requirements.
  • Native integration with Microsoft Azure Blob Storage and Azure Data Lake Store.
     

To support these diverse use cases, the Imanis Data software architecture incorporates:

  • A distributed and highly-scalable file system that enables support for petabyte-scale workloads.
  • Rapid recovery capabilities with an intuitive metadata catalog, the flexibility to recover to different database topologies, and support for parallel data transfers.
  • A built-in storage optimization engine that focuses on incremental-forever backups, global block-level de-duplication, and compression.
  • Agentless integration with various databases.
  • Support for data mirroring and replication across multiple Azure regions.

To learn more about Imanis Data offering on Azure, please see this.

Getting started with Imanis Data on Azure HDInsight

You can install Imanis Data from Azure marketplace. Imanis Data software is installed on a VM which sits outside the cluster.

To configure Imanis Data for Azure HDInsight, please read this detailed guide. Following is a screenshot of configuring Imanis Data for HDInsight.

Screen Shot 2017-08-15 at 2.45.04 PM


After you install it, connect to the Azure HDInsight cluster and perform the following operations:

  • Connect Imanis Data to on-premise Hadoop or Spark cluster: Imanis Data can help migrate data from on-premise Hadoop, Spark or HBase, as well as metadata associated with these workloads to the cloud. You can store the data in Azure Blob Storage or Azure Data Lake Store. Once you move the data you can run Hadoop, Spark or HBase or use R Server on Azure HDInsight to perform advanced analytics.
  • Cloud Disaster Recovery: Imanis Data can easily be used as the basis for moving both data and metadata of your Open Source Workloads such as Hive, HBase, Spark to a secondary region, enabling cross-region DR.
  • Scalable Backup and Rapid Recovery: Imanis Data enables extremely rapid backup and point-in-time recovery of petabyte-scale data used by open source workloads such as Hive, HBase, Spark.
  • Test Data Management: As enterprises move data to the cloud, protecting PII is critical. The native data masking capabilities in Imanis Data enable enterprises to protect sensitive data while migrating data to QA, data analytics or other clusters in the cloud.
  • Archiving for compliance and regulatory requirements.

Joint webinar on cloud migration, backup and restore, and more

We hosted a joint webinar on June 27, during which we highlighted how enterprises can benefit from using Imanis Data to manage their big data applications on HDInsight. We covered various patterns on how you can use Imanis Data to set up a hybrid environment, dev/test management, backup and restore, and replication across different regions in Azure. The following diagram shows a summary of the patterns covered. In case you missed it, you can still watch the webinar to learn more. We look forward to talking with you and getting your feedback.

Screen Shot 2017-08-16 at 10.09.16 PM

Resources

The following resources are available to learn more about this integration:

Summary

This combined offering of Imanis Data on HDInsight and integration with Azure Blob Storage and Data Lake Store enables customers to migrate to cloud faster, protect critical data assets from application or human error. If you have any feedback or questions, feel free to drop us an email at hdiask@microsoft.com. We’d love to hear from you!