Dataguise, a leading provider of data-centric security and data governance solutions for Big Data, has announced it has expanded its DgSecure platform to support Hadoop in the cloud, including the largest cloud Hadoop implementation, Amazon EMR (Elastic MapReduce). Additionally, key Big Data cloud service providers, Altiscale and Qubole, have joined Dataguise’s Big Data Protection Partner Program (BDP3) to leverage Dataguise DgSecure to provide comprehensive discovery, protection and visibility to sensitive data for their cloud-based Hadoop customers. Already a certified partner of Cloudera, Hortonworks and MapR, Dataguise is now the only vendor that supports 100 percent of the major Hadoop distributions along with Hadoop-in-the-Cloud, traditional databases, files and SharePoint.
The “Hadoop-as-a-Service” (HaaS) market, also termed “Hadoop-in-the-cloud,” is projected to grow to $16.1 billion by 2020, according to a report from Allied Market Research. The report goes on to explain that HaaS has emerged as a replacement to on-premises Hadoop, as it is comprised of data storage in the cloud and its analysis through Hadoop without the need to install any infrastructure in the premises. With Dataguise’s DgSecure platform support for Cloudera, MapR, Hortonworks, Pivotal, IBM BigInsights, and now Amazon EMR, along with its partnerships with Altiscale and Qubole – enterprises now have the flexibility to deploy Hadoop however it best fits their business, whether on-premise or in the cloud, while maintaining compliance and protecting sensitive data with the most comprehensive solution available.
New Hadoop Cloud and Amazon Support Capabilities
Dataguise DgSecure for Hadoop has been tested against the full-suite of Amazon Hadoop capabilities, including EMR, S3, Flume, HIVE services and optimized for memory management through the Amazon bootstrapping feature. With this announcement, Dataguise and its partners offer their customers full discovery, encryption, masking and redaction for sensitive data in Amazon EMR, as well as at-rest security for sensitive data stored in Amazon S3.
Companies can quickly install and run DgSecure within the Amazon Web Services. Specific features for Amazon include:
- The ability to automatically provision through command lines to create a system from a single command
- High availability (warm failover capabilities that trigger in less than one minute)
- Multi-cluster support (support for global deployments with multiple Hadoop/EMR clusters from a single console)
- Elasticity (allows companies to provision new servers as needed with centralized, integrated configuration)
“We all know the amount of data stored in Hadoop, databases and data lakes on-premise is accelerating at astonishing rates. Simultaneously, the Hadoop in the cloud market is growing just as fast. Companies today are already facing unprecedented data breaches with traditional on-premise data stores. Now, as that data is migrating to cloud, it’s more important than ever for enterprises as well as service providers to ensure sensitive information is kept safe,” said Manmeet Singh, CEO of Dataguise. “Our data-centric, most-granular field-level approach for Hadoop, on-premise and now in-the-cloud with Amazon EMR and HaaS providers like Altiscale and Qubole, enables our customers and partners to go beyond traditional perimeter-based, access-based and application-based security strategies, with unparalleled ease of deployment and time-to-value.”