@BigDataExpo Authors: Yeshim Deniz, Elizabeth White, Pat Romanski, Liz McMillan, Janakiram MSV

Blog Feed Post

Cloudera Strengthens Hadoop Security with Acquisition of Gazzang: Builds on additional community efforts to deliver end-to-end security offering


One thing I really love about being in the technology field is watching things get done that just a short while ago seemed impossible. I felt that way again when reading the press release below.  In the early days of production systems built around Apache Hadoop, security was only possible by limiting access to your cluster. Later, more and more security related capabilities were added, including better access control, authentication, auditing, and data provenance. Many players delivered niche solutions for encrypting data, but not so long ago most solutions I saw introduced new weaknesses for each solution.  Then some very positive things started happening.  One is Intel corporation started a deep focus on enhanced security, including creating an open source community activity that leveraged smart design that could leverage Intel Data Protection Technology with AES-NI (Project Rhino) in 2013. Cloudera continued to focus on security and find-grain access control with capabilities like Sentry.  Another very positive development was the application of engineering and security talent by an amazing firm named Gazzang. One of the big advances from Gazzang: well engineered key management.

The news below is the product of many of these factors plus the vision and leadership of very smart people at Gazzang, Intel and Cloudera. The result– something that was absolutely impossible just a few years ago, is now achievable. Security still takes forethought, but the fact that well engineered end to end encryption is now possible is a dramatically positive step.

From: http://ctolink.us/Tbddag

Cloudera Strengthens Hadoop Security with Acquisition of Gazzang


Combines Apache Sentry and Intel’s Project Rhino with Gazzang’s Encryption and Key Management to Build the Industry’s Most Robust End-to-End Security Offering for Hadoop Environments

PALO ALTO, Calif. – June 3, 2014 – Cloudera, a leader in enterprise analytic data management powered by Apache Hadoop™, today announced that it has acquired Gazzang, the big data security experts, to dramatically strengthen its security offerings, building on the roadmap laid out last year when Cloudera first delivered Sentry. Terms of the deal were not disclosed.

The addition will immediately deliver enterprise-grade data encryption and key management, addressing head on the challenges associated with securing and processing sensitive and legally protected data within the Hadoop ecosystem. Thus fulfilling a requirement in myriad compliance regulations like HIPAA-HITECH, PCI-DSS, FERPA and the EU Data Protection Directive.

While Cloudera customers will continue to have a choice of a broad range of cross-platform data protection methods available from Cloudera partners, Cloudera now offers encryption for all data-at-rest stored inside the Hadoop cluster – using an approach that is transparent to applications using the data, thereby minimizing the costs associated with enabling encryption.

Cloudera plans to focus the efforts of the Gazzang team on additional security challenges in Hadoop. The team will become the heart of the Cloudera Center for Security Excellence focusing exclusively on Hadoop security. The Center will focus on:

    • Comprehensive data and cluster security technologies - including “follow the data” authorization and encryption policies riding on Cloudera’s data lineage tracking capabilities.
    • Security testing and certification - including continuous vulnerability assessment, performance optimization, and developing regulatory compliance playbooks.
    • Security ecosystem partner enablement - developing security integration APIs and certifying partner products.

In addition to immediately providing a transparent data-at-rest encryption and key management solution to enterprise customers – addressing one of the biggest gaps in Hadoop security – Cloudera, Intel and Gazzang form a powerful team of big data security and silicon performance optimization expertise that will improve security in core Hadoop through the open source community.

Cloudera is continuing to invest broadly in the open source community to support and accelerate security features into project Rhino—an open source effort founded by Intel in early 2013. Project Rhino is a broad based open source security architecture addressing many of the major pillars of enterprise security including: perimeter security, entitlements and access control and data protection.

“Data security is no longer a checkbox for IT organizations or operations departments, it has become a top business priority,” said Tom Reilly, chief executive officer, Cloudera. “At the same time compliance requirements for protecting data continue to expand in scope where data access comes under scrutiny. We’re entering a whole new era with the rise of the Industrial Internet and the Internet of Things where there is vastly more data being streamed from billions of devices. Centralizing and accessing that net-new data to unlock its value is therefore a challenge when you consider the security requirements. That’s what we’re solving now.”

Simplifying the process of injecting core security features such as encryption and key management into highly scalable environments will enable customers to move beyond test and development workloads to real-world implementations much more quickly and easily. For example, companies that are weighing the value of putting workloads in public cloud environments against security concerns will now be able to move forward by putting in place additional process-based access controls. This limits access to encrypted data only to authorized system functions – rather than specific users or roles – so a cloud administrator, who likely does not need access to the sensitive encrypted data, cannot run commands that grant them access. This is critical for compliance initiatives that require organizations to restrict data access based on “business need to know.”

“Enterprises are adopting big data solutions, despite what some mainstream press has stated, but only when they can address data security and compliance requirements. That Cloudera can now address the enterprise’s most critical security requirement — data encryption — directly into the platform is a big win for security-sensitive customers,” said Adrian Lane of the analyst firm Securosis. “What’s more, Gazzang’s transparent form of encryption scales right along with NoSQL clusters, so Cloudera customers get data security at big data scale. This is an astute acquisition by Cloudera.”

Today a rapidly growing number of large enterprises are building enterprise data hubs built on Hadoop to address a wide variety of data challenges and increasingly to work with data in more ways, not only for processing and archiving, but now for self-service BI and advanced analytics. The success of Hadoop has also drawn the attention of big, established players in the market, including most leading enterprise software companies. Many with decades of experience serving large and demanding customers now are building out software and systems that incorporate Hadoop.

Cloudera has driven enterprise capabilities and more power into the Hadoop platform than any other company as evidenced by the incorporation of real- time query with its open source Cloudera Impala; real-time search support with Lucene and Solr; security with Cloudera’s Apache Sentry project; integrated governance, compliance, reporting and disaster recovery—all on to the Hadoop platform.

Cloudera plans to incorporate Gazzang’s technology into its Cloudera Enterprise offering. Existing customers will benefit immediately as the new products become part of the company’s existing offering. Cloudera will provide support for the Gazzang customer base.


About Gazzang

Gazzang provides data security solutions and expertise to help enterprises protect sensitive information and maintain performance in big data and cloud environments. Our technology enables SaaS vendors, health care organizations, financial institutions, public sector agencies and more to meet regulatory compliance initiatives, secure personally identifiable information and prevent unauthorized access to sensitive data and systems. The company is headquartered in Austin, Texas and backed by Austin Ventures and Silver Creek Ventures.

About Cloudera

Cloudera is revolutionizing enterprise data management by offering the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data. Only Cloudera offers everything needed on a journey to an enterprise data hub, including software for business critical data challenges such as storage, access, management, analysis, security and search. As the leading educator of Hadoop professionals, Cloudera has trained over 22,000 individuals worldwide. Over 1,000 partners and a seasoned professional services team help deliver greater time to value. Finally, only Cloudera provides proactive and predictive support to run an enterprise data hub with confidence. Leading organizations in every industry plus top public sector organizations globally run Cloudera in production.www.cloudera.com

Connect with Cloudera

Read our blogs: http://www.cloudera.com/blog/ andhttp://vision.cloudera.com/

Follow us on Twitter:http://twitter.com/cloudera

Visit us on Facebook:http://www.facebook.com/cloudera

Cloudera, Cloudera’s Platform for Big Data, Cloudera Enterprise Data Hub Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Basic Edition and CDH are trademarks or registered trademarks of Cloudera Inc. in the United States, and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley writes on enterprise IT. He is a founder and partner at Cognitio Corp and publsher of CTOvision.com

@BigDataExpo Stories
SYS-CON Events announced today that CDS Global Cloud, an Infrastructure as a Service provider, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. CDS Global Cloud is an IaaS (Infrastructure as a Service) provider specializing in solutions for e-commerce, internet gaming, online education and other internet applications. With a growing number of data centers and network points around the world, ...
Virgil consists of an open-source encryption library, which implements Cryptographic Message Syntax (CMS) and Elliptic Curve Integrated Encryption Scheme (ECIES) (including RSA schema), a Key Management API, and a cloud-based Key Management Service (Virgil Keys). The Virgil Keys Service consists of a public key service and a private key escrow service. 

Developing software for the Internet of Things (IoT) comes with its own set of challenges. Security, privacy, and unified standards are a few key issues. In addition, each IoT product is comprised of (at least) three separate application components: the software embedded in the device, the back-end service, and the mobile application for the end user’s controls. Each component is developed by a different team, using different technologies and practices, and deployed to a different stack/target –...
SYS-CON Events announced today that Tintri Inc., a leading producer of VM-aware storage (VAS) for virtualization and cloud environments, will present at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Tintri VM-aware storage is the simplest for virtualized applications and cloud. Organizations including GE, Toyota, United Healthcare, NASA and 6 of the Fortune 15 have said “No to LUNs.” With Tintri they manag...
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at Cloud Expo, Ed Featherston, a director and senior enterprise architect at Collaborative Consulting, will discuss the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.
Big Data has been changing the world. IoT fuels the further transformation recently. How are Big Data and IoT related? In his session at @BigDataExpo, Tony Shan, a renowned visionary and thought leader, will explore the interplay of Big Data and IoT. He will anatomize Big Data and IoT separately in terms of what, which, why, where, when, who, how and how much. He will then analyze the relationship between IoT and Big Data, specifically the drilldown of how the 4Vs of Big Data (Volume, Variety,...
From wearable activity trackers to fantasy e-sports, data and technology are transforming the way athletes train for the game and fans engage with their teams. In his session at @ThingsExpo, will present key data findings from leading sports organizations San Francisco 49ers, Orlando Magic NBA team. By utilizing data analytics these sports orgs have recognized new revenue streams, doubled its fan base and streamlined costs at its stadiums. John Paul is the CEO and Founder of VenueNext. Prior ...
SYS-CON Events announced today that Embotics, the cloud automation company, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Embotics is the cloud automation company for IT organizations and service providers that need to improve provisioning or enable self-service capabilities. With a relentless focus on delivering a premier user experience and unmatched customer support, Embotics is the fas...
Ask someone to architect an Internet of Things (IoT) solution and you are guaranteed to see a reference to the cloud. This would lead you to believe that IoT requires the cloud to exist. However, there are many IoT use cases where the cloud is not feasible or desirable. In his session at @ThingsExpo, Dave McCarthy, Director of Products at Bsquare Corporation, will discuss the strategies that exist to extend intelligence directly to IoT devices and sensors, freeing them from the constraints of ...
In his general session at 19th Cloud Expo, Manish Dixit, VP of Product and Engineering at Dice, will discuss how Dice leverages data insights and tools to help both tech professionals and recruiters better understand how skills relate to each other and which skills are in high demand using interactive visualizations and salary indicator tools to maximize earning potential. Manish Dixit is VP of Product and Engineering at Dice. As the leader of the Product, Engineering and Data Sciences team a...
The Internet of Things can drive efficiency for airlines and airports. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Sudip Majumder, senior director of development at Oracle, will discuss the technical details of the connected airline baggage and related social media solutions. These IoT applications will enhance travelers' journey experience and drive efficiency for the airlines and the airports. The session will include a working demo and a technical d...
Apache Hadoop is a key technology for gaining business insights from your Big Data, but the penetration into enterprises is shockingly low. In fact, Apache Hadoop and Big Data proponents recognize that this technology has not yet achieved its game-changing business potential. In his session at 19th Cloud Expo, John Mertic, director of program management for ODPi at The Linux Foundation, will explain why this is, how we can work together as an open data community to increase adoption, and the i...
What happens when the different parts of a vehicle become smarter than the vehicle itself? As we move toward the era of smart everything, hundreds of entities in a vehicle that communicate with each other, the vehicle and external systems create a need for identity orchestration so that all entities work as a conglomerate. Much like an orchestra without a conductor, without the ability to secure, control, and connect the link between a vehicle’s head unit, devices, and systems and to manage the ...
What are the successful IoT innovations from emerging markets? What are the unique challenges and opportunities from these markets? How did the constraints in connectivity among others lead to groundbreaking insights? In her session at @ThingsExpo, Carmen Feliciano, a Principal at AMDG, will answer all these questions and share how you can apply IoT best practices and frameworks from the emerging markets to your own business.
More and more brands have jumped on the IoT bandwagon. We have an excess of wearables – activity trackers, smartwatches, smart glasses and sneakers, and more that track seemingly endless datapoints. However, most consumers have no idea what “IoT” means. Creating more wearables that track data shouldn't be the aim of brands; delivering meaningful, tangible relevance to their users should be. We're in a period in which the IoT pendulum is still swinging. Initially, it swung toward "smart for smar...
Security, data privacy, reliability, and regulatory compliance are critical factors when evaluating whether to move business applications from in-house, client-hosted environments to a cloud platform. Quality assurance plays a vital role in ensuring that the appropriate level of risk assessment, verification, and validation takes place to ensure business continuity during the migration to a new cloud platform.
DevOps is speeding towards the IT world like a freight train and the hype around it is deafening. There is no reason to be afraid of this change as it is the natural reaction to the agile movement that revolutionized development just a few years ago. By definition, DevOps is the natural alignment of IT performance to business profitability. The relevance of this has yet to be quantified but it has been suggested that the route to the CEO’s chair will come from the IT leaders that successfully ma...
In past @ThingsExpo presentations, Joseph di Paolantonio has explored how various Internet of Things (IoT) and data management and analytics (DMA) solution spaces will come together as sensor analytics ecosystems. This year, in his session at @ThingsExpo, Joseph di Paolantonio from DataArchon, will be adding the numerous Transportation areas, from autonomous vehicles to “Uber for containers.” While IoT data in any one area of Transportation will have a huge impact in that area, combining sensor...
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, discussed how research has demonstrated the value of Machine Learning in delivering next generation analytics to impr...
The best way to leverage your Cloud Expo presence as a sponsor and exhibitor is to plan your news announcements around our events. The press covering Cloud Expo and @ThingsExpo will have access to these releases and will amplify your news announcements. More than two dozen Cloud companies either set deals at our shows or have announced their mergers and acquisitions at Cloud Expo. Product announcements during our show provide your company with the most reach through our targeted audiences.