What is HDP and CDH?

Cloudera data platform

In short, users who were using the HDP (Data Platform) will benefit from services such as Cloudera Manager, Impala, Hue and Kudu, and those who were using the CDH (Data Hub) will benefit from using Hive LLAP, Hive on Tez, Atlas 2.0, Ranger, Nifi and Knox.

In the new Cloudera platform the first product in which a number of tools have been integrated that we heard about is Cloudera Data Flow (CDF), an evolution of HDF that includes NiFi, Edge Flow Manager, Kafka Streams, Flink, etc. It is the platform for real-time data processing and analysis to generate automatic actions that deliver value.

The second product offered is for workloads that do not require real-time data processing tools but structured and unstructured storage called Cloudera Data Warehouse (CDW) that will feature:

Cloudera cdp

Early 2019 saw the completion of Cloudera’s purchase of Hortonworks and thus the start of a new generation of enterprise; the first to offer the right tools for advanced big data analytics in hybrid clouds.

The first release of CDP will provide a combined feature set for new users, and will continue with a second release that will support upgrades to existing HDP and CDH applications, said Aron Murthy, who led engineering at Hortonworks and is now Cloudera’s product manager.

The new platform will include a unified section of software to manage security and data governance, but it was not specified whether it will be based on a combination of the different technologies currently offered by Cloudera and Horton or a combination of the two.

Read more  What are the features of Maya?

In addition Reilly said HDP will integrate with Cloudera Data Science Workbench (CDSW), a collaboration and workflow management platform for teams of data scientists or analysts. He did not say whether Cloudera will also offer IBM rival Data Science Experience workbench software, which Hortonworks has resold since mid-2017.

Cloudera (cdh)

Every day massive data sets originate from different organizations or companies, and whose management turns out to be very complex. This is the reason why Apache Hadoop was born, which gives us the facility for distributed storage and subsequent processing of large data sets.

Although there are several similarities between Cloudera and Hortonworks, both have their own strengths and weaknesses. So, when choosing the right distribution for your business, it is important to consider the added value that each can offer.

Organizations or companies should analyze the performance, scalability, manageability, reliability and data access for both options, taking into account both short-term and long-term goals.

Hortonworks vs cloudera

As expected, the new merger of Cloudera and Hortonworks will operate under the Cloudera brand, and its goal is to begin moving customers to a new, unified Cloudera data platform, while committing to hybrid and multi-cloud deployments and remaining “100% open source.”

It was not known at the time what the new company would be called, but it is now known that it will be called Cloudera, with the Hortonworks brand being scrapped. This reflects what we wrote at the time, “Cloudera is the alpha dog in this negotiation, with Cloudera shareholders owning approximately 60% of the combined company’s equity and Cloudera CEO Tom Reilly leading the new joint venture, with Bearden joining the board.”

Read more  What is Oracle utility analytics?

Reilly also announced that “true to our heritage, the new platform will be 100% open source…with unified enterprise governance and security,” as well as a focus on “new open source standards such as Kubernetes and containers, which will influence our strategy for the future.”