On Thursday, HP announced an agreement to invest $50M in Hortonworks. HP’s investment builds on the $100M Hortonworks raised in March in a financing round led by funds managed by BlackRock and Passport Capital, with participation from existing investors. The investment illustrates HP’s commitment to its reseller relationship with Hortonworks, which allows HP to resell the Hortonworks Data Platform. Moreover, HP plans to continue refining the engineering of its products so that they integrate with YARN, the resource management component of Hadoop 2.x. Beyond preparing its products to operate with YARN, HP will adapt its product architecture to perform optimally with the Hortonworks Data Platform more generally. Key HP products targeted for integration with the Hortonworks Data Platform include the HP HAVEn platform, one component of which is HP Vertica. As a result of the $50M equity investment, HP’s Executive Vice President and Chief Technology Officer Martin Fink will join the Hortonworks board of directors. HP’s investment underscores how the Big Data revolution is poised to accelerate as technology companies deepen their relationships with Hadoop vendors in anticipation of delivering turnkey big data analytics solutions that simplify and streamline the operationalization of Big Data.
On Thursday, Hortonworks announced that Apache Spark is “YARN Ready,” meaning that YARN can accommodate the multiple workloads and additional CPU demands specific to Spark applications. As a result, Hadoop users can now use one Hadoop cluster with a single repository of data for a variety of purposes, rather than having to segment workloads such that some data is dedicated to Apache Spark. More specifically, Hadoop users can rest assured that YARN-based applications will run alongside applications that leverage Spark’s capabilities for real-time analytics, interactive analytics, machine learning and stream processing. Hortonworks introduced Apache Spark to the Hortonworks Data Platform as a technology preview download in May. Thursday’s announcement covers the integration of Spark with YARN, with XA Secure (Hortonworks’s recent acquisition) for authentication and data security, and with Ambari, all toward the larger goal of delivering an integrated, turnkey, enterprise-grade Hadoop platform. The announcement responds to similar statements by competitors: MapR’s announcement regarding the integration of Spark into its Hadoop distribution, and Cloudera’s announcement of enterprise-grade support for Apache Spark.
The following graphic, which illustrates the integration of Spark into YARN, originated from the Hortonworks blog post “Making Apache Spark YARN Ready.”