Greenplum announces Analytics Workbench
1,000 node cluster will act as a lab environment for accelerating big data innovation
EMC has announced that its Greenplum Analytics Workbench - a 1,000-node cluster designed to act as a lab environment for accelerating the pace of Big Data innovation - is now live.
The Greenplum Analytics Workbench will primarily act as an environment for running scale validation of the Apache Hadoop code base. Greenplum is working with the Apache Software Foundation to ensure that results from the Analytics Workbench are available to the open source community in an effort to leverage the resources of the Workbench to further accelerate the development of Hadoop as a revolutionary technology for Big Data.
"We're thrilled to announce that the Greenplum Analytics Workbench is now live. With more companies implementing Big Data analytics than ever before, Hadoop-based batch processing of data at massive scale, with continuous testing, is a key component to driving even better, faster data analytics," said Scott Yara, senior vice president of products and co-founder, Greenplum.
Technology from some of the world's leading software and hardware manufacturers is providing the infrastructure for the Greenplum Analytics Workbench.
According to Greenplum, it will use the Analytics Workbench to test the limits of scale-out infrastructure technology and also to explore the models for applying Big Data analytics. Whether that involves working with academic institutions on data-intensive research studies, or collaborating with big data application developers, Greenplum has plans to provide the most innovative thinkers in the data space with access to the Analytics Workbench.
The 1,000-node cluster will also be made available to members of Greenplum's training and certification classes for Hadoop. With the first publicly available courses launching this summer, Greenplum will offer a set of comprehensive Hadoop training programmes designed to provide participants with the knowledge and programming skills required to leverage Hadoop. A unique aspect of Greenplum's Hadoop training programme is that any individual who successfully completes the course will be granted access to the 1,000-node cluster to use as a sandbox environment.
The Greenplum Analytics Workbench is the result of several hardware and software companies coming together to collectively facilitate the development of Apache Hadoop as an important tool for Big Data analytics, including: EMC, Intel, Mellanox Technologies, Micron, Seagate, SuperMicro, Switch and VMware.
In addition to 1,000-plus hardware nodes (or 10,000 nodes with the addition of virtual machines), the test bed cluster consists of 24 petabytes of physical storage. This is the equivalent of nearly half of the entire written works of mankind, from the beginning of recorded history.