Think Big expands capabilities for building data lakes with Apache Spark

Teradata (NYSE: TDC) said that Think Big is expanding its data lake and managed service offerings using Apache Spark.

Spark is an open source cluster computing platform used for product recommendations, predictive analytics, sensor data analysis, graph analytics and more.

Customers can use a data lake with Apache Spark in the cloud, on general “commodity built” Hadoop environments, or with Teradata´s Hadoop Appliance, the most powerful, ready-to-run enterprise platform, preconfigured and optimized to run enterprise-class big data workloads.

While interest in Spark continues to increase, many companies struggle to keep up with the rapid pace of change and frequency of releases of the open source platform. Think Big has successfully incorporated Spark in its frameworks for building enterprise-quality data lakes and analytical applications.

Think Big is building replicable service packages for Spark deployment including adding Spark as an execution engine for its Data Lake and Managed Services offers. Through its training branch–Think Big Academy–the consultancy is also launching a series of new Spark training offers for corporate clients. Led by experienced instructors, these classes help train managers, developers, and administrators on using Spark and its various modules including machine learning, graph, streaming and query.

Teradata helps companies get more value from data than any other company. Teradata´s leading portfolio of big data analytic solutions, integrated marketing applications, and services can help organizations gain a sustainable competitive advantage with data.