Databricks-Conducted Survey Reveals That Spark is Gaining Traction in All Areas -- From Democratizing Data Internally to Driving Real Business Value for Enterprise Users
SAN FRANCISCO, CA--(Marketwired - Sep 24, 2015) - Databricks, the company founded by the creators of Apache Spark, today released the findings of a survey of more than 1,400 respondents from the Spark community to identify how organizations and users are utilizing the data analytics and processing engine. The 2015 Spark User Survey results show that standalone deployments of Spark now outnumber those on YARN as more users run Spark independently of Hadoop: 48 percent of respondents run Spark in standalone mode, compared with 40 percent who run it on YARN. The survey also found that 51 percent of respondents run Spark on a public cloud.
With more than 600 contributors in the last 12 months (up from 315 contributors the 12 months prior), Spark is the most active open source project in Big Data. Additionally, more than 200 organizations contribute code to Spark, making it one of the largest communities of engaged developers to date.
Key findings from the survey include:
"The continued growth of Spark has been highly encouraging, as companies are going into production to obtain real business value, and they are doing so in a wide range of environments beyond Hadoop clusters," said Matei Zaharia, creator of Apache Spark and CTO of Databricks. "Databricks and our partners are 100 percent committed to the long-term growth of Spark and we'll continue to make improvements based on this survey data and our ongoing community feedback, to make the most complete big data analytics toolkit accessible to all businesses."
"The enthusiasm for big data is matched only by the pace of innovation. Many organizations are shifting to a 'Spark-first' strategy, recognizing its advantages of analytics versatility, development familiarity, superior performance, range of data sources supported, and deployment flexibility. The market will no doubt continue to evolve, but Spark has established considerable momentum today," said Nik Rouda, Senior Analyst at Enterprise Strategy Group.
About the Survey
In 2014, Spark set the world record in large-scale sorting and saw major improvements across the entire engine. This year, the Spark community was surveyed to find out who Spark's users are, what they're building, and how they're using Spark to do it. The results reflect the answers and opinions of 1,417 respondents representing 842 organizations.
Additional Resources:
Company Overview
Databricks' vision is to dramatically simplify big data processing. It was founded by the team that created and continues to drive Apache Spark, a powerful open source data processing engine built for sophisticated analytics, ease of use, and speed. Databricks offers a cloud-based integrated workspace for big data that lets users go from data ingest to visual exploration and production jobs, making it easy to turn data into value without the hassle of managing complex infrastructure, systems and tools. Databricks is venture-backed by Andreessen Horowitz and NEA. For more information, contact info@databricks.com.
Contact Information:
Media Contact:
Suzanne Block
databricks@merrittgrp.com
617-824-0981