Jump to: navigation, search

Difference between revisions of "Sahara/SparkPlugin"

m (Introduction)
m (Status)
Line 20: Line 20:
 
== Status ==
 
== Status ==
  
Development is done by: Do Huy-Hoang and Vo Thanh Phuc (Master students at Eurecom), Daniele Venzano (Research Engineer at Eurecom), under the supervision of Prof. Pietro Michiardi (at eurecom). This work is partially supported by the BigFoot project, a EC-funded research project.
+
Development is done by Daniele Venzano (Research Engineer at Eurecom) and Pietro Michiardi (Prof. at Eurecom). A preliminary version of the plugin was developed with the additional help of two Master students at Eurecom, Do Huy-Hoang and Vo Thanh Phuc.
 +
This work is partially supported by the BigFoot project, a EC-funded research project with grant agreement n. 317858.
  
 
== Related Resources ==
 
== Related Resources ==
 
* [[Sahara/PluggableProvisioning/PluginAPI]]
 
* [[Sahara/PluggableProvisioning/PluginAPI]]
 
* [https://blueprints.launchpad.net/sahara/+spec/spark-plugin Blueprint]
 
* [https://blueprints.launchpad.net/sahara/+spec/spark-plugin Blueprint]

Revision as of 19:47, 26 May 2014

Introduction

Spark is a fast and general engine for large-scale data processing.
This blueprint proposes a Sahara provisioning plugin for Spark that can launch and resize Spark clusters and run EDP jobs.

From the Sahara perspective, in the first iteration no support for scaling and EDP will be available, but those features are planned and will be integrated later.

We are currently testing a more general plugin to support Shark, one of the Spark related projects. Shark is an open source distributed SQL query engine for Hadoop data. It brings state-of-the-art performance and advanced analytics to Hive users.

Requirements

Spark version 0.9.1 is supported, in standalone mode. There will be no no support for Mesos or YARN modes for now.

At first this plugin will support only Cloudera CDH HDFS as data layer. A DIB element is provided as well to generate disk images compatible with this plugin.

Documentation

Status

Development is done by Daniele Venzano (Research Engineer at Eurecom) and Pietro Michiardi (Prof. at Eurecom). A preliminary version of the plugin was developed with the additional help of two Master students at Eurecom, Do Huy-Hoang and Vo Thanh Phuc. This work is partially supported by the BigFoot project, a EC-funded research project with grant agreement n. 317858.

Related Resources