Jump to: navigation, search

Difference between revisions of "EHO/Roadmap"

< EHO
(Created page with "==== Phase 1 - Basic Cluster Provisioning (completion - April 4, with the OpenStack Grizzly release) ==== * Cluster provisioning * Deployment Engine implementation for pre-in...")
 
 
(2 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==== Phase 1 - Basic Cluster Provisioning (completion - April 4, with the OpenStack Grizzly release) ====
+
'''<big><big>The new name of the Elastic Hadoop on OpenStack project is Savanna. This pages are obsolete and latest info is located at https://wiki.openstack.org/wiki/Savanna</big></big>'''
 +
 
 +
==== Phase 1 - Basic Cluster Provisioning (completion - early April) ====
  
 
* Cluster provisioning
 
* Cluster provisioning
Line 8: Line 10:
  
  
==== Phase 2 - Cluster Operations (completion - end of June, with the intermediate OpenStack Grizzly release) ====
+
==== Phase 2 - Cluster Operations (completion - end of June) ====
  
 
* Manual cluster scaling (add/remove nodes)
 
* Manual cluster scaling (add/remove nodes)
Line 19: Line 21:
  
  
==== Phase 3 - Analytics as a Service (completion - end of September, with the OpenStack Havana release) ====
+
==== Phase 3 - Analytics as a Service (completion - end of September) ====
  
 
* API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
 
* API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
Line 25: Line 27:
  
  
==== Further Roadmap (completion - TBD, with the next OpenStack release) ====
+
==== Further Roadmap (completion - TBD) ====
  
 
* HDFS and Swift integration
 
* HDFS and Swift integration
Line 31: Line 33:
 
** Avoid issues with Swift eventual consistency while running job
 
** Avoid issues with Swift eventual consistency while running job
 
* HBase support
 
* HBase support
 +
 +
'''<big><big>The new name of the Elastic Hadoop on OpenStack project is Savanna. This pages are obsolete and latest info is located at https://wiki.openstack.org/wiki/Savanna</big></big>'''

Latest revision as of 15:02, 22 March 2013

The new name of the Elastic Hadoop on OpenStack project is Savanna. This pages are obsolete and latest info is located at https://wiki.openstack.org/wiki/Savanna

Phase 1 - Basic Cluster Provisioning (completion - early April)

  • Cluster provisioning
  • Deployment Engine implementation for pre-installed images
  • Templates for Hadoop cluster configuration
  • REST API for cluster startup and operations
  • UI integrated into Horizon


Phase 2 - Cluster Operations (completion - end of June)

  • Manual cluster scaling (add/remove nodes)
  • Hadoop cluster topology configuration parameters
    • Data node placement control
    • HDFS location
    • Swift integration
  • Integration with vendor specific deployment/management tooling
  • Monitoring support - integration with 3rd-party monitoring tools (Zabbix, Nagios)


Phase 3 - Analytics as a Service (completion - end of September)

  • API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
  • User-friendly UI for ad-hoc analytics queries based on Hive or Pig


Further Roadmap (completion - TBD)

  • HDFS and Swift integration
    • Caching of Swift data on HDFS
    • Avoid issues with Swift eventual consistency while running job
  • HBase support

The new name of the Elastic Hadoop on OpenStack project is Savanna. This pages are obsolete and latest info is located at https://wiki.openstack.org/wiki/Savanna