EHO/Roadmap
Revision as of 15:47, 11 March 2013
Phase 1 - Basic Cluster Provisioning (completion - early April)
- Cluster provisioning
- Deployment Engine implementation for pre-installed images
- Templates for Hadoop cluster configuration
- REST API for cluster startup and operations
- UI integrated into Horizon
Phase 2 - Cluster Operations (completion - end of June)
- Manual cluster scaling (add/remove nodes)
- Hadoop cluster topology configuration parameters
  - Data node placement control
  - HDFS location
  - Swift integration
- Integration with vendor-specific deployment/management tooling
- Monitoring support: integration with third-party monitoring tools (Zabbix, Nagios)
Phase 3 - Analytics as a Service (completion - end of September)
- API to execute MapReduce jobs without exposing details of the underlying infrastructure (similar to AWS EMR)
- User-friendly UI for ad-hoc analytics queries based on Hive or Pig
Further Roadmap (completion - TBD)
- HDFS and Swift integration
  - Caching of Swift data on HDFS
  - Avoiding issues with Swift's eventual consistency while a job is running
- HBase support