Sahara/Roadmap
< Sahara
Revision as of 07:38, 19 June 2013 by Sergey Lukjanov (talk | contribs) (→Phase 2 - Cluster Operations (July, 15 - work in progress))
Contents
Phase 1 - Basic Cluster Provisioning (April, 10 - released)
- Cluster provisioning
- Deployment Engine implementation for pre-installed images
- Templates for Hadoop cluster configuration
- REST API for cluster startup and operations
- UI integrated into Horizon
Phase 2 - Cluster Operations (July, 15 - work in progress)
- Manual cluster scaling (add/remove nodes)
- Hadoop cluster topology configuration parameters
- Data node placement control
- HDFS location
- Swift integration
- Plugin mechanism for integration with different Hadoop distributions
- Plugins implementation:
- Vanilla Apache Hadoop with pre-build image
- Hortonworks Data Platform using Ambari
- Monitoring support - integration with 3rd-party monitoring tools (Zabbix, Nagios)
Phase 3 - Analytics as a Service (October, 15 - planned)
- API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
- User-friendly UI for ad-hoc analytics queries based on Hive or Pig
- Network configuration support, integration with Quantum
Further Roadmap (completion - TBD)
- HDFS and Swift integration
- Caching of Swift data on HDFS
- Avoid issues with Swift eventual consistency while running job
- HBase support