Jump to: navigation, search

Difference between revisions of "Sahara/Roadmap"

(Phase 3 - Analytics as a Service (September, 15 - planned))
(Phase 3 - Analytics as a Service (October, 15 - planned))
Line 21: Line 21:
 
* API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
 
* API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
 
* User-friendly UI for ad-hoc analytics queries based on Hive or Pig
 
* User-friendly UI for ad-hoc analytics queries based on Hive or Pig
 +
* Network configuration support, integration with Quantum
  
 
==== Further Roadmap (completion - TBD) ====
 
==== Further Roadmap (completion - TBD) ====

Revision as of 08:55, 18 June 2013

Phase 1 - Basic Cluster Provisioning (April, 10 - released)

  • Cluster provisioning
  • Deployment Engine implementation for pre-installed images
  • Templates for Hadoop cluster configuration
  • REST API for cluster startup and operations
  • UI integrated into Horizon

Phase 2 - Cluster Operations (July, 15 - work in progress)

  • Manual cluster scaling (add/remove nodes)
  • Hadoop cluster topology configuration parameters
    • Data node placement control
    • HDFS location
    • Swift integration
  • Integration with vendor specific deployment/management tooling
  • Monitoring support - integration with 3rd-party monitoring tools (Zabbix, Nagios)

Phase 3 - Analytics as a Service (October, 15 - planned)

  • API to execute Map/Reduce jobs without exposing details of underlying infrastructure (similar to AWS EMR)
  • User-friendly UI for ad-hoc analytics queries based on Hive or Pig
  • Network configuration support, integration with Quantum

Further Roadmap (completion - TBD)

  • HDFS and Swift integration
    • Caching of Swift data on HDFS
    • Avoid issues with Swift eventual consistency while running job
  • HBase support