## Big Data and Cloud

  1. nova
    OpenStack Nova provides a cloud computing fabric controller, supporting a wide variety of virtualization technologies, including KVM, Xen, LXC, VMware, and more.
    Project Source: https://github.com/openstack/nova
    Project Homepage: http://www.openstack.org/

  2. boto
    Boto is a Python package that provides interfaces to Amazon Web Services.
    Project Source: https://github.com/boto/boto
    Project Documentation: http://docs.pythonboto.org/en/latest/

  3. aws-cli
    Universal command-line interface for Amazon Web Services.
    Project Source: https://github.com/aws/aws-cli

  4. mrjob
    Run MapReduce jobs on Hadoop or Amazon Web Services.
    Project Source: https://github.com/Yelp/mrjob
    Project Documentation: https://pythonhosted.org/mrjob/

  5. s3cmd
    Command-line tool for managing Amazon S3 and CloudFront services.
    Project Source: https://github.com/s3tools/s3cmd
    Project Homepage: http://s3tools.org/s3cmd

  6. dumbo
    Python module that allows one to easily write and run Hadoop programs.
    Project Source: https://github.com/klbostee/dumbo
    Project Homepage: http://klbostee.github.io/dumbo/

  7. dpark
    DPark is a Python clone of Spark, a MapReduce-like computing framework that supports iterative computation.
    Project Source: https://github.com/douban/dpark
    Project Documentation (Chinese): https://github.com/jackfengji/test_pro/wiki

  8. Hue
    Hue is an open-source web UI for working with big data on Hadoop.
    Project Source: https://github.com/cloudera/hue
    Project Homepage: http://gethue.com/

  9. falcon
    Falcon is a high-performance Python framework for building cloud APIs and web app backends.
    Project Source: https://github.com/racker/falcon
    Project Homepage: http://falconframework.org/

  10. gae-boilerplate
    Google App Engine Boilerplate gets your project off the ground quickly using the Google App Engine platform.
    Project Source: https://github.com/coto/gae-boilerplate
    Project Homepage: http://appengine.beecoss.com/

  11. goagent
    A proxy that runs on Google App Engine (GAE).
    Project Source: https://github.com/goagent/goagent
    Project Homepage: https://code.google.com/p/goagent/

  12. libcloud
    Apache Libcloud is a Python library which hides differences between different cloud provider APIs and allows you to manage different cloud resources through a unified and easy to use API.
    Project Source: https://github.com/apache/libcloud
    Project Documentation: http://libcloud.apache.org/

  13. Fabulous
    Deploy Django apps to Amazon EC2 with one command.
    Project Source: https://github.com/gcollazo/Fabulous
    Project Homepage: http://elweb.co/

  14. pyes
    pyes is a Pythonic client for Elasticsearch, in development since 2010.
    Project Source: https://github.com/aparo/pyes
    Project Documentation: https://pypi.python.org/pypi/pyes/
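
Several of the projects above (mrjob, dumbo, dpark) are built around the MapReduce model. As a rough illustration of that model only, here is a minimal word-count sketch in plain Python. It uses none of the listed libraries, and the names `mapper`, `reducer`, and `run_job` are hypothetical, chosen for this example rather than taken from any project's API.

```python
from collections import defaultdict


def mapper(line):
    """Map step: emit a (word, 1) pair for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def reducer(word, counts):
    """Reduce step: sum all counts collected for one word."""
    return word, sum(counts)


def run_job(lines):
    """Run map, shuffle (group by key), and reduce in a single process."""
    groups = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            groups[key].append(value)  # shuffle: group values by key
    # Reduce each group; keys are sorted for a deterministic result.
    return dict(reducer(key, values) for key, values in sorted(groups.items()))


if __name__ == "__main__":
    print(run_job(["big data and cloud", "cloud data"]))
```

A real framework distributes the map and reduce steps across many machines; this sketch keeps everything in memory to show only the data flow.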