A repository to explain the Hadoop eco system like HDFS, HIVE, HBASE, IMPALA, PIG, OOZIE. It is a new repository, so it will take some time to upload all the related content.
To practice you can setup cloudera virtual machine on your system. It is a big setup so make sure you have enough RAM & space. you can get all the details at the give location:
https://github.com/martandsingh/BigDataAnalytics/tree/main/ClouderaSetup
- Cloudera Management Service
- ZooKeeper
- HDFS
- Solr
- Flume
- HBase
- Key-Value Store Indexer
- MapReduce or YARN
- Hive
- Impala
- Oozie
- Sqoop
- Hue