ansible playbook to deploy cloudera hadoop components to the cluster
-
Updated
Sep 8, 2018 - Shell
ansible playbook to deploy cloudera hadoop components to the cluster
Docker image for Cloudera Hadoop components (CDH5)
A quick and dirty CDH cluster skeleton using Docker for Testing
Getting Started with Hadoop and Big Data
💂♂️ Hadoop/MapReduce Streaming
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
Otto-von-Guericke Universität Magdeburg - Big Data SoSe 2017
This is my final project for Data Engineer Expert course at Naya College.
This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.
The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of …
chatbot for hipchat (cloud or onpremise) that enables you to talk to your cloudera manager
This repository contains the TF-IDF score calculation for the documents in the Canterbury dataset for a user given search query
This repository includes two versions of hadoop management tools
Navigator is a data service that prepares the content for travel agencies, ready for exploration in EWNS (East-West-North-South) direction and hence allows them to render content to the end-user based on their desire to travel.
GCP hosted product for over 1 million movie investors on HSX.com, aiding online movie trading and box-office investments by leveraging Big Data technologies like Hive and Hadoop, and Tableau dashboards
Add a description, image, and links to the cloudera-hadoop topic page so that developers can more easily learn about it.
To associate your repository with the cloudera-hadoop topic, visit your repo's landing page and select "manage topics."