Mcdd-Big-Data-Study

A study project for big data technologies (Hadoop, ZooKeeper, Kafka, Flink, Spark).

Features ✨

Supported Technologies:

Hadoop 3.3.6 (with JDK 8.0.352-zulu, Maven 3.6.3)

ZooKeeper 3.9.2

Kafka 3.7.1 (Scala 2.12 build)

Installation 📦

Clone the repository:

```
git clone https://github.com/mcddhub/mcdd-big-data-study.git --depth=1 && cd mcdd-big-data-study
```

Build the Docker image:

```
cd docker
docker build -t caobaoqi1029/big-data-study:x.x.x .
```

Note: Replace x.x.x with the appropriate version number.
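As a minimal sketch of filling in the version, the helper below builds the image tag from a version string and rejects anything that is not three dot-separated numbers. The function name `image_tag` is hypothetical, and the three-number format is an assumption based on the `x.x.x` placeholder above.

```shell
# Hypothetical helper: print the image tag for a given version.
# Assumes versions look like MAJOR.MINOR.PATCH (e.g. 1.0.0).
image_tag() {
  version="$1"
  if ! printf '%s\n' "$version" | grep -Eq '^[0-9]+\.[0-9]+\.[0-9]+$'; then
    echo "invalid version: $version" >&2
    return 1
  fi
  printf 'caobaoqi1029/big-data-study:%s\n' "$version"
}
```

It could then be used as `docker build -t "$(image_tag 1.0.0)" .`, where `1.0.0` is only an example value.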

Start the containers:
```
docker compose up -d
```
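Before moving on, it can help to confirm that the containers actually came up. The sketch below checks whether a container with a given name is running; `container_running` is a hypothetical helper, and the name `master` is assumed from the `docker exec` step later in this README.

```shell
# Hypothetical helper: succeed only if a container with exactly this name is running.
# `docker ps --filter name=...` matches substrings, so grep -x enforces an exact match.
container_running() {
  docker ps --filter "name=$1" --format '{{.Names}}' | grep -qx "$1"
}
```

For example, `container_running master && echo "master is up"`.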

Configuration 🛠

Connect to the remote server via VS Code and attach to a running container.

Install the Java Dev extension in VS Code.

Restart the extension host to apply changes.

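Initialize the Hadoop environment. Formatting the NameNode erases any existing HDFS metadata, so do this only on first setup:

```
docker exec -it master bash
hdfs namenode -format
```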

Start Hadoop services:
```
start-all.sh
```
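One way to check that the services started is to inspect the output of `jps`, which lists running Java processes as `PID Name`. The sketch below reports which expected daemons are absent from that listing; `missing_daemons` is a hypothetical helper, and which daemons should run on which container depends on the cluster layout, so the names passed in are assumptions.

```shell
# Hypothetical helper: read `jps` output on stdin and print every expected
# daemon name (given as arguments) that does not appear in it.
missing_daemons() {
  listing="$(cat)"
  for name in "$@"; do
    # Compare the second column exactly, so "NameNode" does not match "SecondaryNameNode".
    printf '%s\n' "$listing" | awk '{print $2}' | grep -qx "$name" || printf '%s\n' "$name"
  done
}
```

For example, `jps | missing_daemons NameNode ResourceManager` prints nothing when both daemons are running.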

Use the following commands to interact with Hadoop:

```
vim input.txt
hdfs dfs -put -f ./input.txt /
hdfs dfs -ls /
```
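If you prefer not to open an editor, the input file can also be created non-interactively. This is a sketch only: `make_sample_input` is a hypothetical helper and the file contents are made up for illustration, since the README does not say what input the job expects.

```shell
# Hypothetical helper: write a small sample file (contents are illustrative only).
make_sample_input() {
  out="${1:-input.txt}"
  printf '%s\n' 'hello hadoop' 'hello kafka' 'hello zookeeper' > "$out"
}

make_sample_input ./input.txt

# Upload only where the hdfs CLI is available, i.e. inside the master container.
if command -v hdfs >/dev/null 2>&1; then
  hdfs dfs -put -f ./input.txt /
  hdfs dfs -ls /
fi
```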

Build and run the Hadoop job:

```
mvn clean package
cd target/
hadoop jar big-data.jar
```
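When a MapReduce job writes through Hadoop's standard file output format, it refuses to start if the output directory already exists, so a second run usually needs the old output removed first. The sketch below does that before re-running the job. `rerun_job` is a hypothetical helper, and it assumes the job writes to `/output`, as the "View the output" step suggests.

```shell
# Hypothetical helper: delete the previous output directory, then run the job again.
# Assumes the job's output path is /output unless another path is given.
rerun_job() {
  jar="${1:-big-data.jar}"
  out="${2:-/output}"
  # -f keeps the removal from failing when the directory does not exist yet.
  hdfs dfs -rm -r -f "$out" && hadoop jar "$jar"
}
```

From the `target/` directory this could be called as `rerun_job big-data.jar /output`.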

Tip: To run Java classes directly, add their directory (here /tmp/) to CLASSPATH. Add the line to .bashrc to make it persistent:

```
export CLASSPATH=$CLASSPATH:/tmp/
```
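To persist the setting without duplicating it each time the setup is repeated, a line can be appended only if it is not already present. This is a minimal sketch; `append_once` is a hypothetical helper, not part of the project.

```shell
# Hypothetical helper: append a line to a file unless an identical line already exists.
append_once() {
  line="$1"
  file="$2"
  touch "$file"
  # -x matches whole lines, -F treats the text literally (no regex surprises from "$").
  grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}
```

For example, `append_once 'export CLASSPATH=$CLASSPATH:/tmp/' "$HOME/.bashrc"`; the single quotes keep `$CLASSPATH` unexpanded so it is evaluated at login.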

View the output:

```
hdfs dfs -ls /output
hdfs dfs -cat /output/part-r-00000
```
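A job with more than one reducer writes several `part-r-*` files, and by default a successful job also leaves a `_SUCCESS` marker in the output directory. The sketch below checks for the marker and then prints every part file. `show_job_output` is a hypothetical helper, and it assumes the job uses the default output committer.

```shell
# Hypothetical helper: print all reducer outputs once the job has finished successfully.
show_job_output() {
  dir="${1:-/output}"
  if ! hdfs dfs -test -e "$dir/_SUCCESS"; then
    echo "no _SUCCESS marker in $dir; the job may have failed or still be running" >&2
    return 1
  fi
  # The glob is quoted so that HDFS, not the local shell, expands it.
  hdfs dfs -cat "$dir/part-r-*"
}
```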

Contributing 🤝

We welcome contributions! Feel free to submit a pull request. For more details, see the Contribution Guide (CONTRIBUTING.md).

Thanks to all contributors.

License 📄

This project is licensed under the MIT License. See the LICENSE file for details.

Support 💖

If you find this project helpful, consider giving it a ⭐️ on GitHub!
