This repository provides a general exercise for learning bigdata.
Learning code and related resource of hadoop.
Learning code and related resource of sparkcore, sparksql and so on.
Example code for pypark.
example code for flink.
mockdata for user action log data and bussiness data.
more details refer to readme in mockdata folder.
- configuration of client and bigdata runtime
- hadoop related resources
- setup kafka cluster
- configuration for spark