This is the final project of the Java course in the ITI-AI pro scholarship.
The data set used in this project was provided by Wuzzuf jobs in Egypt data set at Kaggle : https://www.kaggle.com/omarhanyy/wuzzuf-jobs
The project purpose was to build all java needed classes (POJO , DAO) and Make a web service to get the following:
- Read data set and convert it to dataframe or Spark RDD and display some from it.
- Display structure and summary of the data.
- Clean the data (null, duplications)
- Count the jobs for each company and display that in order (What are the most demanding companies for jobs?)
- Show step 4 in a pie chart
- Find out What are it the most popular job titles?
- Show step 6 in bar chart
- Find out the most popular areas?
- Show step 8 in bar chart
- Print skills one by one and how many each repeated and order the output to find out the most important skills required?
- Factorize the YearsExp feature and convert it to numbers in new col.\
There is a file ( Both were implemented on NetBeans IDEA Ultimate)