Skip to content

gokcengiz/Data-Analysis-SQL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Udemy Courses Data Analysis

alt text

The project relies on the Udemy Courses Data 2023 provided by Kaggle. It answers five simple questions to show my proficiency in SQL for data analysis.

  • How many instructors and courses are there in total?
  • What are the 10 most popular courses?
  • What are the 10 most popular instructors?
  • What are the most reviewed instructor's courses?
  • What are the 5 most popular data analysis courses?

About Dataset

The dataset contains 2 csv files

  • courses.csv: contains the information about the courses
  • instructors.csv: contains the information about the instructors

The dataset contains 10 columns in courses.csv

-id: unique id of the course [int]

-title: title of the course [string]

-url: url of the course [string]

-rating: rating of the course [float]

-num_reviews: number of reviews of the course [int]

-num_published_lectures: number of lectures in the course [int]

-created: date of creation of the course [string]

-last_update_date: date of last update of the course [string]

-duration: duration of the course [string]

-instructors_id: id of the instructor of the course [int]

The dataset contains 10 columns in instructors.csv

_class: class of the instructor [string]

-id: unique id of the instructor [int]

-title: title of the instructor [string]

-name: name of the instructor [string]

-display_name: display name of the instructor [string]

-job_title: job title of the instructor [string]

-image_50x50: image of the instructor [string]

-image_100x100: image of the instructor [string]

-initials: initials of the instructor [string]

-url: url of the instructor [string]