Pandas is one of the main Python libraries for manipulating and analysing structured data and one of the first things to learn if you want to get started with data science.
This workshop is an introduction to Pandas where you will learn about:
- Jupyter notebooks
- Pandas data structures
- Transforming and exploring data
- Visualising data
After an interactive overview for each of the subjects there will be some exercises to practice what you have learned.
Jupyter notebooks are an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text.
In this workshop we will use IBM Watson Studio to run a notebook. For this you will need an IBM Cloud account. The following steps will show you how sign up and get started. When you have the notebook up and running we will go through the notebook.
-
Sign up for an IBM Cloud account
-
When you are signed up click
Create Resource
at the top of the Resources page. You can find the resources under the hamburger menu at the top left:
- Search for Watson Studio and click on the tile:
- Select the Lite plan and click
Create
. - Go back to the Resources list and click on your Watson Studio service and then click
Get Started
.
- You should now be in Watson Studio.
- Create a new project by clicking on
Get Started
andNew Project
, orCreate Project
. Choose aStandard
project. - Give your Project a name.
- Select an Object Storage from the drop-down menu or create a new one for free. This is used to store the notebooks and data. Do not forget to click refresh when returning to the Project page.
- click
Create
.
- We will analyse data about the sizes of jeans pockets in this workshop. To do this you need to add the data to your project.
- Download the
measurements.csv
file from here (Right click and save as a csv file). A big thanks to Jan Diehm and Amber Thomas for going around shops, measuring the jeans and putting the data online for everyone to use. - Add this file to your newly created project in Watson Studio by uploading the file in the right side menu (click the 1010 button if you do not see this):
- Add a new notebook. Click
Add to project
and chooseNotebook
:
- Choose new notebook
From URL
. Give your notebook a name and copy the URLhttps://github.com/IBMDeveloperUK/pandas-workshop/blob/master/pandas-workshop-v3.ipynb
- Select the default runtime and click
Create Notebook
. - The notebook will load.
You are now ready to follow along with the workshop in the notebook!