Visualize High-Dimensional Data Fast | Watson Studio: Blog Here!
- Download the MNIST handwritten digits sample data set (about 1,000 images per digit) from here. The file's name is mnist_all_sample_10000.csv. If you want faster results, you can use a smaller sample of about 150 images per digit here.
- Create an account on Watson Studio cloud or download the desktop version here.
- Open Watson Studio.
- Click New project in the top right to create a new project in Watson Studio.
- Click the Assets tab if you are not already there.
- Upload mnist_all_sample_10000.csv: on the right-hand side of the screen, drop the file or browse for it.
- In your project, under Data assets, click the data set to see a preview.
- Click the blue Refine box in the top right to open the data set in the Data Refinery tool. This step might take a little while because the 10,000 by 785 data set is being loaded into the tool.
- Once the Data Refinery tool is open, navigate to the Visualizations tab.
- Create the t-SNE visualization:
- Select the t-SNE chart under CHART TYPES.
- Set the Perplexity parameter to 75.
- Select the column "label" as the Color map.
- This is the t-SNE visualization after 1000 iterations. Each colored cloud represents a different digit from zero to nine. For instance, the purple cloud represents the images of the digit one.
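
For a feel of what the Perplexity value of 75 controls: in the standard t-SNE formulation, perplexity is 2 raised to the Shannon entropy (in bits) of each point's neighbor distribution, and it is often read as an effective number of neighbors. The sketch below is plain Python for illustration only. It is not how Data Refinery computes the chart, and the function names, search bounds, and toy distances are all assumptions.

```python
import math


def perplexity(probs):
    """Perplexity of a discrete distribution: 2 ** (Shannon entropy in bits)."""
    entropy_bits = -sum(p * math.log2(p) for p in probs if p > 0)
    return 2 ** entropy_bits


def neighbor_probs(sq_dists, sigma):
    """Gaussian neighbor probabilities for one point, given squared distances to the others."""
    # Shift by the nearest distance so the largest weight is exp(0) = 1;
    # this keeps the weights from all underflowing to zero for small sigma.
    nearest = min(sq_dists)
    weights = [math.exp(-(d - nearest) / (2 * sigma ** 2)) for d in sq_dists]
    total = sum(weights)
    return [w / total for w in weights]


def sigma_for_perplexity(sq_dists, target, lo=1e-3, hi=1e3, steps=100):
    """Bisect on the Gaussian width; perplexity grows as the Gaussian widens."""
    for _ in range(steps):
        mid = (lo + hi) / 2
        if perplexity(neighbor_probs(sq_dists, mid)) < target:
            lo = mid  # too few effective neighbors: widen the Gaussian
        else:
            hi = mid  # too many effective neighbors: narrow it
    return (lo + hi) / 2


if __name__ == "__main__":
    # A uniform distribution over 75 neighbors has a perplexity of about 75.
    print(perplexity([1 / 75] * 75))

    # Toy squared distances from one point to 499 others (made-up values).
    toy_sq_dists = [(0.1 * i) ** 2 for i in range(1, 500)]
    sigma = sigma_for_perplexity(toy_sq_dists, target=75)
    print(sigma, perplexity(neighbor_probs(toy_sq_dists, sigma)))
```

Under this reading, a higher perplexity makes each point weigh more of its neighbors, which tends to emphasize broader structure over very local detail.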
- Select the