Visualize High-Dimensional Data Fast | Watson Studio: Blog Here!
-
Download the MNIST handwritten digits sample data set (about 1,000 images per digit) from here. The file's name is
mnist_all_sample_10000.csv. If you want to speed up results, you can use a smaller sample of about 150 images per digit here. -
Create an account on Watson Studio cloud or download the desktop version here.
-
Open Watson Studio.
-
Click
New projecton the top right to create a new project on Watson Studio.
-
Click the
Assetstab if you are not already there. -
Upload the
mnist_all_sample_10000.csv, on the right hand side of the screen drop or browse the file.
-
In your project, under
Data assets, click the data set to see a preview of the data set.
-
Click the
Refineblue box in the top right to open the data set with the Data Refinery tool. This step might take a little while since the 10,000 by 785 dataset is being loaded into the Data Refinery tool.
-
Once the Data Refinery tool is open, navigate to the
Visualizationstab.
-
Create the t-SNE visualization:
- Select the
t-SNE charton the CHART TYPES.
- Set the
Perplexityparameter to 75.
- Select the column "label" as the
Color map.
- This is the t-SNE visualization after 1000 iterations. Each colored cloud represent a different digit from zero to nine. For instance, the purple cloud represents the images of the number one digit.

- Select the
