Applying PCA on MNIST

Principal Component Analysis (PCA), also known as the Karhunen-Loève transform, is widely used for dimensionality reduction, feature extraction, and data visualization. It provides a form of lossy data compression by orthogonally projecting the data onto a lower-dimensional space. The goal is to reduce the dimensionality from D dimensions, here 784 (a flattened 28×28 image), to a lower dimension M such that the variance of the projected data is maximized.

Let $\small x_n$ be one of the N data points and let $\small \mu$ be the mean of the N samples. PCA then consists of computing the covariance matrix S, the M largest eigenvalues of S, and the corresponding M eigenvectors, which form the columns of A:

$\large S = \frac{1}{N} \sum_{n=1}^{N} (x_n - \mu)(x_n - \mu)^T$

$\large A^T S A = \lambda$

$\large X_{new} = A^T X$
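The three steps above can be sketched in NumPy as follows. This is a minimal illustration, not the notebook's actual code: the function names `pca_fit` and `pca_project` are hypothetical, random data stands in for the flattened MNIST images, samples are stored as rows (so the projection is written as `X @ A` rather than `A^T X`), and subtracting the mean before projecting is an assumption made here.

```python
import numpy as np

def pca_fit(X, M):
    """Return the mean, the top-M eigenvectors (as columns) and eigenvalues.

    X has shape (N, D): one flattened sample per row.
    """
    mu = X.mean(axis=0)
    Xc = X - mu
    # Covariance with the 1/N normalisation used in the formula for S.
    S = Xc.T @ Xc / X.shape[0]
    # eigh is meant for symmetric matrices; it returns ascending eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(S)
    order = np.argsort(eigvals)[::-1][:M]  # indices of the M largest
    return mu, eigvecs[:, order], eigvals[order]

def pca_project(X, mu, A):
    """Project centred samples onto the columns of A."""
    return (X - mu) @ A

# Synthetic stand-in for N flattened 28x28 images (assumption: not real MNIST).
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 784)) * np.linspace(3.0, 0.1, 784)

mu, A, lam = pca_fit(X, M=10)
X_new = pca_project(X, mu, A)  # shape (500, 10)
```

Because the columns of A are orthonormal eigenvectors of S, the covariance of `X_new` should come out diagonal, with the selected eigenvalues on the diagonal.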

After using PCA to reduce the dimension from 784 to 10, the covariance of the projected data points looks like the following:

After PCA whitening, that is, after computing $\small W X$, the lossy compression of the images becomes visible:
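For reference, whitening and the resulting lossy reconstruction can be sketched as below. The text does not define W, so this sketch assumes the common choice in which W is the transposed eigenvector matrix with each row scaled by the inverse square root of its eigenvalue. The data is again a random stand-in for flattened images, and the variable names are illustrative only.

```python
import numpy as np

# Synthetic stand-in for flattened 28x28 images (assumption: not real MNIST).
rng = np.random.default_rng(1)
X = rng.normal(size=(400, 784)) * np.linspace(2.0, 0.05, 784)

mu = X.mean(axis=0)
Xc = X - mu
S = Xc.T @ Xc / len(X)
eigvals, eigvecs = np.linalg.eigh(S)
top = np.argsort(eigvals)[::-1][:10]
A, lam = eigvecs[:, top], eigvals[top]

# Assumed whitening matrix: each eigenvector row divided by sqrt(eigenvalue).
W = A.T / np.sqrt(lam)[:, None]  # shape (10, 784)
Z = Xc @ W.T                     # whitened data, shape (400, 10)

# Lossy reconstruction from the 10 retained components only.
X_hat = mu + (Xc @ A) @ A.T
mean_sq_error = np.mean(np.sum((X - X_hat) ** 2, axis=1))
```

Under this assumption the whitened components have unit variance and are uncorrelated, and the average squared reconstruction error equals the sum of the discarded eigenvalues, which is what makes the compression lossy.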

Result of Classification

Reduced to two dimensions using PCA

Result of Fisher's LDA:

| Metric Description | Result |
| --- | --- |
| Training accuracy | 93.0% |
| Class 0 (true label = 5) training accuracy | 95.5% |
| Class 1 (true label = 8) training accuracy | 90.5% |
| Test accuracy | 71% |
| Class 0 (true label = 5) test accuracy | 6.0% |
| Class 1 (true label = 8) test accuracy | 94.0% |
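A two-class Fisher discriminant of the kind reported above can be sketched as follows. This is an illustrative sketch, not the notebook's implementation: `fisher_lda_fit` and `fisher_lda_predict` are hypothetical names, the decision threshold at the midpoint of the projected class means is an assumption, and two synthetic Gaussian clouds stand in for the PCA-reduced digits 5 and 8, so the accuracies it yields are unrelated to the figures in the table.

```python
import numpy as np

def fisher_lda_fit(X0, X1):
    """Fit a two-class Fisher discriminant; rows of X0 and X1 are samples."""
    m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
    # Within-class scatter: sum of the two per-class scatter matrices.
    Sw = (X0 - m0).T @ (X0 - m0) + (X1 - m1).T @ (X1 - m1)
    # Fisher direction is proportional to Sw^{-1} (m1 - m0).
    w = np.linalg.solve(Sw, m1 - m0)
    w /= np.linalg.norm(w)
    # Assumed threshold: midpoint between the projected class means.
    threshold = 0.5 * (m0 @ w + m1 @ w)
    return w, threshold

def fisher_lda_predict(X, w, threshold):
    """Return 1 where the projection exceeds the threshold, else 0."""
    return (X @ w > threshold).astype(int)

# Synthetic 10-dimensional stand-ins for the two classes (assumption).
rng = np.random.default_rng(2)
X0 = rng.normal(loc=0.0, size=(200, 10))  # plays the role of class 0
X1 = rng.normal(loc=1.5, size=(200, 10))  # plays the role of class 1

w, t = fisher_lda_fit(X0, X1)
acc0 = np.mean(fisher_lda_predict(X0, w, t) == 0)
acc1 = np.mean(fisher_lda_predict(X1, w, t) == 1)
```

Reporting `acc0` and `acc1` separately, as the table does, makes it easier to see when a single threshold favours one class over the other.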
