Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

2023年4月9日下午11:50 • 深度学习

本文是基于Exercise:PCA and Whitening的练习。

理论知识见：UFLDL教程。

实验内容：从10张512*512自然图像中随机选取10000个12*12的图像块（patch），然后对这些patch进行99%的方差保留的PCA计算，最后对这些patch做PCA Whitening和ZCA Whitening，并进行比较。

实验步骤及结果

1.加载图像数据，得到10000个图像块为原始数据x，它是144*10000的矩阵，随机显示200个图像块，其结果如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

2.把它的每个图像块0均值归一化。

3.PCA降维过程的第一步：求归一化后的原始数据x的协方差矩阵sigma，然后用svd对sigma求出它的U，即原始数据的特征向量或基，再把x投影或旋转到基的方向上，得到新数据xRot。

4.检查PCA实现的第一步是否正确：只需要把xRot的协方差矩阵显示出来。如果是正确的，就会显示出一条直线对角穿过蓝色背景的图片。结果如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

5.根据要保留99%方差的要求计算出要保留的主成份个数k。

6.PCA降维过程的第二步：保留xRot的前k个成份，后面的全置为0，得到数据xTilde，基U乘以数据xTilde的前k个成份（即：前k行）就得降维后数据xHat。xHat显示结果如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

为了对比，有0均值归一化后未降维前的数据显示如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

7.对0均值归一化后的数据x实现PCA Whitening，得到PCA白化后的数据xPCAWhite，其显示结果如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

8.检查PCA白化是否规整化：显示数据xPCAWhite的协方差矩阵。如未规整化，则数据xPCAWhite的协方差矩阵是一个恒等矩阵；如已规整化，则数据xPCAWhite的协方差矩阵的对角线上的值接近于1且依次变小。所以，如未规整化，把epsilon置为0或接近于0，就会得到一条红线对角穿过蓝色背景图片；如已规整化，就会得到就会得到一条从红色渐变到蓝色的线对角穿过蓝色背景的图片。显示结果如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

9.在PCA Whitening的基础上实现ZCAWhitening，得到的数据xZCAWhite＝U* xPCAWhite。因为前面已经检查过PCA白化，而zca白化是在pca的基础上做的，故这一步不需要再检查。ZCA白化的结果显示如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

对比PCA白化结果，可以看出，ZCA白化更接近原始数据。

与其相对应的归一化原始数据显示如下：

Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程）

代码

pca_gen.m

close all;
% clear all;
%%================================================================
%% Step 0a: Load data
% Here we provide the code to load natural image data into x.
% x will be a 144 * 10000 matrix, where the kth column x(:, k) corresponds to
% the raw image data from the kth 12x12 image patch sampled.
% You do not need to change the code below.

x = sampleIMAGESRAW();
figure('name','Raw images');
randsel = randi(size(x,2),200,1); % A random selection of samples for visualization
display_network(x(:,randsel));

%%================================================================
%% Step 0b: Zero-mean the data (by row)
% You can make use of the mean and repmat/bsxfun functions.

% -------------------- YOUR CODE HERE --------------------
avg = mean(x, 1);                 %x的每一列的均值
x = x - repmat(avg, size(x, 1), 1);
%%================================================================
%% Step 1a: Implement PCA to obtain xRot
% Implement PCA to obtain xRot, the matrix in which the data is expressed
% with respect to the eigenbasis of sigma, which is the matrix U.

 
% -------------------- YOUR CODE HERE --------------------
xRot = zeros(size(x)); % You need to compute this
sigma = x * x' / size(x, 2);
[U,S,V]=svd(sigma);
xRot=U'*x;
 
%%================================================================
%% Step 1b: Check your implementation of PCA
% The covariance matrix for the data expressed with respect to the basis U
% should be a diagonal matrix with non-zero entries only along the main
% diagonal. We will verify this here.
% Write code to compute the covariance matrix, covar.
% When visualised as an image, you should see a straight line across the
% diagonal (non-zero entries) against a blue background (zero entries).
 
% -------------------- YOUR CODE HERE --------------------
covar = zeros(size(x, 1)); % You need to compute this
covar = xRot * xRot' / size(xRot, 2);
% Visualise the covariance matrix. You should see a line across the
% diagonal against a blue background.
figure('name','Visualisation of covariance matrix');
imagesc(covar);
 
%%================================================================
%% Step 2: Find k, the number of components to retain
% Write code to determine k, the number of components to retain in order
% to retain at least 99% of the variance.
 
% -------------------- YOUR CODE HERE --------------------
k = 0; % Set k accordingly
sum_k=0;
sum=trace(S);
for k=1:size(S,1)
        sum_k=sum_k+S(k,k);
        if(sum_k/sum>=0.99) %0.9
               break;
       end
end
 
%%================================================================
%% Step 3: Implement PCA with dimension reduction
% Now that you have found k, you can reduce the dimension of the data by
% discarding the remaining dimensions. In this way, you can represent the
% data in k dimensions instead of the original 144, which will save you
% computational time when running learning algorithms on the reduced
% representation.
%
% Following the dimension reduction, invert the PCA transformation to produce
% the matrix xHat, the dimension-reduced data with respect to the original basis.
% Visualise the data and compare it to the raw data. You will observe that
% there is little loss due to throwing away the principal components that
% correspond to dimensions with low variation.

% -------------------- YOUR CODE HERE --------------------
xHat = zeros(size(x));% You need to compute this
xTilde = U(:,1:k)' * x;
xHat(1:k,:)=xTilde;
xHat=U*xHat;
 
% Visualise the data, and compare it to the raw data
% You should observe that the raw and processed data are of comparable quality.
% For comparison, you may wish to generate a PCA reduced image which
% retains only 90% of the variance.
 
figure('name',['PCA processed images ',sprintf('(%d / %d dimensions)', k, size(x, 1)),'']);
display_network(xHat(:,randsel));
figure('name','Raw images');
display_network(x(:,randsel));
 
%%================================================================
%% Step 4a: Implement PCA with whitening and regularisation
% Implement PCA with whitening and regularisation to produce the matrix
% xPCAWhite.
 
epsilon = 0.1;
xPCAWhite = zeros(size(x));
 
% -------------------- YOUR CODE HERE --------------------
xPCAWhite = diag(1./sqrt(diag(S) + epsilon)) * U' * x;

figure('name','PCA whitened images');
display_network(xPCAWhite(:,randsel));

%%================================================================
%% Step 4b: Check your implementation of PCA whitening
% 检查PCA白化是否规整化。如未规整化，则协方差矩阵是一个恒等矩阵；如已规整化，则其协方差矩阵的对角线上的值接近于1且依次变小。
% Check your implementation of PCA whitening with and without regularisation.
% PCA whitening without regularisation results a covariance matrix
% that is equal to the identity matrix. PCA whitening with regularisation
% results in a covariance matrix with diagonal entries starting close to
% 1 and gradually becoming smaller. We will verify these properties here.
% Write code to compute the covariance matrix, covar.
%
% 如未规整化，把epsilon置为0或接近于0，就会得到一条红线对角穿过蓝色背景图片。
% 如已规整化，就会得到就会得到一条从红色渐变到蓝色的线对角穿过蓝色背景的图片。
% Without regularisation (set epsilon to 0 or close to 0),
% when visualised as an image, you should see a red line across the
% diagonal (one entries) against a blue background (zero entries).
% With regularisation, you should see a red line that slowly turns
% blue across the diagonal, corresponding to the one entries slowly
% becoming smaller.
 
% -------------------- YOUR CODE HERE --------------------
covar = zeros(size(xPCAWhite, 1));
covar = xPCAWhite * xPCAWhite' / size(xPCAWhite, 2);
% Visualise the covariance matrix. You should see a red line across the
% diagonal against a blue background.
figure('name','Visualisation of covariance matrix');
imagesc(covar);
 
%%================================================================
%% Step 5: Implement ZCA whitening
% Now implement ZCA whitening to produce the matrix xZCAWhite.
% Visualise the data and compare it to the raw data. You should observe
% that whitening results in, among other things, enhanced edges.
 
xZCAWhite = zeros(size(x));
 
% -------------------- YOUR CODE HERE --------------------
xZCAWhite=U * diag(1./sqrt(diag(S) + epsilon)) * U' * x;
% Visualise the data, and compare it to the raw data.
% You should observe that the whitened images have enhanced edges.
figure('name','ZCA whitened images');
display_network(xZCAWhite(:,randsel));
figure('name','Raw images');
display_network(x(:,randsel));

　

参考资料：

http://deeplearning.stanford.edu/wiki/index.php/UFLDL_Tutorial

Deep Learning三：预处理之主成分分析与白化_总结（斯坦福大学UFLDL深度学习教程）

Deep learning：十二(PCA和whitening在二自然图像中的练习)

本站文章如无特殊说明，均为本站原创，如若转载，请注明出处：Deep Learning 5_深度学习UFLDL教程：PCA and Whitening_Exercise（斯坦福大学深度学习教程） - Python技术站

赞 (0)

微信扫一扫

微信扫一扫

支付宝扫一扫

支付宝扫一扫

Deep Learning 6_深度学习UFLDL教程：Softmax Regression_Exercise（斯坦福大学深度学习教程）

上一篇 2023年4月9日下午11:49

Deep Learning 13_深度学习UFLDL教程：Independent Component Analysis_Exercise（斯坦福大学深度学习教程）

下一篇 2023年4月9日下午11:50

事实胜于雄辩,苹果MacOs能不能玩儿机器/深度(ml/dl)学习(Python3.10/Tensorflow2)

坊间有传MacOs系统不适合机器(ml)学习和深度(dl)学习，这是板上钉钉的刻板印象，就好像有人说女生不适合编程一样的离谱。现而今，无论是Pytorch框架的MPS模式，还是最新的Tensorflow2框架，都已经可以在M1/M2芯片的Mac系统中毫无桎梏地使用GPU显卡设备，本次我们来分享如何在苹果MacOS系统上安装和配置Tensorflow2框架（C…

深度学习 2023年4月13日
000
python: 深度学习-误差反向传播法

ReLU层的设计： ReLU函数：　　导数：　　 class Relu: def __init__(self): self.mask=None def forword(self,x): self.mask=(x<0) #变量mask是由True/False构成的Numpy数组 out=x.copy() out[self.mask]=0 retur…

深度学习 2023年4月10日
000
深度学习

（实战篇）从头开发机器翻译系统！

在本文中，您将学习如何使用 Keras 从头开发一个深度学习模型，自动从德语翻译成英语。机器翻译是一项具有挑战性的任务，传统上涉及使用高度复杂的语言知识开发的大型统计模型。在本教程中，您将了解如何开发用于将德语短语翻译成英语的神经机器翻译系统。完成本教程后，您将了解：如何清理和准备数据以训练神经机器翻译系统。如何为机器翻译开发编码器-解码器模型。 …

2023年2月12日
000
深度学习

【27】什么是端到端的深度学习？

什么是端到端的深度学习？（What is end-to-end deep learning?）深度学习中最令人振奋的最新动态之一就是端到端深度学习的兴起，那么端到端学习到底是什么呢？简而言之，以前有一些数据处理系统或者学习系统，它们需要多个阶段的处理。那么端到端深度学习就是忽略所有这些不同的阶段，用单个神经网络代替它。我们来看一些例子，以语音识别为例，…

2023年4月10日
000
深度学习多机多卡解决方案-purine

未经允许请不要转载，原作者：zhxfl，http://www.cnblogs.com/zhxfl/p/5287644.html 目录：一、简介二、环境配置三、运行demo 四、硬件配置建议五、其他一、简介深度学习多机多卡集群已经成为主流，相对于caffe和mxnet这两个比较活跃的开源，purine显得更值得在高校的学生细读，因为purine…

深度学习 2023年4月10日
000
神经网络与深度学习笔记（二）逻辑回归

逻辑回归函数是由两个函数符合而成，首先我们有sigmoid函数g(z)：当然这里面的参数可以加上各种有关theta的定值，并不一定必须就只有x之前的theta参数。然后再把g(z)拿到h(x)函数里面去拟合就可以了，h(x)则是我们的Logistic回归函数。把这两个方程拟合放到一起有： sigmoid函数长这样：由于我们是二分类的问题，因此y只有1…

深度学习 2023年4月11日
000
深度学习word embedding猜测性别初探

根据用户的一些特征数据，如果能推测出用户的性别借此提高产品的服务质量、广告的精准性等都是极好的。机器学习方法有很多，而且一般都可以达到不错的效果，比如svm或神经网络等。本文使用的代码参考——《TensorFlow练习18: 根据姓名判断性别》但原文代码已经无法直接跑起来，对于最新的TensorFlow需要酌情调整部分参数和函数名等，根据报错调整即可…

深度学习 2023年4月11日
000
TensorFlow实战Google深度学习框架8-9章学习笔记

目录第8章循环神经网络第9章自然语言处理第8章循环神经网络循环神经网络的主要用途是处理和预测序列数据。循环神经网络的来源就是为了刻画一个序列当前的输出与之前信息的关系。也就是说，循环神经网络的隐藏层之间的节点是有连接的，隐藏层的输入不仅包括输入层的输出，还包括上一时刻隐藏层的输出。下面给出一个长度为2的RNN前向传播示例代码： impo…

深度学习 2023年4月15日
000

合作推广

合作推广

返回顶部