Abstract
Principal component analysis (PCA) is a classical feature extraction method, but it can be adversely affected by outliers, which leads to an inaccurate estimate of the projection matrix. This paper proposes a robust method that estimates both the data mean and the PCA projection matrix by learning discriminant sample weights from data containing outliers. Each sample in the dataset is assigned a weight, and the proposed algorithm alternates between learning the weights and estimating the mean and the projection matrix. Specifically, when the mean and the projection matrix are available, a weight for each sample is learned hierarchically, via fine-grained analysis of outliers, so that outliers receive small weights while normal samples receive large weights. With the learned weights available, a weighted optimization problem is solved to estimate both the data mean and the projection matrix. Because the learned weights discriminate outliers from normal samples, the small weights assigned to outliers mitigate their adverse influence. Experiments on toy data, UCI data, and face data demonstrate the effectiveness of the proposed method in estimating the mean and the projection matrix from data containing outliers.
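
To make the alternating structure described above concrete, the following is a minimal sketch in Python with NumPy. It is not the paper's algorithm: the abstract does not specify the hierarchical weight-learning step, so the function `reweight` below substitutes a simple, hypothetical rule based on reconstruction error. The names `weighted_pca`, `reweight`, and `robust_pca_sketch`, as well as the parameters `k` and `n_iter`, are assumptions introduced only for illustration.

```python
import numpy as np


def weighted_pca(X, w, k):
    """Weighted mean and top-k principal directions of X (n samples x d features)."""
    w = w / w.sum()                      # normalize weights to sum to 1
    mean = w @ X                         # weighted data mean, shape (d,)
    Xc = X - mean                        # center with the weighted mean
    cov = (Xc * w[:, None]).T @ Xc       # weighted covariance, shape (d, d)
    _, eigvecs = np.linalg.eigh(cov)     # eigenvalues returned in ascending order
    U = eigvecs[:, ::-1][:, :k]          # keep the k leading eigenvectors
    return mean, U


def reweight(X, mean, U):
    """Hypothetical weighting rule (an assumption, NOT the paper's hierarchical scheme).

    Samples with a large reconstruction error get a small weight.
    """
    Xc = X - mean
    resid = Xc - Xc @ U @ U.T            # component outside the learned subspace
    err = np.sum(resid ** 2, axis=1)     # squared reconstruction error per sample
    scale = np.median(err) + 1e-12       # robust scale; epsilon avoids division by zero
    return 1.0 / (1.0 + err / scale)     # weights in (0, 1]


def robust_pca_sketch(X, k, n_iter=20):
    """Alternate between estimating (mean, U) and updating the sample weights."""
    w = np.ones(X.shape[0])              # start from uniform weights (ordinary PCA)
    for _ in range(n_iter):
        mean, U = weighted_pca(X, w, k)
        w = reweight(X, mean, U)
    return mean, U, w
```

Calling `robust_pca_sketch(X, k)` on an `n x d` array returns the estimated mean, a `d x k` projection matrix, and one weight per sample. Under this stand-in rule, samples far from the learned subspace end up with weights close to zero, which is the qualitative behavior the abstract attributes to the learned weights.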