Abstract
This work studies the problem of learning appropriate low dimensional image representations. We propose a generic algorithmic framework, which leverages two classic representation learning paradigms, i.e., sparse representation and the trace quotient criterion, to disentangle underlying factors of variation in high dimensional images. Specifically, we aim to learn simple representations of low dimensional, discriminant factors by applying the trace quotient criterion to well-engineered sparse representations. We construct a unified cost function, coined as the SPARse LOWdimensional representation (SparLow) function, for jointly learning both a sparsifying dictionary and a dimensionality reduction transformation. The SparLow function is widely applicable for developing various algorithms in three classic machine learning scenarios, namely, unsupervised, supervised, and semi-supervised learning. In order to develop efficient joint learning algorithms for maximizing the SparLow function, we deploy a framework of sparse coding with appropriate convex priors to ensure the sparse representations to be locally differentiable. Moreover, we develop an efficient geometric conjugate gradient algorithm to maximize the SparLow function on its underlying Riemannian manifold. Performance of the proposed SparLow algorithmic framework is investigated on several image processing tasks, such as 3D data visualization, face/digit recognition, and object/scene categorization.
| Original language | English |
|---|---|
| Article number | 8731748 |
| Pages (from-to) | 3119-3135 |
| Number of pages | 17 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 42 |
| Issue number | 12 |
| DOIs | |
| State | Published - Dec 2020 |
| Externally published | Yes |
Keywords
- Dictionary learning
- Geometric conjugate gradient algorithm
- Semi-supervised learning
- Sparse representation
- Supervised learning
- Trace quotient
- Unsupervised learning
- representation learning