Skip to article frontmatterSkip to article content

Local Tangent Space Alignment

University of Lyon

Main Concept

Local Tangent Space Alignment (LTSA) aims to discover the underlying low-dimensional manifold by characterizing the local geometry around each data point. The key idea is to approximate the manifold locally at each point with a tangent space. These local tangent spaces are then aligned to form a global coordinate system that represents the manifold. This is done by finding a linear transformation that best aligns the local tangent spaces. The method transforms the original data representation by projecting the locally centered data points onto their respective tangent spaces and then assembling these local representations into a global low-dimensional embedding. The coordinates of data points change as they are first projected onto local tangent spaces and then combined into a global coordinate system through an alignment process.

Theoretical Aspect

For each data point xi\mathbf{x}_i, LTSA first finds its kk nearest neighbors NiN_i. It then computes the local tangent space at xi\mathbf{x}_i by performing Principal Component Analysis (PCA) on the locally centered neighbors:

Xi=[xj1xi,xj2xi,...,xjkxi]\mathbf{X}_i = [\mathbf{x}_{j_1} - \mathbf{x}_i, \mathbf{x}_{j_2} - \mathbf{x}_i, ..., \mathbf{x}_{j_k} - \mathbf{x}_i] where jnNij_n \in N_i.

The covariance matrix of these centered neighbors is:

Ci=XiXiT\mathbf{C}_i = \mathbf{X}_i \mathbf{X}_i^T

PCA finds the eigenvectors of Ci\mathbf{C}_i. The eigenvectors corresponding to the dd largest eigenvalues (where dd is the desired embedding dimension) form the basis of the local tangent space Ti\mathbf{T}_i.

The local coordinates Yi\mathbf{Y}_i of the neighbors in the tangent space are given by:

Yi=TiTXi\mathbf{Y}_i = \mathbf{T}_i^T \mathbf{X}_i

The key step is to find a global embedding Y\mathbf{Y} that aligns these local coordinate systems. This is achieved by minimizing the following objective function:

L=i=1nYiYBi2\mathcal{L} = \sum_{i=1}^{n} ||\mathbf{Y}_i - \mathbf{Y} \mathbf{B}_i||^2

where Bi\mathbf{B}_i is a selection matrix that selects the rows of Y\mathbf{Y} corresponding to the neighbors of xi\mathbf{x}_i. The key variable being optimized is Y\mathbf{Y}, the global embedding matrix.

This can be rewritten in a more convenient matrix form:

L=tr(YMYT)\mathcal{L} = \text{tr}(\mathbf{Y} \mathbf{M} \mathbf{Y}^T)

where M\mathbf{M} is a matrix derived from the local tangent spaces and neighborhood structure.

Solution Methodology

The optimization problem can be solved by finding the eigenvectors of the matrix M\mathbf{M}. The algorithm proceeds as follows:

  1. Neighborhood Selection: For each data point, find its kk nearest neighbors.
  2. Local Tangent Space Computation: Compute the local tangent space Ti\mathbf{T}_i at each data point using PCA.
  3. Local Coordinate Computation: Compute the local coordinates Yi\mathbf{Y}_i in each tangent space.
  4. Alignment (Global Embedding): Construct the matrix M\mathbf{M} and find its eigenvectors. The dd eigenvectors corresponding to the smallest non-zero eigenvalues form the rows of the global embedding Y\mathbf{Y}.

The solution involves standard numerical methods like nearest neighbor search, PCA (eigenvalue decomposition of covariance matrices), and eigenvalue decomposition of the global alignment matrix M\mathbf{M}.

Global Optimality

The solution obtained by finding the eigenvectors of M\mathbf{M} minimizes the alignment error as defined by the objective function L\mathcal{L}. Therefore, it is a global minimum with respect to the defined objective function. However, the overall process, which involves local tangent space approximations, does not guarantee finding the globally optimal embedding of the manifold itself. The accuracy of the local tangent space approximations depends on the local curvature of the manifold and the choice of the neighborhood size kk. If the manifold has high curvature or sharp bends, the local linear approximations might be poor, leading to suboptimal global alignments. Thus, local minima issues can arise from inaccuracies in the local tangent space estimations.

Conclusion

Local Tangent Space Alignment is a powerful manifold learning technique that effectively captures the local geometric structure of data by aligning local tangent spaces. It is particularly suitable for data lying on smooth manifolds. While the alignment process itself has a globally optimal solution, the overall quality of the embedding depends on the accuracy of the local tangent space approximations. A meaningful use case is in human motion capture analysis. Human motion data often resides on a complex, high-dimensional manifold. LTSA can be used to discover a low-dimensional representation of this motion, which can be useful for tasks like motion recognition, synthesis, and compression.