Skip to article frontmatterSkip to article content

Locally Linear Embedding

University of Lyon

Main Concept

LLE transforms high-dimensional data into a lower-dimensional space while preserving the local geometry of the data. The key insight is that each data point and its neighbors lie approximately on a locally linear patch of the manifold.

The transformation occurs in three main steps:

  1. For each point xix_i, find its kk nearest neighbors
  2. Compute weights WijW_{ij} that best reconstruct each point from its neighbors
  3. Find low-dimensional points yiy_i that are reconstructed by the same weights

The transformation preserves local relationships while allowing the global structure to be unfolded into a lower-dimensional space.

Theoretical Aspect

LLE involves two separate optimization problems:

  1. Weight Computation:
    minWixijWijxj2\min_W \sum_i |x_i - \sum_j W_{ij}x_j|^2
    Subject to constraints:
  • Wij=0W_{ij} = 0 if xjx_j is not a neighbor of xix_i
  • jWij=1\sum_j W_{ij} = 1 for each ii
  1. Embedding Computation:
    minYiyijWijyj2\min_Y \sum_i |y_i - \sum_j W_{ij}y_j|^2
    Subject to constraints:
  • iyi=0\sum_i y_i = 0 (centered at origin)
  • 1NiyiyiT=I\frac{1}{N}\sum_i y_iy_i^T = I (unit covariance)

The key variables being optimized are:

  • The weight matrix WW in the first step
  • The low-dimensional coordinates YY in the second step

Solution Methodology

The algorithm follows these main steps:

  1. Neighbor Search:

    • Use k-nearest neighbors or ε-ball methods
    • Typically employs efficient data structures like k-d trees
  2. Weight Computation:

    • Solve local linear systems for each point
    • Can be expressed as: Wi=argminwxijwjxj2W_i = \arg\min_w |x_i - \sum_j w_jx_j|^2
    • Solution involves solving a small linear system with the Gram matrix
  3. Embedding Computation:

    • The optimization reduces to an eigenvalue problem
    • M=(IW)T(IW)M = (I-W)^T(I-W)
    • Find bottom eigenvectors of MM (excluding the zero eigenvalue)
    • These eigenvectors provide the coordinates in the low-dimensional space

Global Optimality

The LLE algorithm has interesting optimality properties:

  1. Weight Computation:

    • This step is globally optimal for each local neighborhood
    • The solution is obtained by solving a convex optimization problem
  2. Embedding Computation:

    • The final embedding step has a global solution
    • It reduces to an eigenvalue problem which can be solved exactly

However, there are important limitations:

  1. Local Minima Issues:

    • The choice of neighbors affects the final solution
    • Different neighborhood sizes can lead to different embeddings
  2. Topological Limitations:

    • Cannot handle cases where the manifold is not locally linear
    • Struggles with holes or complex topological features

Conclusion

The key mathematical advantage of LLE over many other methods is its ability to reduce the dimensionality reduction problem to two sequential convex optimization problems, making it computationally efficient and theoretically well-grounded. However, its reliance on local linearity assumptions can be a limitation in practice.