A multivariate approach to dimension estimation on manifolds with triangle area U-statistics

For data on a manifold M ⊆ ℝᵐ and a point p ∈ M, this thesis introduces a multivariate estimator that leverages angle- and triangle area-based U-statistics (U₁, V₁, V₂) to assess the intrinsic dimension of M at p. By considering the variance of angles formed by pairs of nearby data points, and the m...

Full description

Autores:
Vargas Robayo, Bianca Michelle
Tipo de recurso:
Trabajo de grado de pregrado
Fecha de publicación:
2025
Institución:
Universidad de los Andes
Repositorio:
Séneca: repositorio Uniandes
Idioma:
eng
OAI Identifier:
oai:repositorio.uniandes.edu.co:1992/75813
Acceso en línea:
https://hdl.handle.net/1992/75813
Palabra clave:
Estadística
Manifold Learning
Matemáticas
Rights
openAccess
License
Attribution 4.0 International
Description
Summary:For data on a manifold M ⊆ ℝᵐ and a point p ∈ M, this thesis introduces a multivariate estimator that leverages angle- and triangle area-based U-statistics (U₁, V₁, V₂) to assess the intrinsic dimension of M at p. By considering the variance of angles formed by pairs of nearby data points, and the mean and variance of triangle areas formed by triplets, the proposed methodology captures the manifold's local geometry. A multivariate approach using the Mahalanobis distance ensures robust dimension estimation by incorporating covariance structure. The estimator is evaluated through testing on both simulated manifolds and real-world datasets. Robust statistical tools, such as the Minimum Volume Ellipsoid (MVE), are employed to enhance reliability.