XIMENA FERNANDEZ
University of Oxford
Astrophysics Seminar
Nagoya University - 22 August 2024
Geometry = fine details + quantitative answers
Topology = fundamental properties + qualitative answers
Let $X$ be a space and let $\mathbb{X}_n = \{x_1,...,x_n\}$ be a finite sample of $X$.
Q: How to infer topological properties of $X$ from $\mathbb{X}_n$?
Let $X$ be a space and let $\mathbb{X}_n = \{x_1,...,x_n\}$ be a finite sample of $X$.
Q: How to infer topological properties of $X$ from $\mathbb{X}_n$?
Point cloud
$\mathbb{X}_n \subset \mathbb{R}^D$
For $\epsilon>0$, the $\epsilon$-thickening of $\mathbb{X}_n$: \[\displaystyle U_\epsilon = \bigcup_{x\in \mathbb{X}_n}B_{\epsilon}(x)\]
Let $X$ be a space and let $\mathbb{X}_n = \{x_1,...,x_n\}$ be a finite sample of $X$.
Q: How to infer topological properties of $X$ from $\mathbb{X}_n$?
Point cloud
$\mathbb{X}_n \subset \mathbb{R}^D$
Evolving thickenings
Let $X$ be a topological space and let $\mathbb{X}_n = \{x_1,...,x_n\}$ be a finite sample of $X$.
Q: How to infer topological properties of $X$ from $\mathbb{X}_n$?
Point cloud
$\mathbb{X}_n \subset \mathbb{R}^D$
Evolving thickenings
Filtration of simplicial complexes
Let $X$ be a topological space and let $\mathbb{X}_n = \{x_1,...,x_n\}$ be a finite sample of $X$.
Q: How to infer topological properties of $X$ from $\mathbb{X}_n$?
Point cloud
$\mathbb{X}_n \subset \mathbb{R}^D$
Filtration of simplicial complexes
Persistence diagram
Gardner et al. 'Toroidal topology of population activity in grid cells'. Nature. (2022)
Reise, Fernandez, Dominguez, Harrington, Beguerisse-Diaz. 'Topological fingerprints for audio identification'. SIAM Journal of Data Science (2024)
Reise, Fernandez, Dominguez, Harrington, Beguerisse-Diaz. 'Topological fingerprints for audio identification'. SIAM Journal of Data Science (2024)
$~~~~$
$~~~~~~~~~~~~~~~~t_0~~~~~~~~~~~~~~~~t_1~~~~~~~~~~~~~~~~t_2~~~~~~~~~~~~~~~t_3~~~~~~~~~~~~~~~t_4~~\dots~~~~~~~~~~~~~~~~~~t'_0~~~~~~~~~~~~~~~~t'_1~~~~~~~~~~~~~~~~t'_2~~~~~~~~~~~~~~~~t'_3~~~~~~~~~~~~~~~t'_4~~\dots$
Fernandez X., Mateos D. 'Topological biomarkers for real-time detection of epileptic seizures'. Preprint (2024)
Let $\mathbb{X}_n = \{x_1,...,x_n\}\subseteq \mathbb{R}^D$ be a finite sample.
Let $\mathbb{X}_n = \{x_1,...,x_n\}\subseteq \mathbb{R}^D$ be a finite sample.
Assume that:
The path taken by a ray between two given points is the path that can be traversed in the least time.
That is, it is the extreme of the functional \[ \gamma\mapsto \int_{0}^1\eta(\gamma_t)||\dot{\gamma}_t|| dt \] with $\eta$ is the refraction index.
Let $\mathcal M \subseteq \mathbb{R}^D$ be a manifold and let $f\colon\mathcal{M}\to \mathbb{R}_{>0}$ be a smooth density.
For $q>0$, the deformed Riemannian distance* in $\mathcal{M}$ is \[d_{f,q}(x,y) = \inf_{\gamma} \int_{I}\frac{1}{f(\gamma_t)^{q}}||\dot{\gamma}_t|| dt \] over all $\gamma:I\to \mathcal{M}$ with $\gamma(0) = x$ and $\gamma(1)=y$.
* Here, if $g$ is the inherited Riemannian tensor, then $d_{f,q}$ is the Riemannian distance induced by $g_q= f^{-2q} g$.
Let $\mathbb{X}_n = \{x_1,...,x_n\}\subseteq \mathbb{R}^D$ be a finite sample.
For $p> 1$, the Fermat distance between $x,y\in \mathbb{R}^D$ is defined by \[ d_{\mathbb{X}_n, p}(x,y) = \inf_{\gamma} \sum_{i=0}^{r}|x_{i+1}-x_i|^{p} \] over all paths $\gamma=(x_0, \dots, x_{r+1})$ of finite length with $x_0=x$, $x_{r+1} = y$ and $\{x_1, x_2, \dots, x_{r}\}\subseteq \mathbb{X}_n$.
$d_{\mathbb X_n, p}$ is an estimator of $d_{f,q}$ if $q=(p-1)/d$.
Let $\mathcal{M}$ be a closed smooth $d$-dimensional manifold embedded in $\mathbb{R}^D$.
\[\big(\mathbb{X}_n, C(n,p,d) d_{\mathbb{X}_n,p}\big)\xrightarrow[n\to \infty]{GH}\big(\mathcal{M}, d_{f,q}\big) ~~~ \text{ for } q = (p-1)/d\]
Recall that $$d_{H}\big((X, d)(Y,d)\big) = \max \big\{\sup_{x\in X}d(x,Y), \sup_{y\in Y}d(X,y)\big\}, ~~\text{for }X,Y\subseteq (Z,d)$$ $$d_{GH}\big((X, d_X),(Y,d_Y)\big)= \inf_{\substack{Z \text{ metric space}\\ f:X\to Z, g:Y\to Z \text{ isometries}}}d_H(f(X), g(Y))$$
Let $\mathcal{M}$ be a closed smooth $d$-dimensional manifold embedded in $\mathbb{R}^D$.
\[\big(\mathbb{X}_n, C(n,p,d) d_{\mathbb{X}_n,p}\big)\xrightarrow[n\to \infty]{GH}\big(\mathcal{M}, d_{f,q}\big) ~~~ \text{ for } q = (p-1)/d\]
Theorem (F., Borghini, Mindlin, Groisman, 2023)
Let $\mathbb{X}_n$ be a sample of a closed manifold $\mathcal M$ of dimension $d$, drawn according to a density $f\colon \mathcal M\to \mathbb R$.
Given $p>1$ and $q=(p-1)/d$, there exists a constant $\mu = \mu(p,d)$ such that for every $\lambda \in \big((p-1)/pd, 1/d\big)$ and $\varepsilon>0$ there exist $\theta>0$ satisfying
\[
\mathbb{P}\left( d_{GH}\left(\big(\mathcal{M}, d_{f,q}\big), \big(\mathbb{X}_n, {\scriptstyle \frac{n^{q}}{\mu}} d_{\mathbb{X}_n, p}\big)\right) > \varepsilon \right) \leq \exp{\left(-\theta n^{(1 - \lambda d) /(d+2p)}\right)}
\]
for $n$ large enough.
$O(n^3)$
reducible to $O(n^2*k*\log(n))$ using the $k$-NN-graph (for $k = O(\log n)$ the geodesics belong to the $k$-NN graph with high probability).
fermat
ximenafernandez/intrinsicPH
\[\big(\mathbb{X}_n, C(n,p,d) d_{\mathbb{X}_n,p}\big)\xrightarrow[n\to \infty]{GH}\big(\mathcal{M}, d_{f,q}\big) ~~~ \text{ for } q = (p-1)/d\]
+Stability \[d_B\Big( \mathrm{dgm}\big(\mathrm{Filt}(X, d_X)\big), \mathrm{dgm}\big(\mathrm{Filt}(Y, d_Y\big)\Big)\leq 2 d_{GH}\big((X,d_X),(Y,d_Y)\big)\]
\[\big(\mathbb{X}_n, C(n,p,d) d_{\mathbb{X}_n,p})\big)\xrightarrow[n\to \infty]{GH}\big(\mathcal{M}, d_{f,q}\big) ~~~ \text{ for } q = (p-1)/d\]
+Stability \[d_B\Big( \mathrm{dgm}\big(\mathrm{Filt}(X, d_X)\big), \mathrm{dgm}\big(\mathrm{Filt}(Y, d_Y\big)\Big)\leq 2 d_{GH}\big((X,d_X),(Y,d_Y)\big)\]
$\Downarrow$\[\mathrm{dgm}(\mathrm{Filt}(\mathbb{X}_n, {C(n,p,d)} d_{\mathbb{X}_n,p}))\xrightarrow[n\to \infty]{B}\mathrm{dgm}(\mathrm{Filt}(\mathcal{M}, d_{f,q})) ~~~ \text{ for } q = (p-1)/d\]
Source data: PhysioNet Database https://physionet.org/about/database/
Source data: Private experiments. Laboratory of Dynamical Systems, University of Buenos Aires.