These are notes from a talk given at the Stanford applied topology seminar by Gunnar Carlsson from 9 Oct 2009. The main function of this blog post is to get me an easily accessible point of access for the ideas in that talk.
Coordinatization
First off, a few words on what we mean by coordinatization: as in algebraic geometry, we say that a coordinate function is some
or possibly some
, with all the niceness properties we’d expect to see in the context we’re working.
A particularly good example is Principal Component Analysis which yields a split linear automorphism on the ambient space that maximizes spread of the data points in the initial coordinates.
Topological coordinatization
The core question we’re working with right now is this:
Given a space (point cloud) X, and a (persistent) view of
, can we use some map
to generate a map
inducing that map?
In topology there are ways around to compute spaces of maps directly from available homological information. The key phrase here is the Eilenberg-Moore spectral sequence, by which we compute
using
. We note that
with the compact open topology.
We fix a field k of coefficients. We can find a spectral sequence that computes

Variations of this spectral sequence allow us to compute
. A homotopy class of maps thus corresponds to a cohomology element from
.
Monad approximation
Define
(unordered pairs)
and similarily unordered tuples

we can get a map, for pointed spaces 


In the limit, we get the union
.
The Dold-Thom theorem tells us:

Suppose we are really looking for maps
.
We pick chain complexes
. There is a hom chain complex
consisting of graded maps
with the homotopy/induced boundary

For this complex, we get
is the family of chain homotopy classes of chain maps
.
Recall:
are chain homotopic if there is a family
such that
.
Homotopic space maps yield homotopic chain complex maps. And thus

How, though, do we get from
to
?
Cosimplicial spaces
A simplicial space is a contravariant
. It has a total space or realization given by
![\coprod X_n\times\Delta[n]/\sim \coprod X_n\times\Delta[n]/\sim](/latexrender/pictures/d1b9714668d5e1dd4ecd46986b3a9cee.png)
In the dual setting of cosimplicial spaces, the total space is a subspace of
.
There is a cosimplicial space

whose total space is
.
Furthermore, there is a spectral sequence starting at
(well understood space!)
going through

which recovers
for us.
The mapping space
maps to
, which sends
to chain maps. Differentials give us conditions on chain maps to come from actual maps on the topological level
.
Suppose now that
is a simplicial complex, and
is known, say isomorphic to
.
We pick
as a model space, and work in
. We pick out conditions form the spectral sequence above to pick out chain maps that might come from topological maps.
Suppose we had some chain map
that sends
.
We look for a chain homotopic map
such that
is a simplex (this will be WAY too optimistic, usually) or at least
for small
and
close together in graph distances.
Yi Ding: there are likely many simplicial maps
inducing any given
. This makes optimizations on quality of maps (as measured in smallness properties of preimages of given simplices) troublesome, since we have a large search space and are likely to get stuck in local minima.
The topological approach creates very many maps, but we just want ONE – albeit with quality assurances. The big question is how to get there:
Shrinking the space? Homotopy collapses to reduce the number of generators?
Ask for more side conditions? If so, which?
Minimize the size of preimages and kernels – smoothing the resulting map?
Run with one condition until we reach a minimum, then change conditions to get out of that particular sink, repeat until it all looks stable?
Thus we need to search for strategies to improve convergence to actually good maps.
The questions we pose are similar to:
Jeff Ericksson, et al.: Given X,
, how do we find a good representative
?
Finding good maps is really about finding good representatives in
. Given simplicial bases
we can pick out a first basis
(Kronecker
) for
.
Finally: many problems in data analysis call for analyzing contractible spaces. So if we pick out the notion of a boundary, doing all of this relative to a boundary would give us increased power to use the resulting methods in data analysis.
Dold-Thom theorem?… interesting stuff…