These are notes from a talk given at the Stanford applied topology seminar by Gunnar Carlsson from 9 Oct 2009. The main function of this blog post is to get me an easily accessible point of access for the ideas in that talk.
First off, a few words on what we mean by coordinatization: as in algebraic geometry, we say that a coordinate function is some or possibly some , with all the niceness properties we’d expect to see in the context we’re working.
A particularly good example is Principal Component Analysis which yields a split linear automorphism on the ambient space that maximizes spread of the data points in the initial coordinates.
The core question we’re working with right now is this:
Given a space (point cloud) X, and a (persistent) view of , can we use some map to generate a map inducing that map?
In topology there are ways around to compute spaces of maps directly from available homological information. The key phrase here is the Eilenberg-Moore spectral sequence, by which we compute using . We note that with the compact open topology.
We fix a field k of coefficients. We can find a spectral sequence that computes
Variations of this spectral sequence allow us to compute . A homotopy class of maps thus corresponds to a cohomology element from .
and similarily unordered tuples
we can get a map, for pointed spaces
In the limit, we get the union .
The Dold-Thom theorem tells us:
Suppose we are really looking for maps .
We pick chain complexes . There is a hom chain complex consisting of graded maps with the homotopy/induced boundary
For this complex, we get
is the family of chain homotopy classes of chain maps .
Recall: are chain homotopic if there is a family such that .
Homotopic space maps yield homotopic chain complex maps. And thus
How, though, do we get from to ?
A simplicial space is a contravariant . It has a total space or realization given by
In the dual setting of cosimplicial spaces, the total space is a subspace of .
There is a cosimplicial space
whose total space is .
Furthermore, there is a spectral sequence starting at (well understood space!)
which recovers for us.
The mapping space maps to , which sends to chain maps. Differentials give us conditions on chain maps to come from actual maps on the topological level .
Suppose now that is a simplicial complex, and is known, say isomorphic to .
We pick as a model space, and work in . We pick out conditions form the spectral sequence above to pick out chain maps that might come from topological maps.
Suppose we had some chain map that sends .
We look for a chain homotopic map such that is a simplex (this will be WAY too optimistic, usually) or at least for small and close together in graph distances.
Yi Ding: there are likely many simplicial maps inducing any given . This makes optimizations on quality of maps (as measured in smallness properties of preimages of given simplices) troublesome, since we have a large search space and are likely to get stuck in local minima.
The topological approach creates very many maps, but we just want ONE – albeit with quality assurances. The big question is how to get there:
Shrinking the space? Homotopy collapses to reduce the number of generators?
Ask for more side conditions? If so, which?
Minimize the size of preimages and kernels – smoothing the resulting map?
Run with one condition until we reach a minimum, then change conditions to get out of that particular sink, repeat until it all looks stable?
Thus we need to search for strategies to improve convergence to actually good maps.
The questions we pose are similar to:
Jeff Ericksson, et al.: Given X, , how do we find a good representative ?
Finding good maps is really about finding good representatives in . Given simplicial bases we can pick out a first basis
(Kronecker ) for .
Finally: many problems in data analysis call for analyzing contractible spaces. So if we pick out the notion of a boundary, doing all of this relative to a boundary would give us increased power to use the resulting methods in data analysis.