| Current Activities |
|
In Diversity Selection, the objective is to generate a small subset of a set of
objects that spans the entire metric space of the full set. One method
of doing this is to perform a Hierarchical
Clustering of the objects up to a given threshold. The subset
would then consist of the object closest to the centroid of each cluster.
A different method is to use a Greedy Procedure, which was originally used to produce the non-homologous set of peptide chains in the Protein Data Bank. This procedure is as follows:
This procedure generates a subset of objects in which every inter-object distance is greater than the threshold. All that is needed is a procedure to determine the distance between each pair of objects. |
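
The steps of the greedy procedure are not listed above, so the following is only a minimal sketch of one common greedy formulation, under the assumption that objects are visited in their given order and an object is kept only if it lies farther than the threshold from every object already kept. The names `greedy_diverse_subset`, `distance`, and `threshold` are hypothetical, and the Euclidean distance on 2-D points is chosen purely for illustration; it is not necessarily the procedure used for the Protein Data Bank set.

```python
from math import dist
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def greedy_diverse_subset(
    objects: Sequence[T],
    distance: Callable[[T, T], float],
    threshold: float,
) -> List[T]:
    """Return a subset whose pairwise distances all exceed `threshold`.

    Assumption: objects are scanned once, in input order, and each one is
    accepted only if it is farther than `threshold` from every object
    accepted so far. The result therefore depends on the input order.
    """
    selected: List[T] = []
    for candidate in objects:
        # Keep the candidate only if it is far enough from all kept objects.
        if all(distance(candidate, kept) > threshold for kept in selected):
            selected.append(candidate)
    return selected


# Illustrative data: 2-D points with Euclidean distance (math.dist).
points = [(0.0, 0.0), (0.5, 0.0), (3.0, 0.0), (3.2, 0.1), (0.0, 4.0)]
subset = greedy_diverse_subset(points, dist, threshold=1.0)
print(subset)  # [(0.0, 0.0), (3.0, 0.0), (0.0, 4.0)]
```

Because the only domain-specific input is the `distance` callable, the same sketch applies to any object type for which a pairwise distance can be computed. The scan costs at most one distance evaluation per (candidate, kept object) pair, so a full distance matrix is not required.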