| Title: | UniqueProt: creating representative protein sequence sets |
| Author: | Sven Mika & Burkhard Rost |
| Citation: | Nucl Acids Res, 2003, 31, 3642-3644 |
UniqueProt is a practical, easy-to-use web service designed to create representative, unbiased data sets of protein sequences. The largest possible representative sets are found through a simple greedy algorithm that uses the HSSP-value to establish sequence similarity. UniqueProt is not a real clustering program, in the sense that the 'representatives' are not at the centres of well-defined clusters, since the definition of such clusters is problem-specific. Overall, UniqueProt is a reasonably fast solution for bias in data sets. The service is accessible at http://cubic.bioc.columbia.edu/services/uniqueprot; a command-line version for Linux is downloadable from this website.
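The greedy selection described above can be illustrated with a minimal sketch. This is not the UniqueProt implementation: the function name, the input format (precomputed pairwise HSSP-values keyed by sequence-identifier pairs), and the choice of greedy rule (keep the sequence with the fewest remaining similar neighbours, then discard those neighbours) are assumptions made for illustration only. The similarity threshold is left to the caller.

```python
from typing import Dict, Iterable, List, Set, Tuple


def greedy_representatives(
    ids: Iterable[str],
    pair_scores: Dict[Tuple[str, str], float],
    threshold: float,
) -> List[str]:
    """Pick a set of sequences in which no two are 'similar'.

    Hypothetical helper, not UniqueProt's own code. Two sequences count as
    similar when their precomputed HSSP-value is >= threshold. Pairs that
    are absent from pair_scores are treated as not similar.
    """
    # Build an undirected neighbour graph from the pairwise scores.
    neighbours: Dict[str, Set[str]] = {i: set() for i in ids}
    for (a, b), score in pair_scores.items():
        if a == b or a not in neighbours or b not in neighbours:
            continue  # ignore self-pairs and identifiers outside the input set
        if score >= threshold:
            neighbours[a].add(b)
            neighbours[b].add(a)

    remaining: Set[str] = set(neighbours)
    chosen: List[str] = []
    while remaining:
        # Greedy step (assumed rule): keep the sequence that currently has
        # the fewest similar neighbours; ties are broken by identifier so
        # the result is deterministic.
        pick = min(remaining, key=lambda i: (len(neighbours[i] & remaining), i))
        chosen.append(pick)
        # Discard the kept sequence and everything similar to it.
        remaining -= neighbours[pick] | {pick}
    return chosen


if __name__ == "__main__":
    # Toy data: the scores are made up for illustration.
    scores = {("A", "B"): 5.0, ("B", "C"): 3.0, ("C", "D"): -2.0}
    print(greedy_representatives(["A", "B", "C", "D"], scores, threshold=0.0))
```

Like any greedy heuristic, this rule yields a set in which no two members exceed the threshold, but it does not guarantee the largest such set for every input. It also makes no attempt to place the kept sequences at cluster centres, in line with the remark above that the 'representatives' are not cluster centroids.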