pr060534k_si_001.pdf (65.69 kB)
Protein Family Classification with Partial Least Squares
journal contribution
posted on 2007-02-02, 00:00 authored by Stephen O. Opiyo, Etsuko N. MoriyamaThe quality of protein function predictions relies on appropriate training of protein classification methods.
Performance of these methods can be affected when only a limited number of protein samples are
available, which is often the case in divergent protein families. Whereas profile hidden Markov models
and PSI-BLAST presented significant performance decrease in such cases, alignment-free partial least-squares classifiers performed consistently better even when used to identify short fragmented
sequences.
Keywords: partial least square • physico-chemical properties • amino acid composition • profile hidden Markov
model • G-protein coupled receptors