spombe.predictions.csv (1.29 MB)

S. Pombe predictions

Posted on 2018-06-21, 19:38; authored by Linh Tran, Tobias Hamp, Burkhard Rost

About this dataset: These are the 50,000 most reliable PPI predictions for S. pombe, provided as a CSV file.
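
The column layout of the file is not described here, so the following minimal Python sketch only previews the first rows without assuming any column names. The function name `preview_csv` is hypothetical, and the comma delimiter and UTF-8 encoding are assumptions that should be checked against the downloaded file.

```python
import csv
from itertools import islice


def preview_csv(path, n_rows=5):
    """Return the first row and the next n_rows rows of a CSV file.

    Assumption: the file is comma-delimited and UTF-8 encoded.
    The first row may be a header or a data row; no column names are assumed.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        first_row = next(reader, None)          # None if the file is empty
        sample = list(islice(reader, n_rows))   # up to n_rows further rows
    return first_row, sample
```

Calling `preview_csv("spombe.predictions.csv")` on a local copy of the download would return the first row and a small sample, which is enough to see how the protein pairs and any scores are laid out before loading the full file.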


Motivation: Protein-protein interactions (PPIs) play a key role in many cellular processes. Most annotations of PPIs mix experimental and computational data. The mix optimizes coverage, but obscures the origin of each annotation. Some resources excel at focusing on reliable experimental data. Here, we focused on new pairs of interacting proteins for several model organisms, based solely on sequence-based prediction methods.


Results: We extracted reliable experimental data about which proteins interact (binary) for eight diverse model organisms from public databases, namely from Escherichia coli, Schizosaccharomyces pombe, Plasmodium falciparum, Drosophila melanogaster, Caenorhabditis elegans, Mus musculus, Rattus norvegicus, Arabidopsis thaliana, and for the previously used Homo sapiens and Saccharomyces cerevisiae. These data served as the basis for developing a PPI prediction method for each model organism. The method used evolutionary information through a profile-kernel Support Vector Machine (SVM). With the resulting eight models, we predicted all possible protein pairs in each organism and made the top predictions available through a web application. Almost all of the PPIs made available were predicted between proteins that have not been observed in any interaction, in particular for less well-studied organisms. Thus, our work complements existing resources and is particularly helpful for designing experiments because of its uniqueness. Experimental annotations and computational predictions are strongly influenced by the fact that some proteins have many partners and others few. To optimize machine learning, the new methods explicitly ignored such network structure. This might be another strength of our approach. The database interface representing our results is accessible from https://rostlab.org/services/ppipair/.


Please cite us when you are using this data:

@article{tran2018profppidb,
  title={ProfPPIdb: pairs of physical protein-protein interactions predicted for entire proteomes},
  author={Tran, Linh and Hamp, Tobias and Rost, Burkhard},
  journal={bioRxiv},
  pages={332510},
  year={2018},
  publisher={Cold Spring Harbor Laboratory}
}
