XMAn-A Homo sapiens Mutated Disease Peptides Database
datasetposted on 16.08.2016 by Iulia M. Lazar
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
To enable the identification of mutated peptide sequences in complex biological samples, in this work, a disease protein database with mutation information collected from several public resources such as OMIM and UniProtKB, was developed. In-house developed Perl-scripts were used to search and process the data, and to translate each gene-level mutation into a mutated peptide sequence. The disease mutation database comprises a total of 27,148 peptide entries from 2913 protein IDs. A description line for each entry provides the parent protein ID and name, the cDNA- and protein-level mutation site and type, the originating database, and the tissue type and corresponding hits. The database is FASTA formatted to enable data retrieval by commonly used tandem MS search engines.