XMAn-A Homo sapiens Mutated Cancer Peptides Database
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
To enable the identification of mutated peptide sequences in complex
biological samples, in this work, a cancer protein database with mutation information collected from several
public resources such as COSMIC, IARC P53, OMIM and UniProtKB, was developed. In-house
developed Perl-scripts were used to search and process the data, and to
translate each gene-level mutation into a mutated peptide sequence. The
cancer mutation database comprises a total of 872,125 peptide entries from 25,642 protein IDs. A description
line for each entry provides the parent protein ID and name, the cDNA- and
protein-level mutation site and type, the originating database, and the cancer tissue type and corresponding hits. The database is FASTA
formatted to enable data retrieval by commonly used tandem MS search engines.