figshare

Function Unknown Families of homologous proteins (FUnkFams)

Published by Katherine Pollard
The number and proportion of genes with no known function are growing rapidly. To quantify this phenomenon and provide criteria for prioritizing genes for functional characterization, we developed a bioinformatics pipeline that identifies robustly defined protein families with no annotated domains, ranks these with respect to phylogenetic breadth, and identifies them in metagenomics data. We applied this approach to 271,965 protein families from the SFams database and discovered many with no functional annotation, including >118,000 families lacking any known protein domain. From these, we prioritized 6,668 conserved protein families with at least three sequences from organisms in at least two distinct classes. This project catalogs data associated with these Function Unknown Families (FUnkFams), a “most wanted” list of genes to functionally characterize.
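The prioritization criteria described above (no annotated domain, at least three sequences, organisms from at least two distinct classes) can be illustrated with a minimal sketch. This is not the project's pipeline: the `Member` and `Family` types, the field names, and the use of the count of distinct classes as a stand-in for phylogenetic breadth are all assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Member:
    """One protein sequence in a family (hypothetical record layout)."""
    sequence_id: str
    taxonomic_class: str  # class-level taxon of the source organism


@dataclass
class Family:
    """A protein family with its members and domain-annotation status."""
    family_id: str
    members: list[Member]
    has_annotated_domain: bool


def class_count(family: Family) -> int:
    """Number of distinct taxonomic classes represented in the family."""
    return len({m.taxonomic_class for m in family.members})


def is_prioritized(family: Family, min_sequences: int = 3, min_classes: int = 2) -> bool:
    """Apply the thresholds stated in the abstract to a single family."""
    if family.has_annotated_domain:
        return False
    return len(family.members) >= min_sequences and class_count(family) >= min_classes


def rank_by_breadth(families: list[Family]) -> list[Family]:
    """Keep prioritized families and sort them by distinct-class count, widest first.

    Assumption: distinct-class count is used here as a simple proxy for
    phylogenetic breadth; the actual ranking measure may differ.
    """
    kept = [f for f in families if is_prioritized(f)]
    return sorted(kept, key=class_count, reverse=True)
```

Given a list of `Family` records built from any source, `rank_by_breadth(families)` returns only the families that pass all three criteria, with the most broadly distributed ones first.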

Funding

NSF (#DMS-1563159), Gordon & Betty Moore Foundation (#3300)
