Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND)

Hanson, Robert

Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND)

Posted on 2016-05-04 - 05:00

Abstract Background The presence of population structure in a sample may confound the search for important genetic loci associated with disease. Our four samples in the Family Investigation of Nephropathy and Diabetes (FIND), European Americans, Mexican Americans, African Americans, and American Indians are part of a genome- wide association study in which population structure might be particularly important. We therefore decided to study in detail one component of this, individual genetic ancestry (IGA). From SNPs present on the Affymetrix 6.0 Human SNP array, we identified 3 sets of ancestry informative markers (AIMs), each maximized for the information in one the three contrasts among ancestral populations: Europeans (HAPMAP, CEU), Africans (HAPMAP, YRI and LWK), and Native Americans (full heritage Pima Indians). We estimate IGA and present an algorithm for their standard errors, compare IGA to principal components, emphasize the importance of balancing information in the ancestry informative markers (AIMs), and test the association of IGA with diabetic nephropathy in the combined sample. Results A fixed parental allele maximum likelihood algorithm was applied to the FIND to estimate IGA in four samples: 869 American Indians; 1385 African Americans; 1451 Mexican Americans; and 826 European Americans. When the information in the AIMs is unbalanced, the estimates are incorrect with large error. Individual genetic admixture is highly correlated with principle components for capturing population structure. It takes ~700 SNPs to reduce the average standard error of individual admixture below 0.01. When the samples are combined, the resulting population structure creates associations between IGA and diabetic nephropathy. Conclusions The identified set of AIMs, which include American Indian parental allele frequencies, may be particularly useful for estimating genetic admixture in populations from the Americas. Failure to balance information in maximum likelihood, poly-ancestry models creates biased estimates of individual admixture with large error. This also occurs when estimating IGA using the Bayesian clustering method as implemented in the program STRUCTURE. Odds ratios for the associations of IGA with disease are consistent with what is known about the incidence and prevalence of diabetic nephropathy in these populations.

CITE THIS COLLECTION

DataCite

3 Biotech

3D Printing in Medicine

3D Research

3D-Printed Materials and Systems

4OR

AAPG Bulletin

AAPS Open

AAPS PharmSciTech

Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg

ABI Technik (German)

Academic Medicine

Academic Pediatrics

Academic Psychiatry

Academic Questions

Academy of Management Discoveries

Academy of Management Journal

Academy of Management Learning and Education

Academy of Management Perspectives

Academy of Management Proceedings

Academy of Management Review

Williams, Robert; Elston, Robert; Kumar, Pankaj; Knowler, William; Abboud, Hanna; Adler, Sharon; et al. (2016). Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND). figshare. Collection. https://doi.org/10.6084/m9.figshare.c.3608633.v1

https://doi.org/10.6084/m9.figshare.c.3608633.v1

or

Select your citation style and then place your mouse over the citation text to select it.

SHARE

email

Usage metrics

Read the peer-reviewed publication

Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND)

AUTHORS (33)

RW

Robert Williams

RE

Robert Elston

PK

Pankaj Kumar

WK

William Knowler

HA

Hanna Abboud

SA

Sharon Adler

DB

Donald Bowden

JD

Jasmin Divers

BF

Barry Freedman

RI

Robert Igo

EI

Eli Ipp

SI

Sudha Iyengar

PK

Paul Kimmel

MK

Michael Klag

OK

Orly Kohn

CL

Carl Langefeld

DL

David Leehey

RN

Robert Nelson

SN

Susanne Nicholas

MP

Madeleine Pahl

KEYWORDS

Individual genetic ancestry Population structure SNP Diabetic nephropathy

Search Collections

need help?

Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND)

CITE THIS COLLECTION

SHARE

Usage metrics

Read the peer-reviewed publication

AUTHORS (33)

CATEGORIES

KEYWORDS