figshare
Browse
1/1
3 files

Biomarker Benchmark - GSE15296

Version 6 2016-03-17, 22:20
Version 5 2016-03-17, 21:37
Version 4 2016-02-23, 23:38
Version 3 2016-02-04, 23:21
Version 2 2016-02-04, 22:26
Version 1 2016-02-02, 21:56
dataset
posted on 2016-03-17, 22:20 authored by Anna GuyerAnna Guyer, Stephen PiccoloStephen Piccolo

[NOTICE: This data set has been deprecated. Please see our new version of the data (and additional data sets) here: https://osf.io/mhk93 ]


"In the present work, we have used whole genome expression profiling of peripheral blood samples from 51 patients with biopsy-proven acute kidney transplant rejection and 24 patients with excellent function and biopsy-proven normal transplant histology. The results demonstrate that there are 1738 probesets on the Affymetrix HG-U133 Plus 2.0 GeneChip representing 1472 unique genes which are differentially expressed in the peripheral blood during an acute kidney transplant rejection. By ranking these results we have identified minimal sets of 50 to 150 probesets with predictive classification accuracies for AR of greater than 90% established with several different prediction tools including DLDA and PAM. We have demonstrated that a subset of peripheral blood gene expression signatures can also diagnose four different subtypes of AR (Banff Borderline, IA, IB and IIA) and the top 100 ranked classifiers have greater than 89% predictive accuracy. Finally, we have demonstrated that there are gene signatures for early and late AR defined as less than or greater than one year post-transplant with greater than 86% predictive accuracies. We also confirmed that there are 439 time-independent gene classifiers for AR. Based on these results, we conclude that peripheral blood gene expression profiling can be used to diagnose AR at any time in the first 5 years post-transplant in the setting of acute kidney transplant dysfunction not caused by BK nephropathy, other infections, drug-induced nephrotoxicity or ureteral obstruction.

Keywords: kidney transplantation, peripheral blood, DNA microarrays, acute kidney rejection, biomarkers"

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE15296

We have included gene-expression data, the outcome (class) being predicted, and any clinical covariates. When gene-expression data were processed in multiple batches, we have provided batch information. Each data set is organized into a file set, where each contains all pertinent files for an individual dataset. The gene expression files have been normalized using both the SCAN and UPC methods using the SCAN.UPC package in Bioconductor (https://www.bioconductor.org/packages/release/bioc/html/SCAN.UPC.html). We summarized the data at the gene level using the BrainArray resource (http://brainarray.mbni.med.umich.edu/Brainarray/Database/CustomCDF/20.0.0/ensg.asp). We used Ensembl identifiers. The class, clinical, and batch data were hand curated to ensure consistency ("tidy data" formatting). In addition, the data files have been formatted to be imported easily into the ML-Flex machine learning package (http://mlflex.sourceforge.net/).

History

Usage metrics

    Categories

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC