Pan-genome of 10,000 E. coli isolates
Version 2 2020-09-25, 14:29Version 2 2020-09-25, 14:29
Version 1 2020-09-21, 09:56Version 1 2020-09-21, 09:56
dataset
posted on 2020-09-25, 14:29 authored by Gal HoreshGal Horesh, Eva HeinzEva Heinz, Nick ThomsonThe complete metadata of 10,146 high quality E. coli genomes isolated from human hosts (F1).
Description and complete profiling of 50 E. coli lineages which represent the majority of publicly available human-isolated E. coli genomes (F2).
Phylogenetic trees presented in the manuscript (with 500 genomes and with 50 genomes).
The complete pan-genome of the collection which includes:
A FASTA file containing the representative sequence of each gene of the gene pool (F3).
Complete gene presence-absence across all isolates (F4).
The frequency of each gene within each of the lineages (F5)
Representative sequences from each lineage of the final set of genes in the gene-pool (i.e. a representative sequence from each lineage) (F6)
Funding
Wellcome Sanger Institute [206194]
History
Usage metrics
Categories
Keywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC