figshare
Browse

sorry, we can't preview this file

Hel308sequenceDependentData.mat (241.76 MB)

craig2018data

Download (241.76 MB)
dataset
posted on 2018-11-16, 23:30 authored by Jonathan CraigJonathan Craig

This data experimental data for analyzing sequence-dependent translocation of Hel308 helicase. The data can be opened in matlab, loading a data structure called ‘processedData’. ‘processedData’ fields are organized mostly as cell arrays, with the first cell corresponding to the experiment as defined by the field .sequenceNames, and the second cell to individual reads



This structure contains the following fields

.currents - extracted ion current states (picoampere)

.alignments – alignment of ion current states to reference sequence (level number).

.levelToPos – conversion function from level number to DNA position (nt)

.dwellTime – duration of current state.

.class – classification of step type as defined in Craig 2017 (PNAS). 1 = backward following forwards, 2 = forwards following forwards, 3 = forwards following forwards, 4 = backwards following backwards. Probability of backwards step is defined by the expression number of 1 / (number of 1 + number of 3)

.eventCounts – number of analyzed reads

.sequenceNames – sequences names as defined in manuscript

.sequences – full DNA sequences written 5prime to 3prime

.generation – heteromer generation as defined in manuscript

.rawData – ion current vs. time data sampled at 5 kHz (1 data point = 200 microseconds of data)

Funding

High Accuracy Nanopore Sequencing.

National Human Genome Research Institute

Find out more...

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC