pith. machine review for the scientific record. sign in

arxiv: 1508.02930 · v1 · submitted 2015-08-04 · 🧬 q-bio.GN · math.PR

Recognition: unknown

End-to-End Optimization of High Throughput DNA Sequencing

Authors on Pith no claims yet
classification 🧬 q-bio.GN math.PR
keywords processsequencingfragmentsbaseend-to-endensemblefragmentgeometry
0
0 comments X
read the original abstract

At the core of high throughput DNA sequencing platforms lies a bio-physical surface process that results in a random geometry of clusters of homogenous short DNA fragments typically hundreds of base pairs long - bridge amplification. The statistical properties of this random process and length of the fragments are critical as they affect the information that can be subsequently extracted, i.e., density of successfully inferred DNA fragment reads. The ensemble of overlapping DNA fragment reads are then used to computationally reconstruct the much longer target genome sequence, e.g, ranging from hundreds of thousands to billions of base pairs. The success of the reconstruction in turn depends on having a sufficiently large ensemble of DNA fragments that are sufficiently long. In this paper using stochastic geometry we model and optimize the end-to-end process linking and partially controlling the statistics of the physical processes to the success of the computational step. This provides, for the first time, a framework capturing salient features of such sequencing platforms that can be used to study cost, performance or sensitivity of the sequencing process.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.