pith. sign in

arxiv: 1710.01278 · v1 · pith:NKK2UM5Gnew · submitted 2017-10-03 · 🧬 q-bio.GN · q-bio.QM· stat.ML

Dilated Convolutions for Modeling Long-Distance Genomic Dependencies

classification 🧬 q-bio.GN q-bio.QMstat.ML
keywords long-distanceconvolutionsdependenciesdilatedmodelinggenomehumanmodel
0
0 comments X
read the original abstract

We consider the task of detecting regulatory elements in the human genome directly from raw DNA. Past work has focused on small snippets of DNA, making it difficult to model long-distance dependencies that arise from DNA's 3-dimensional conformation. In order to study long-distance dependencies, we develop and release a novel dataset for a larger-context modeling task. Using this new data set we model long-distance interactions using dilated convolutional neural networks, and compare them to standard convolutions and recurrent neural networks. We show that dilated convolutions are effective at modeling the locations of regulatory markers in the human genome, such as transcription factor binding sites, histone modifications, and DNAse hypersensitivity sites.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.