The Open Connectome Project Data Cluster: Scalable Analysis and Vision for High-Throughput Neuroscience

Alexander S. Szalay; Daniel R. Berger; Davi D. Bock; Dean Kleissas; Eric Perlman; Jeff Lichtman; Joshua T. Vogelstein; Karl Deisseroth; Kunal Lillaney; Kwanghun Chung

arxiv: 1306.3543 · v2 · pith:3ZMAK3IDnew · submitted 2013-06-15 · 💻 cs.DC · cs.CE· q-bio.NC

The Open Connectome Project Data Cluster: Scalable Analysis and Vision for High-Throughput Neuroscience

Randal Burns , William Gray Roncal , Dean Kleissas , Kunal Lillaney , Priya Manavalan , Eric Perlman , Daniel R. Berger , Davi D. Bock

show 12 more authors

Kwanghun Chung Logan Grosenick Narayanan Kasthuri Nicholas C. Weiler Karl Deisseroth Michael Kazhdan Jeff Lichtman R. Clay Reid Stephen J. Smith Alexander S. Szalay Joshua T. Vogelstein R. Jacob Vogelstein

This is my paper

classification 💻 cs.DC cs.CEq-bio.NC

keywords dataclusterspatialsystemanalysishigh-throughputparallelscalable

0 comments

read the original abstract

We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes---neural connectivity maps of the brain---using the parallel execution of computer vision algorithms on high-performance compute clusters. These services and open-science data sets are publicly available at http://openconnecto.me. The system design inherits much from NoSQL scale-out and data-intensive computing architectures. We distribute data to cluster nodes by partitioning a spatial index. We direct I/O to different systems---reads to parallel disk arrays and writes to solid-state storage---to avoid I/O interference and maximize throughput. All programming interfaces are RESTful Web services, which are simple and stateless, improving scalability and usability. We include a performance evaluation of the production system, highlighting the effectiveness of spatial data organization.

This paper has not been read by Pith yet.

The Open Connectome Project Data Cluster: Scalable Analysis and Vision for High-Throughput Neuroscience

discussion (0)