Poretools a toolkit for analyzing nanopore sequence data, bioRxiv, 2014-07-24

Motivation Nanopore sequencing may be the next disruptive technology in genomics. Nanopore sequencing has many attractive properties including the ability to detect single DNA molecules without prior amplification, the lack of reliance on expensive optical components, and the ability to sequence very long fragments. The MinION from Oxford Nanopore Technologies (ONT) is the first nanopore sequencer to be commercialised and made available to early-access users. The MinION(TM) is a USB-connected, portable nanopore sequencer which permits real-time analysis of streaming event data. A cloud-based service is available to translate events into nucleotide base calls. However, software support to deal with such data is limited, and the community lacks a standardized toolkit for the analysis of nanopore datasets. Results We introduce poretools, a flexible toolkit for manipulating and exploring datasets generated by nanopore sequencing devices from MinION for the purposes of quality control and downstream analysis. Poretools operates directly on the native FAST5 (a variant of the HDF5 standard) file format produced by ONT and provides a wealth of format conversion utilities and data exploration and visualization tools. Availability and implementation Poretools is open source software and is written in Python as both a suite of command line utilities and a Python application programming interface. Source code and user documentation are freely available in Github at httpsgithub.comarq5xporetools Contact n.j.loman@bham.ac.uk, aaronquinlan@gmail.com Supplementary information An IPython notebook demonstrating the use and functionality of poretools in greater detail is available from the Github repository.

biorxiv bioinformatics 0-100-users 2014

 

Created with the audiences framework by Jedidiah Carlson

Powered by Hugo