1479. PathogenDB: A modular software suite integrating genomic clinical microbiology & epidemiology
Session: Poster Abstract Session: HAI: Surveillance and Public Reporting
Friday, October 28, 2016
Room: Poster Hall
Background: Next-generation sequencing (NGS) technologies have reduced the cost of acquiring genomic data from active infections in hospitals, making it possible to characterize patient-to-patient transmission rapidly and with high precision. A barrier to widespread adoption of NGS in clinical microbiology is the lack of easy-to-use software for converting these data into species identifications, phylogenies, and drug susceptibilities. An ideal clinical application would provide a unified pipeline that can be deployed in a clinical microbiology lab and run semi-automated analyses that inform infection control interventions.

Methods: We developed a modular open-source software suite called PathogenDB that implements the major functions needed for genomic clinical microbiology and pathogen surveillance. A central laboratory information management system runs on a standard open-source Linux/Apache/MySQL/PHP stack. A modular genomics workflow, PathogenDB-pipeline, was publicly released in 2014. It automates de novo assembly of reads with HGAP, circularizes contigs with Circlator, annotates genes with Prokka, and predicts epigenetic motifs. The pipeline also post-processes assemblies to evaluate their quality and provides visualizations through a custom genome browser (ChromoZoom). A comparative genomics module, PathogenDB-comparison, performs semi-automated phylogenetic analysis with Mugsy and RAxML.
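
As a rough illustration of how the stages described above might be chained, the following Python sketch models each tool as a stage with explicit prerequisites and derives a valid run order. It is only a sketch under stated assumptions: the stage names, the `Stage` class, the `execution_order` function, and the dependency edges (for example, that alignment with Mugsy consumes circularized assemblies) are hypothetical and are not taken from the PathogenDB source code. No real tool is invoked, and the motif-prediction step is omitted because the abstract does not name its tool.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Stage:
    name: str                  # hypothetical stage label
    tool: str                  # tool named in the abstract
    requires: Tuple[str, ...]  # stages that must finish first (assumed)


# Tools come from the abstract; the dependency edges are illustrative guesses.
STAGES = [
    Stage("assemble", "HGAP", ()),
    Stage("circularize", "Circlator", ("assemble",)),
    Stage("annotate", "Prokka", ("circularize",)),
    Stage("align", "Mugsy", ("circularize",)),
    Stage("phylogeny", "RAxML", ("align",)),
]


def execution_order(stages: List[Stage]) -> List[str]:
    """Return stage names in an order that respects every prerequisite."""
    done: List[str] = []
    remaining = list(stages)
    while remaining:
        # A stage is ready once all of its prerequisites have completed.
        ready = [s for s in remaining if all(r in done for r in s.requires)]
        if not ready:
            raise ValueError("cyclic or missing dependency")
        for stage in ready:
            done.append(stage.name)
            remaining.remove(stage)
    return done


if __name__ == "__main__":
    tools = {s.name: s.tool for s in STAGES}
    for name in execution_order(STAGES):
        print(f"{name}: {tools[name]}")
```

Declaring prerequisites per stage, rather than hard-coding a fixed sequence, is one way to keep such a workflow modular: a stage can be added or swapped without rewriting the others.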

Results: PathogenDB-pipeline has been used to assemble and annotate 232 genomes from 7 species, and runs end-to-end in under 12 hours. At an urban tertiary-care hospital, PathogenDB-comparison has been used to genomically characterize one MRSA outbreak, two transmissions via solid organ transplant, and pseudo-outbreaks of Stenotrophomonas maltophilia and Burkholderia cepacia. Both software packages are freely available on GitHub.

Conclusion: We have created modular, open-source software that automates significant portions of a genomic clinical microbiology workflow and can characterize transmissions within an outbreak. Further work could add visualizations based on epidemiological trend data and geospatial analysis, giving rapid insight into transmission events and potential outbreaks occurring within an NGS-equipped hospital.

Theodore Pak, AB, Mitchell Sullivan, PhD, Oliver Attie, PhD, Elizabeth Webster, BS, Andrew Kasarskis, PhD, Harm Van Bakel, PhD and Ali Bashir, PhD, Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY


T. Pak, None

M. Sullivan, None

O. Attie, None

E. Webster, None

A. Kasarskis, None

H. Van Bakel, None

A. Bashir, None


Findings in the abstracts are embargoed until 12:01 a.m. CDT, Wednesday Oct. 26th with the exception of research findings presented at the IDWeek press conferences.