Software
Categories: pipelines, bioinformatics tools, and statistics and visualization tools.
Pipelines
Sunbeam
Sunbeam is a modular, extensible pipeline for the analysis of metagenomic sequencing data.
Sunbeam Project on GitHub
Sunbeam Documentation
Current developers: Charlie Bushman, Kyle Bittinger (PI)
Language: Python, Bash
License: GPL2+
Citation: Clarke EL, Taylor LJ, Zhao C, Connell A, Lee JJ, Fett B, Bushman FD, Bittinger K. Sunbeam: an extensible pipeline for analyzing metagenomic sequencing experiments. Microbiome. 2019 Mar 22;7(1):46. PMID: 30902113; PMCID: PMC6429786.
INSPIIRED
A software suite designed to study viral integration sites and the longitudinal outcomes of gene therapy patients.
INSPIIRED Project on GitHub
Current developers: John Everett, Rick Bushman (PI)
Language: R
License: GPL3
Citation: Berry CC, Nobles C, Six E, Wu Y, Malani N, Sherman E, Dryga A, Everett JK, Male F, Bailey A, Bittinger K, Drake MJ, Caccavelli L, Bates P, Hacein-Bey-Abina S, Cavazzana M, Bushman FD. INSPIIRED: Quantification and Visualization Tools for Analyzing Integration Site Distributions. Mol Ther Methods Clin Dev. 2016 Dec 18;4:17-26. PMID: 28344988; PMCID: PMC5363318.
Bioinformatics
Unassigner
Species-level assignment for 16S rRNA marker gene sequences, but with a twist.
Unassigner Project on GitHub
Current developers: Ceylan Tanes, Kyle Bittinger (PI)
Language: Python
License: GPL2+
GNUVID
Gene Novelty Unit-based Virus IDentification for SARS-CoV-2.
GNUVID Project on GitHub
Current developers: Ahmed Moustafa (PI), Paul Planet (PI)
Language: Python
License: GPL3
Citation: Moustafa AM, Planet PJ. Emerging SARS-CoV-2 Diversity Revealed by Rapid Whole-Genome Sequence Typing. Genome Biol Evol. 2021 Sep 1;13(9):evab197. PMID: 34432021; PMCID: PMC8449825.
WhatsGNU
What's Gene Novelty Unit: A Tool For Identifying Proteomic Novelty.
WhatsGNU Project on GitHub
Current developers: Ahmed Moustafa (PI), Paul Planet (PI)
Language: Python
License: GPL3
Citation: Moustafa AM, Planet PJ. WhatsGNU: a tool for identifying proteomic novelty. Genome Biol. 2020 Mar 5;21(1):58. PMID: 32138767; PMCID: PMC7059281.
BROCC
Consensus-based taxonomic assignment using BLAST results.
BROCC Project on GitHub
Current developer: Kyle Bittinger (PI)
Language: Python
License: GPL3
Citation: Dollive S, Peterfreund GL, Sherrill-Mix S, Bittinger K, Sinha R, Hoffmann C, Nabel CS, Hill DA, Artis D, Bachman MA, Custers-Allen R, Grunberg S, Wu GD, Lewis JD, Bushman FD. A tool kit for quantifying eukaryotic rRNA gene sequences from human microbiome samples. Genome Biol. 2012 Jul 3;13(7):R60. PMID: 22759449; PMCID: PMC4053730.
DNAbc
Identify DNA barcodes in FASTQ data files and write demultiplexed data.
DNAbc Project on GitHub
Current developer: Kyle Bittinger (PI)
Language: Python
License: GPL2
primertrim
Detect short primer sequences in FASTQ reads and trim the reads accordingly.
primertrim Project on GitHub
Current developer: Charlie Bushman, Kyle Bittinger (PI)
Language: Python
License: GPL2
Stackebrandt Curves
Compare whole-genome similarity to 16S rRNA gene similarity.
Stackebrandt Curves Project on GitHub
Current developer: Kyle Bittinger (PI)
Language: Python
License: GPL3
okfasta
Utilities for FASTA-format sequence files, implemented in pure Python.
okfasta Project on GitHub
Current developer: Kyle Bittinger
Language: Python
License: MIT
Statistics and Visualization
mirix
Model the susceptibility of a bacterial community to antibiotics.
mirix Project on GitHub
Current developers: Ceylan Tanes, Kyle Bittinger (PI)
Language: R
License: GPL3
Citation: Tu V, Ren Y, Tanes C, Mukhopadhyay S, Daniel SG, Li H, Bittinger K. A quantitative approach to measure and predict microbiome response to antibiotics. bioRxiv 2023.01.27.525904.
ZIBR
Zero-inflated beta random effect model for microbiome data.
ZIBR Project on GitHub
Current developers: Eric Chen, Hongzhe Li (PI)
Language: R
License: GPL2
Citation: Chen EZ, Li H. A two-part mixed-effects model for analyzing longitudinal microbiome compositional data. Bioinformatics. 2016 Sep 1;32(17):2611-7. Epub 2016 May 14. PMID: 27187200; PMCID: PMC5860434.
ZIR
Zero-Inflated (Wilcoxon and Kruskal–Wallis) Rank Test.
ZIR Project on GitHub
Current developers: Eric Chen, Hongzhe Li (PI)
Language: R
License: GPL2
Citation: Wanjie Wang, Eric Z. Chen, Hongzhe Li. Truncated Rank-Based Tests for Two-Part Models with Excessive Zeros and Applications to Microbiome Data.
microflowseq
Analyze microbiome data from bacterial cell sorting (mFLOW-Seq) experiments.
microflowseq Project on GitHub
Current developers: Ceylan Tanes, Charlie Bushman, Kyle Bittinger (PI)
Language: R
License: MIT
Citation: Conrey PE, Denu L, O'Boyle KC, Rozich I, Green J, Maslanka J, Lubin JB, Duranova T, Haltzman BL, Gianchetti L, Oldridge DA, De Luna N, Vella LA, Allman D, Spergel JM, Tanes C, Bittinger K, Henrickson SE, Silverman MA. IgA deficiency destabilizes homeostasis toward intestinal microbes and increases systemic immune dysregulation. Sci Immunol. 2023 May 26;8(83):eade2335. Epub 2023 May 26. PMID: 37235682.
polyafit
Identify enriched features in paired microbiome samples.
polyafit Project on GitHub
Current developer: Kyle Bittinger (PI)
Language: R
License: GPL2+
Citation: Charlson ES, Bittinger K, Chen J, Diamond JM, Li H, Collman RG, Bushman FD. Assessing bacterial populations in the lung by replicate analysis of samples from the upper and lower respiratory tracts. PLoS One. 2012;7(9):e42786. Epub 2012 Sep 6. PMID: 22970118; PMCID: PMC3435383.
usedist
Efficiently create, modify, and extract information from distance matrices.
usedist Project on GitHub
Current developer: Kyle Bittinger
Language: R
License: GPL3
adonisplus
A data-pipeline-friendly interface to the PERMANOVA or Adonis test.
adonisplus Project on GitHub
Current developer: Kyle Bittinger
Language: R
License: GPL3
abdiv
Reference implementation for measures of alpha and beta diversity.
abdiv Project on GitHub
Current developer: Kyle Bittinger
Language: R
License: MIT
pheatbuilder
A data-pipeline-friendly interface to build heatmap charts.
pheatbuilder Project on GitHub
Current developer: Kyle Bittinger
Language: R
License: GPL3
taxafmt
Format and parse taxonomic assignments, especially for microbiome data.
taxafmt Project on GitHub
Current developer: Kyle Bittinger
Language: R
License: MIT