Perelman School of Medicine at the University of Pennsylvania

Grice Lab

HmmUFOtu


Latest Version

Click HmmUFOtu for latest version

News

New pre-built database gg_99_otus_GTR available!

Introduction

HmmUFOtu is an HMM based Ultra-fast OTU assignment tool for baterial 16S and amplicon sequencing research, it has two core algorithms, the CSFM-index (Consensus Sequence FM-index) powered banded-HMM algorithm, and SEP (Seed-Estimate-Place) local phylogenetic-placement based taxonomy assignment algorithm.

The main program hmmufotu takes single or paired-end NGS FASTA/FASTQ reads and generate taxonomy assignment results of every read. The main program hmmufotu-sum then generates phylogeny-based OTUs, a reference tree based OTU-tree, and consensus-based representative sequences for the OTUs. See the details on GitHub.

Supported models

HmmUFOtu supports all major DNA substitution models and an optional Discrete Gamma (dΓ) model (Yang 1994) for capturing among-site variations.

Download

Please download the source code (written in pure C++98) or pre-compiled binaries from GitHub.

Pre-built databases

You need to build an HmmUFOtu database before assigning taxonomies to your 16S or other amplicon sequencing reads. You can build your own database using hmmufotu-build (which may take ~10 mins with 6 processors), or alternatively download the pre-built databases below. Note: HmmUFOtu is backwards compatible; you don't need to download the databases again even if HmmUFOtu has been updated since your last download.

  • gg_97_otus_GTR GreenGenes (v13.8) species-level (97% OTU) reference + GTR DNA model. This is recommended for most bacteria 16S studies.
  • gg_97_otus_TN93 GreenGenes (v13.8) species-level (97% OTU) reference + TN93 DNA model
  • gg_97_otus_HKY85 GreenGenes (v13.8) species-level (97% OTU) reference + HKY85 DNA model
  • gg_79_otus_GTR GreenGenes (v13.8) middle-level (79% OTU) reference + GTR DNA model
  • gg_79_otus_TN93 GreenGenes (v13.8) middle-level (79% OTU) reference + TN93 DNA model
  • gg_79_otus_HKY85 GreenGenes (v13.8) middle-level (79% OTU) reference + HKY85 DNA model
  • gg_99_otus_GTR (part0, part1) GreenGenes (v13.8) strain-level (99% OTU) reference + GTR DNA model. Warning: you may need at least 48 GB free memory to use this database.

Citations

Please cite:  Zheng Q, Bartow-McKenney C, Meisel JS, Grice EA. HmmUFOtu: An HMM and phylogenetic placement based ultra-fast taxonomic assignment and OTU picking tool for microbiome amplicon sequencing studies. Genome Biol. 2018 19(1):82. doi: 10.1186/s13059-018-1450-0. PMID: 29950165

Contact us

Please contact Qi Zheng or Elizabeth Grice with any questions.