Welcome to Mycobacterium tuberculosis H37Rv

Introduction

Most bacterial ORFs are identified by automated prediction algorithms. However, these algorithms often fail to identify ORFs lacking canonical features such as a length of >50 codons or the presence of an upstream Shine-Dalgarno sequence.

Here, we use ribosome profiling approaches to identify actively translated ORFs in Mycobacterium tuberculosis. Most of the ORFs we identify have not been previously described, indicating that the M. tuberculosis transcriptome is pervasively translated. The newly described ORFs are predominantly short, with many encoding proteins of ≤50 amino acids. Codon usage of the newly discovered ORFs suggests that most have not been subject to purifying selection, and hence are unlikely to contribute to cell fitness. Nevertheless, we identify 90 new ORFs (median length of 52 codons) that bear the hallmarks of purifying selection. Thus, our data suggest that pervasive translation of short ORFs in Mycobacterium tuberculosis serves as a rich source for the evolution of new functional proteins.

Publication

The publication describing this project can be accessed via the following DOI: https://doi.org/10.7554/eLife.73980

Data availability

The raw sequencing data used in this study can be accessed via the European Nucleotide Archive repositories with accession numbers E-MTAB-8039 and E-MTAB-10695.

The processed files used in this JBrowse2 genome browser can be downloaded here.

JBrowse2

Access Area

Original data

Annotation tracks

We provide multiple annotation tracks for the Mycobacterium tuberculosis H37Rv genome.
All annotation files can be loaded and unloaded using their respective check box in the track selector.
Annotation files are indexed and specific genes can be searched by gene name, locus tag or gene id using the search box in the top center of the JBrowse2 instance.
This JBrowse2 instance includes:

Track Description
The Reference Annotation The reference annotation for Mycobacterium tuberculosis H37Rv
mtb TSS Transcription start sites
Putative ORFs List of putative ORFs (potential isoforms or novel ORFs)
Process Sites Process Sites

Coverage tracks

Coverage tracks show the normalized read counts per nucleotide position. There are multiple mapping types used (threeprime/fiveprime/global). The mapping method of each file is marked in their description.

Shareable Links

JBrowse2 allows you to easily share observations with your colleagues and collaborators.
Using the "share"-button next to the session name, you can automatically generate a link. Anyone that uses this link will see the exact session you are currently looking at. This includes colors, zoom level, scale, loaded tracks etc...
This is a very powerful tool for sharing potential discoveries.

JBrowse2 user guide

For more information on how to use JBrowse2, please have a look at the comprehensive user guide provided by the JBrowse team. The chapter on basic usage is highly recommended to get familiar with the JBrowse2 interface.

Contributors

This project is a collaboration between the universities of Freiburg and Würzburg.

Wade Lab
  • Prof. Dr. Joseph T. Wade
  • Archer J Wang
  • Carol Smith
Derbyshire and Gray Lab
  • Dr. Todd A Gray
  • Dr. Keith Derbyshire
  • Jill G Canestrari
Champion Lab
  • Prof. Dr. Matthew M Champion

References

  1. Smith C., Canestrari JG., Wang AJ., Champion MM., Derbyshire KM., Gray TA. and Wade JT. Pervasive translation in Mycobacterium tuberculosis. eLife 11:e73980.
    https://doi.org/10.7554/eLife.73980