Overview 2014

Overview 2012

Overview 2010

Overview 2009

Overview 2008

ExpaRNA-P: simultaneous exact pattern matching and folding of RNAs

Christina Otto, Mathias Mohl, Steffen Heyne, Mika Amit, Gad M. Landau, Rolf Backofen, Sebastian Will

In: BMC Bioinformatics, 2014, 15(1), 6602

BackgroundIdentifying sequence-structure motifs common to two RNAs can speed up the comparison of structural RNAs substantially. The core algorithm of the existent approach ExpaRNA solves this problem for a priori known input structures. However, such structures are rarely known; moreover, predicting them computationally is no rescue, since single sequence structure prediction is highly unreliable.ResultsThe novel algorithm ExpaRNA-P computes exactly matching sequence-structure motifs in entire Boltzmann-distributed structure ensembles of two RNAs; thereby we match and fold RNAs simultaneously, analogous to the well-known inverted question marksimultaneous alignment and folding inverted question mark of RNAs. While this implies much higher flexibility compared to ExpaRNA, ExpaRNA-P has the same very low complexity (quadratic in time and space), which is enabled by its novel structure ensemble-based sparsification. Furthermore, we devise a generalized chaining algorithm to compute compatible subsets of ExpaRNA-P inverted question marks sequence-structure motifs. Resulting in the very fast RNA alignment approach ExpLoc-P, we utilize the best chain as anchor constraints for the sequence-structure alignment tool LocARNA. ExpLoc-P is benchmarked in several variants and versus state-of-the-art approaches. In particular, we formally introduce and evaluate strict and relaxed variants of the problem; the latter makes the approach sensitive to compensatory mutations. Across a benchmark set of typical non-coding RNAs, ExpLoc-P has similar accuracy to LocARNA but is four times faster (in both variants), while it achieves a speed-up over 30-fold for the longest benchmark sequences ( inverted question mark400nt). Finally, different ExpLoc-P variants enable tailoring of the method to specific application scenarios. ExpaRNA-P and ExpLoc-P are distributed as part of the LocARNA package. The source code is freely available at http://www.bioinf.uni-freiburg.de/Software/ExpaRNA-P.Conclusions ExpaRNA-P inverted question marks novel ensemble-based sparsification reduces its complexity to quadratic time and space. Thereby, ExpaRNA-P significantly speeds up sequence-structure alignment while maintaining the alignment quality. Different ExpaRNA-P variants support a wide range of applications.

Available:
pdf (1879 KB)   doi:10.1186/s12859-014-0404-0   pmid:25551362   BibTeX Entry ( Otto_Mohl_Heyne-ExpaR_simul_exact-2014 )

Local Exact Pattern Matching for Non-Fixed RNA Structures.

Mika Amit, Rolf Backofen, Steffen Heyne, Gad M. Landau, Mathias Möhl, Christina Otto, Sebastian Will

In: IEEE/ACM Trans. Comput. Biology Bioinform., 2014, 11(1), 219-230

Detecting local common sequence-structure regions of RNAs is a biologically important problem. Detecting such regions allows biologists to identify functionally relevant similarities between the inspected molecules. We developed dynamic programming algorithms for finding common structure-sequence patterns between two RNAs. The RNAs are given by their sequence and a set of potential base pairs with associated probabilities. In contrast to prior work on local pattern matching of RNAs, we support the breaking of arcs. This allows us to add flexibility over matching only fixed structures; potentially matching only a similar subset of specified base pairs. We present an O(n3) algorithm for local exact pattern matching between two nested RNAs, and an O(n3 log n) algorithm for one nested RNA and one bounded-unlimited RNA. In addition, an algorithm for approximate pattern matching is introduced that for two given nested RNAs and a number k, finds the maximal local pattern matching score between the two RNAs with at most k mismatches in O(n3k2) time. Finally, we present an O(n3) algorithm for finding the most similar subforest between two nested RNAs.

Available:
pdf (784 KB)   doi:10.1109/TCBB.2013.2297113   BibTeX Entry ( amit14:_local_exact_patter_match_non_fixed_struc )

Activation of a GPCR leads to eIF4G phosphorylation at the 5' cap and to IRES-dependent translation

Kelly Leon, Thomas Boulo, Astrid Musnier, Julia Morales, Christophe Gauthier, Laurence Dupuy, Steffen Heyne, Rolf Backofen, Anne Poupon, Patrick Cormier, Eric Reiter, Pascale Crepieux

In: J Mol Endocrinol, 2014, 52(3), 373-82

The control of mRNA translation has been mainly explored in response to activated tyrosine kinase receptors. In contrast, mechanistic details on the translational machinery are far less available in the case of ligand-bound G protein-coupled receptors (GPCRs). In this study, using the FSH receptor (FSH-R) as a model receptor, we demonstrate that part of the translational regulations occurs by phosphorylation of the translation pre-initiation complex scaffold protein, eukaryotic initiation factor 4G (eIF4G), in HEK293 cells stably expressing the FSH-R. This phosphorylation event occurred when eIF4G was bound to the mRNA 5' cap, and probably involves mammalian target of rapamycin. This regulation might contribute to cap-dependent translation in response to FSH. The cap-binding protein eIF4E also had its phosphorylation level enhanced upon FSH stimulation. We also show that FSH-induced signaling not only led to cap-dependent translation but also to internal ribosome entry site (IRES)-dependent translation of some mRNA. These data add detailed information on the molecular bases underlying the regulation of selective mRNA translation by a GPCR, and a topological model recapitulating these mechanisms is proposed.

Available:
doi:10.1530/JME-14-0009   pmid:24711644   BibTeX Entry ( Leon_Boulo_Musnier-Activ_GPCR_leads-2014 )

Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches

Nuno D. Mendes, Steffen Heyne, Ana T. Freitas, Marie-France Sagot, Rolf Backofen

In: Bioinformatics, 2012, 28(23), 3034-41

MOTIVATION: The computational search for novel microRNA (miRNA) precursors often involves some sort of structural analysis with the aim of identifying which type of structures are prone to being recognized and processed by the cellular miRNA-maturation machinery. A natural way to tackle this problem is to perform clustering over the candidate structures along with known miRNA precursor structures. Mixed clusters allow then the identification of candidates that are similar to known precursors. Given the large number of pre-miRNA candidates that can be identified in single-genome approaches, even after applying several filters for precursor robustness and stability, a conventional structural clustering approach is unfeasible. RESULTS: We propose a method to represent candidate structures in a feature space, which summarizes key sequence/structure characteristics of each candidate. We demonstrate that proximity in this feature space is related to sequence/structure similarity, and we select candidates that have a high similarity to known precursors. Additional filtering steps are then applied to further reduce the number of candidates to those with greater transcriptional potential. Our method is compared with another single-genome method (TripletSVM) in two datasets, showing better performance in one and comparable performance in the other, for larger training sets. Additionally, we show that our approach allows for a better interpretation of the results. AVAILABILITY AND IMPLEMENTATION: The MinDist method is implemented using Perl scripts and is freely available at http://www.cravela.org/?mindist=1. CONTACT: backofen@informatik.uni-freiburg.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Available:
supplement.pdf (986 KB)   pdf (2048 KB)   doi:10.1093/bioinformatics/bts574   pmid:23052038   BibTeX Entry ( Mendes_Heyne_Freitas-Navig_the_unexp-2012 )

GraphClust: alignment-free structural clustering of local RNA secondary structures

Steffen Heyne, Fabrizio Costa, Dominic Rose, Rolf Backofen

In: Bioinformatics, 2012, 28(12), i224-i232

MOTIVATION: Clustering according to sequence-structure similarity has now become a generally accepted scheme for ncRNA annotation. Its application to complete genomic sequences as well as whole transcriptomes is therefore desirable but hindered by extremely high computational costs. RESULTS: We present a novel linear-time, alignment-free method for comparing and clustering RNAs according to sequence and structure. The approach scales to datasets of hundreds of thousands of sequences. The quality of the retrieved clusters has been benchmarked against known ncRNA datasets and is comparable to state-of-the-art sequence-structure methods although achieving speedups of several orders of magnitude. A selection of applications aiming at the detection of novel structural ncRNAs are presented. Exemplarily, we predicted local structural elements specific to lincRNAs likely functionally associating involved transcripts to vital processes of the human nervous system. In total, we predicted 349 local structural RNA elements. AVAILABILITY: The GraphClust pipeline is available on request. CONTACT: backofen@informatik.uni-freiburg.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Available:
pdf (483 KB)   supplement.pdf (1000 KB)   doi:10.1093/bioinformatics/bts224   pmid:22689765   BibTeX Entry ( Heyne:Costa:Rose:Graph_align_struc:2012 )

Exact Pattern Matching for RNA Structure Ensembles

Christina Schmiedl, Mathias Möhl, Steffen Heyne, Mika Amit, Gad M. Landau, Sebastian Will, Rolf Backofen

In: Proceedings of the 16th International Conference on Research in Computational Molecular Biology (RECOMB 2012), LNCS, 2012, 7262, 245-260

ExpaRNA's core algorithm computes, for two fixed RNA structures, a maximal non-overlapping set of maximal exact matchings. We introduce an algorithm ExpaRNA-P that solves the lifted problem of finding such sets of exact matchings in entire Boltzmann-distributed structure ensembles of two RNAs. Due to a novel kind of structural sparsification, the new algorithm maintains the time and space complexity of the algorithm for fixed input structures. Furthermore, we generalized the chaining algorithm of ExpaRNA in order to compute a compatible subset of ExpaRNA-P's exact matchings.We show that ExpaRNA-P outperforms ExpaRNA in BRAliBase 2.1 benchmarks, where we pass the chained exact matchings as anchor constraints to the RNA alignment tool LocARNA. Compared to LocARNA, this novel approach shows similar accuracy but is six times faster.

Available:
pdf (1799 KB)   doi:10.1007/978-3-642-29627-7_27   BibTeX Entry ( Schmiedl:etal:_exparnap:RECOMB2012 )

Local Exact Pattern Matching for Non-fixed RNA Structures

Mika Amit, Rolf Backofen, Steffen Heyne, Gad M. Landau, Mathias Möhl, Christina Schmiedl, Sebastian Will

In: Proceedings of the 23th Annual Symposium on Combinatorial Pattern Matching (CPM 2012), LNCS, 2012, 7354, 306-320

Detecting local common sequence-structure regions of RNAs is a biologically meaningful problem. By detecting such regions, biologists are able to identify functional similarity between the inspected molecules. We developed dynamic programming algorithms for finding common structure-sequence patterns between two RNAs. The RNAs are given by their sequence and a set of potential base pairs with associated probabilities. In contrast to prior work which matches fixed structures, we support the arc breaking edit operation; this allows to match only a subset of the given base pairs. We present an O(n3) algorithm for local exact pattern matching between two nested RNAs, and an O(n3 logn) algorithm for one nested RNA and one bounded-unlimited RNA.

Available:
pdf (650 KB)   doi:10.1007/978-3-642-31265-6_25   BibTeX Entry ( Amit-local_exact_pattern_matching-CPM2012 )

Freiburg RNA Tools: a web server integrating IntaRNA, ExpaRNA and LocARNA

Cameron Smith, Steffen Heyne, Andreas S. Richter, Sebastian Will, Rolf Backofen

In: Nucleic Acids Research, 2010, 38 Suppl, W373-7

The Freiburg RNA tools web server integrates three tools for the advanced analysis of RNA in a common web-based user interface. The tools IntaRNA, ExpaRNA and LocARNA support the prediction of RNA-RNA interaction, exact RNA matching and alignment of RNA, respectively. The Freiburg RNA tools web server and the software packages of the stand-alone tools are freely accessible at http://rna.informatik.uni-freiburg.de.

Available:
pdf (1502 KB)   doi:10.1093/nar/gkq316   pmid:20444875   BibTeX Entry ( Smith:Heyne:Richter:Freib_RNA_Tools:NAR2010 )

Lightweight Comparison of RNAs Based on Exact Sequence-Structure Matches

Steffen Heyne, Sebastian Will, Michael Beckstette, Rolf Backofen

In: Bioinformatics, 2009, 25(16), 2095-2102

MOTIVATION: Specific functions of ribonucleic acid (RNA) molecules are often associated with different motifs in the RNA structure. The key feature that forms such an RNA motif is the combination of sequence and structure properties. In this article, we introduce a new RNA sequence-structure comparison method which maintains exact matching substructures. Existing common substructures are treated as whole unit while variability is allowed between such structural motifs. Based on a fast detectable set of overlapping and crossing substructure matches for two nested RNA secondary structures, our method ExpaRNA (exact pattern of alignment of RNA) computes the longest collinear sequence of substructures common to two RNAs in O(H*nm) time and O(nm) space, where H << n.m for real RNA structures. Applied to different RNAs, our method correctly identifies sequence-structure similarities between two RNAs. RESULTS: We have compared ExpaRNA with two other alignment methods that work with given RNA structures, namely RNAforester and RNA_align. The results are in good agreement, but can be obtained in a fraction of running time, in particular for larger RNAs. We have also used ExpaRNA to speed up state-of-the-art Sankoff-style alignment tools like LocARNA, and observe a tradeoff between quality and speed. However, we get a speedup of 4.25 even in the highest quality setting, where the quality of the produced alignment is comparable to that of LocARNA alone. AVAILABILITY: The presented algorithm is implemented in the program ExpaRNA, which is available from our website (http://www.bioinf.uni-freiburg.de/Software).

Available:
supplementary_data.pdf (632 KB)   pdf (425 KB)   doi:10.1093/bioinformatics/btp065   pmid:19189979   BibTeX Entry ( Heyne:Will:Beckstette:Backofen:BI_ExpaRNA_2009 )

Lightweight comparison of RNAs based on exact sequence-structure matches

Steffen Heyne, Sebastian Will, Michael Beckstette, Rolf Backofen

In: Proceedings of the German Conference on Bioinformatics (GCB'2008), Lecture Notes in Informatics (LNI), 2008, P-136, 189-198

Specific functions of RNA molecules are often associated to different motifs in the RNA structure. The key feature is that the combination of sequence and structure properties form such an RNA motif. In this paper we introduce a new RNA sequence-structure comparison method which maintains exact matching substructures. Existing common substructures are treated as whole unit while variability is allowed between such structural motifs. Based on a fast detectable set of overlapping and crossing substructure matches for two nested RNA secondary structures, our method computes the longest colinear sequence of substructures common to two RNAs in O(n2m2) time and O(nm) space. Applied to different RNAs, our method correctly identifies sequence-structure similarities between two RNAs. The results of our experiments are in good agreement with existing alignment-based methods, but can be obtained in a fraction of running time, in particular for larger RNAs. The proposed algorithm is implemented in the program expaRNA, which is available from our website (www.bioinf.uni-freiburg.de/Software).

Available:
pdf (1263 KB)   BibTeX Entry ( Heyne:Will:Beckstette:Backofen:GCB08 )