TargetRNA3

What is TargetRNA3?

TargetRNA3 uses machine learning to predict targets of small regulatory RNAs (sRNAs) throughout a genome. At its core, TargetRNA3 employs a gradient boosting classification algorithm that has been trained on thousands of evinced interactions between sRNAs and their regulatory targets in various prokaryotes. When making target predictions, the machine learning algorithm uses a variety of features indicative of regulatory interactions, including the thermodynamics of the interaction and potential homologous interactions in related organisms, if available.

Input

As input, TargetRNA3 requires the name of a genome and the sequence of a small regulatory RNA (sRNA). When a user enters a few characters from the genome name, a drop-down box appears from which the user can select their genome of interest. For the sRNA sequence, the sequence may be in FASTA format or not.

By default, TargetRNA3 reports candidate regulatory targets of the sRNA that have a probability greater than or equal to 0.5 and a p-value less than or equal to 0.05. These input parameters can be adjusted. To increase the sensitivity of TargetRNA3 and predict more targets, the probability parameter can be lowered.

Output

As output, TargetRNA3 produces a plot indicating regions of the sRNA that participate in target interactions, a plot indicating regions of the sRNA that are conserved in other genomes, and a ranked list of candidate targets that it predicts for the sRNA. For each candidate target, TargetRNA3 reports:

the name and annotation of the target
a predicted structure and energy (based on thermodynamics of hybridization and structural accessibility of the sRNA and target) for the sRNA:target interaction
the location of the interaction within the sRNA and within the target (relative to the start of the target)
the probability, as determined by the machine learning algorithm, that there is a regulatory interaction between the sRNA and target
a corresponding p-value for the regulatory interaction

The main measure of the likelihood that the sRNA interacts with a candidate target is the probability. Probabilities greater than 0.5 indicate that the candidate is more likely than not to be a target, as determined by the machine learning algorithm. Probabilities less than 0.5 indicate that the candidate is more likely not to be a target, as determined by the machine learning algorithm.

Savvy users may note that energies are not perfectly correlated with probabilities. This is because energies are just one of many features used by the machine learning algorithm to determine probabilities. Other features are not reported by the webserver for the sake of concision. Advanced users are encouraged to make use of the source code.

Interested in searching a different genome from those provided?

The TargetRNA3 website enables searching any prokaryotic genome listed as reference or representative by NCBI's RefSeq, as well as many others. There are thousands of such genomes. The site does not provide support for searching any/all prokaryotic genomes. There are too many. If you are interested in using TargetRNA3 to search a genome other than those provided, you have two options.

Option #1. Download the source code and run TargetRNA3 on your own machine. The User Manual describes how to add a new genome to TargetRNA3's search.

Option #2. We do our best to support reasonable requests for individual genomes to be added to the TargetRNA3 website. We receive many such requests, so we kindly ask for your understanding and patience . Unfortunately, we cannot process batch requests, e.g., "please add these 20 genomes." If you would like to request that a genome be added to the website, please follow these steps:

Kindly double-check that the genome is not already available on the website.
Find the NCBI RefSeq genome assembly identifier for your genome of interest, e.g., GCF_?????????.1. This assembly accession identifier can be found in the first column here for archaea and here for bacteria. We use this identifier to add a genome. If your genome of interest does not have such an identifier, we will not be able to add it to the website and you will need to use Option #1 above and run TargetRNA3 on your own machine.
Submit the Contact Form and provide the assembly accession identifier (GCF_?????????.1) for your genome of interest.
Over the next week or so, check the website for inclusion of your genome (make sure to refresh the TargetRNA3 webpage). We process requests about once a week. We don't normally notify you when requests have been processed, so please check the website.

Source Code

TargetRNA3 source code is available on GitHub

Citing TargetRNA3

TargetRNA3: predicting prokaryotic RNA regulatory targets with machine learning. Tjaden B. Genome Biology, 24:276, 2023.