RNA-Framework Module Pipeline Specification
Pipeline Details
- Name:
RNA-Framework Module - Pipeline UUID:
b780db2315004c02b836cb5d0903183c - Version:
1.0.1 - View Pipeline:
Overview
RNA-Framework Module pipeline is designed for processing, analyzing, and visualizing RNA chemical probing data (e.g., SHAPE, DMS, MaP approaches). It provides a comprehensive, modular suite of tools that handle the complete workflow from raw sequencing reads to secondary structure prediction, including trimming, aligning, computing mutation/RT-stop profiles, generating reactivities, performing normalization, and producing structure-ready outputs.
Key Use cases:
- SHAPE-MaP Data Analysis: Processing SHAPE chemical probing experiments with mutational profiling to determine RNA secondary structure constraints.
- DMS-MaPseq Experiments: Analyzing dimethyl sulfate probing data to identify accessible adenine and cytosine residues in RNA structures.
- RNA Structure Prediction: Integrating chemical probing reactivities with computational folding algorithms to predict experimentally-informed secondary structures.
Features
- Modular Design: Supports customizable workflows with plug-and-play modules (rf-index, rf-map, rf-count, rf-norm, rf-combine, rf-fold) that can be turned on/off as desired.
- Multiple Aligner Support: Includes both Bowtie and Bowtie2 alignment options optimized for chemical probing read characteristics.
- Chemical Probing Specificity: Tailored algorithms for handling mutation signatures from different probing chemistries (SHAPE, DMS, etc.).
- Comprehensive Quality Control: Implements FastQC and MultiQC for sequencing quality assessment and summary reporting.
- Flexible Normalization Methods: Supports multiple scoring and normalization approaches (Ding et al. 2014, Rouskin et al. 2014, Siegfried et al. 2014, Zubradt et al. 2016).
- Structure Prediction Integration: Incorporates both ViennaRNA and RNAstructure folding engines with experimental constraint integration.
- Replicate Handling: Automated combining of multiple experimental replicates with correlation analysis.
Input/Output Specification
Inputs
Required
Sequencing Reads
- Description: FASTQ files containing raw sequencing reads from chemical probing experiments.
- Format: .fastq.gz
- Example File Path: /path/to/input/sample.fastq.gz
Reference Genome/Transcripts
- Description: FASTA file containing RNA sequences to be used as mapping references.
- Format: .fasta
- Example File Path: /path/to/reference/transcripts.fasta
Optional Inputs
Pre-built Bowtie Index
- Description: Pre-computed Bowtie/Bowtie2 index files for reference sequences.
- Format: Bowtie index files (.ebwt or .bt2)
- Example File Path: /path/to/index/reference_index
Outputs
Reported Outputs
- MultiQC Report:
- Description: Comprehensive quality control summary across all samples and processing steps
- Format: .html
- Example File Path: /output/multiqc/multiqc_report.html
- Visualization App: Web browser
-
Location: MultiQC folder
-
Normalized Reactivity Profiles:
- Description: XML files containing normalized per-nucleotide reactivity values
- Format: .xml
- Example File Path: /output/rf_norm/transcript_reactivities.xml
- Visualization App: RNA structure visualization tools
-
Location: rf_norm folder
-
Predicted Secondary Structures:
- Description: RNA secondary structure predictions in dot-bracket notation or connectivity table format
- Format: .db/.ct
- Example File Path: /output/rf_fold/structures.db
- Visualization App: RNA structure viewers (e.g., VARNA, RNAstructure)
- Location: rf_fold folder
Supporting Outputs
- Aligned BAM Files:
- Description: Mapped sequencing reads in binary alignment format
- Format: .bam
-
Example File Path: /intermediate/rf_map/aligned_reads.bam
-
Mutation Count Files:
- Description: Per-nucleotide mutation and coverage statistics
- Format: .rc (RNA Framework count format)
-
Example File Path: /intermediate/rf_count/mutation_counts.rc
-
FastQC Reports:
- Description: Individual sample quality control reports
- Format: .html
- Example File Path: /intermediate/fastqc/sample_fastqc.html
Associated Processes
- FastQC
- MultiQC
- RNAFramework Combine
- RNAFramework Count
- RNAFramework Fold
- RNAFramework Index
- RNAFramework Map
- RNAFramework Norm
References & Additional Documentation
- Related Papers: Incarnato et al., 2018. RNA Framework: an all-in-one toolkit for the analysis of RNA structures and post-transcriptional modifications. Nucleic Acids Res. 46(16):e97
- Pipeline Repository: RNA Framework official repository
- Workflow Diagram: Available in pipeline description pages
- License: Open source license (see RNA Framework documentation)