translation mode example


The BamQuery translation mode was designed to search BAM files from Ribo-seq data. In this mode, BamQuery can be used as a means to verify the presence of ribosome profiling reads that overlap with peptide MCSs.

Aware that the length of Ribo-seq reads varies between ~24-32 nucletotides, BamQuery counts reads according to their percentage overlapping with the MCS. For example, if a Ribo-seq read overlaps with 70% of the MCS in a given region, BamQuery counts this read as 0.7 instead of 1.

Note

BamQuery counts all reads for which the overlap with the MCSs is equal or higher to 60%.

Input

In this mode, instead of BAM_directories.tsv BamQuery expects a BAM_ribo_directories.tsv that includes the Ribo-seq datasets.

Command line:

BamQuery.py ./translation_example/Input translation_example v38_104 --mode translation

Input folder path_to_input_folder must containt the files : BAM_ribo_directories.tsv and peptides.tsv.

Output

BamQuery creates an output directory in the same path as the input folder.

This directory contains 3 folders and the main results are organized as follows:

├── alignments
│   ├── missed_peptides.info
│   └── translation_example_info_alignments.xlsx
├── logs
│   ├── BamQuery_Res_translation_example.log
│   └── Information_BAM_directories.log
├── plots
│   └── heat_maps
│       └── translation_evidence_heatmap
│           ├── average_translation_expression_heatmap
│           │   ├── norm_info.csv
│           │   ├── translation_example_ribo_norm_all_tissues.pdf
│           │   └── translation_example_ribo_norm_selected_tissues.pdf
│           └── total_translation_expression_heatmap
│               ├── translation_example_ribo_counts.csv
│               ├── translation_example_ribo_counts.pdf
│               ├── translation_example_ribo_norm.csv
│               └── translation_example_ribo_norm.pdf
└── res_translation
    └── translation_example_ribo_count_info.xlsx

The output files (alignments and logs) in BamQuery translation mode are similar to those in normal mode, see `output_normal_mode_example`_ for detailed information about the output files : missed_peptides.info, sc_example_info_alignments.xlsx and logs.


plots

plots
└── heat_maps
        └── translation_evidence_heatmap
                ├── average_translation_expression_heatmap
                │   ├── norm_info.csv
                │   ├── translation_example_ribo_norm_all_tissues.pdf
                │   └── translation_example_ribo_norm_selected_tissues.pdf
                └── total_translation_expression_heatmap
                                                ├── translation_example_ribo_counts.csv
                                                ├── translation_example_ribo_counts.pdf
                                                ├── translation_example_ribo_norm.csv
                                                └── translation_example_ribo_norm.pdf

heat_maps

This folder contains the heat maps representing the translation expression level of all peptides queried.

average_translation_expression_heatmap folder: |br| Heat maps representing the mean translation expression for each peptide queried in the tissues associated with the BAM/CRAM ribo-seq files.

norm_info.csv: reports, for each peptide queried, the mean and median values of rphm in the tissues associated with the BAM/CRAM ribo-seq files.

_norm_all_tissues.pdf : Heat map representing the mean level of translation expression associated with tissue types, computed from all samples in the tissue.

_norm_selected_tissues.pdf: Heat map representing the mean level of translation expression associated with tissue types, computed from selected tissues (short list of tissues).

total_translation_expression_heatmap folder: |br| Heat maps representing the mean translation expression and total number of ribo-seq reads for each peptide queried in each BAM/CRAM ribo-seq file.

_ribo_counts.csv: reports, for each peptide queried, the total number of ribo-seq reads in each BAM/CRAM ribo-seq file.

_ribo_counts.pdf: heat map representing the mean number of ribo-seq reads in each BAM/CRAM ribo-seq file.

_ribo_norm.csv: reports, for each peptide queried, the mean rphm values in each BAM/CRAM ribo-seq file.

_ribo_norm.pdf: heat map representing the mean translation expression level of each peptide in each BAM/CRAM ribo-seq file.

Warning

  1. Heat maps are produced for searches with less than 400 peptides.

  2. Heat map in total_translation_expression_heatmap is produced only if the number of BAM/CRAM files queried are less than 100 tissues.