Professional Documents
Culture Documents
guigolab / bamstats
Dismiss
Join GitHub today
GitHub is home to over 40 million developers working together to host and
review code, manage projects, and build software together.
Sign up
emi80 Bump version to 0.3.3 for release Latest commit a97df7c 14 days ago
annotation Fix wrong feature merging and interleaving with unsorted annotation - c… 14 days ago
data Add check for int8 NH tag in general stats - fix #18 11 months ago
stats Add check for int8 NH tag in general stats - fix #18 11 months ago
utils Add method to stats.Map to output JSON - close #16 last year
README.md [ci skip] Update installation instructions in readme to use latest re… 10 months ago
process_test.go Add check for int8 NH tag in general stats - fix #18 11 months ago
README.md
Bamstats
build passing coverage 85%
Bamstats is a command line tool written in Go for computing mapping statistics from a BAM file.
Installation instructions
Use one of the following methods to install Bamstats .
https://github.com/guigolab/bamstats 1/3
18/09/2019 GitHub - guigolab/bamstats: A command line tool to compute mapping statistics from a BAM file
The easiest way is to download a pre-compiled binary from Github releases. Here is an example for installing the latest
released version on Linux 64bit:
go get github.com/guigolab/bamstats/cmd/bamstats
Provided statistics
Bamstats can currently compute the following mapping statistics:
general
genome coverage
RNA-seq
General
The general mapping statistics include:
If the data is paired-end, a section for read-pairs is also reported. In addition to the above metrics, the section contains a
map of the insert size length and the corresponding support as number of reads.
Genome coverage
The genome coverage ststistics are computed for RNA-seq data and include counts for the following genomic regions:
exon
intron
exonic_intronic
intergenic
others
The above metrics are computed for continuous and split mapped reads. An aggregated total is computed across elements
and read types too.
The --uniq (or -u ) command line flag allows reporting of genome coverage statistics for uniquely mapped reads too.
RNA-seq
The RNA-seq statistics follow IHEC reccomendations for RNA-seq data quality metrics. They include counts for the following
regions:
mapped
intergenic
rRNA
https://github.com/guigolab/bamstats 2/3
18/09/2019 GitHub - guigolab/bamstats: A command line tool to compute mapping statistics from a BAM file
duplicates
Output examples:
Some examples of the program output can be found in the data folder ot this GitHub repository:
General Stats
Genomic coverage stats
Genomic coverage stats with uniquely mapped reads (Note that the coverageUniq stats are reported as an additional
JSON object)
RNA-seq stats
License
This software is release under a BSD-style license. Please check the LICENSE file for more details.
https://github.com/guigolab/bamstats 3/3