reapr

universal tool for genome assembly evaluation

Install

All systems
curl cmd.cat/reapr.sh
Debian Debian
apt-get install reapr
Ubuntu
apt-get install reapr
image/svg+xml Kali Linux
apt-get install reapr
Windows (WSL2)
sudo apt-get update sudo apt-get install reapr
Raspbian
apt-get install reapr

reapr

universal tool for genome assembly evaluation

REAPR is a tool that evaluates the accuracy of a genome assembly using mapped paired end reads, without the use of a reference genome for comparison. It can be used in any stage of an assembly pipeline to automatically break incorrect scaffolds and flag other errors in an assembly for manual inspection. It reports mis-assemblies and other warnings, and produces a new broken assembly based on the error calls. The software requires as input an assembly in FASTA format and paired reads mapped to the assembly in a BAM file. Mapping information such as the fragment coverage and insert size distribution is analysed to locate mis-assemblies. REAPR works best using mapped read pairs from a large insert library (at least 1000bp). Additionally, if a short insert Illumina library is also available, REAPR can combine this with the large insert library in order to score each base of the assembly.