fitgcp
fitting genome coverage distributions with mixture models
Install
- All systems
-
curl cmd.cat/fitgcp.sh
- Debian
-
apt-get install fitgcp
- Ubuntu
-
apt-get install fitgcp
- Kali Linux
-
apt-get install fitgcp
- Windows (WSL2)
-
sudo apt-get update
sudo apt-get install fitgcp
- Raspbian
-
apt-get install fitgcp
- Dockerfile
- dockerfile.run/fitgcp
fitgcp
fitting genome coverage distributions with mixture models
Genome coverage, the number of sequencing reads mapped to a position in a genome, is an insightful indicator of irregularities within sequencing experiments. While the average genome coverage is frequently used within algorithms in computational genomics, the complete information available in coverage profiles (i.e. histograms over all coverages) is currently not exploited to its full extent. Thus, biases such as fragmented or erroneous reference genomes often remain unaccounted for. Making this information accessible can improve the quality of sequencing experiments and quantitative analyses. fitGCP is a framework for fitting mixtures of probability distributions to genome coverage profiles. Besides commonly used distributions, fitGCP uses distributions tailored to account for common artifacts. The mixture models are iteratively fitted based on the Expectation-Maximization algorithm.