Add methurator tool#609
Open
edogiuili wants to merge 24 commits into
Open
Conversation
…ethurator is specified
Member
|
Warning Newer version of the nf-core template is available. Your pipeline is using an old version of the nf-core template: 3.5.1. For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation. |
Author
|
Tests 6,7,8,9 and 10 (and 3 only for conda profile) fail but I haven't changed them or anything related to their workflows. Might be that these tests were already failing? |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
In this PR, I add methurator as an additional QC tool for both the Bismark and bwa-meth subworkflows.
Methurator cannot currently be used with the TAPS/bwa-mem subworkflow because it relies on MethylDackel to extract CpG methylation information from BAM files.
Methurator is a Python package designed to estimate CpG sequencing saturation in DNA methylation sequencing data. It complements Preseq by estimating the number of sequencing reads required to achieve coverage of at least N CpGs with a minimum read depth of r. Because the estimator is based on a frequency-of-frequencies matrix derived from methylation count files, it can also be used to estimate library complexity for RRBS datasets.
Notes
topicchannel, following the latest nf-core guidelines.fasta_index_methylseqlint test currently fails because I had to introduce a newmethuratorboolean parameter to trigger generation of theFASTA.FAIindex. This index is required by Methurator due to its dependency on MethylDackel.PR checklist
nf-core pipelines lint).nextflow run . -profile test,docker --outdir <OUTDIR>).nextflow run . -profile debug,test,docker --outdir <OUTDIR>).docs/usage.mdis updated.docs/output.mdis updated.CHANGELOG.mdis updated.README.mdis updated (including new tool citations and authors/contributors).