-
Notifications
You must be signed in to change notification settings - Fork 87
DSL2: Nf test conversion: shortDNA #1144
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 8 commits
Commits
Show all changes
16 commits
Select commit
Hold shift + click to select a range
cd12ca7
copy the test-profile nf-test file
merszym 3598dc3
WIP: create nf-test file for test_shortdna profile
merszym 2871ab5
linting. remove completed todo
TCLamnidis f02670b
update test
TCLamnidis a9b44a6
update shortDNA snapshot
merszym 1ae4f8a
Merge branch 'dev' into nf-test-conversion-shortDNA
merszym b436e11
fix linting
merszym efc5bd4
Merge branch 'nf-test-conversion-shortDNA' of github.com:nf-core/eage…
merszym 6b5d582
Merge branch 'nf-core-template-merge-3.4.1' of github.com:nf-core/eag…
merszym bffe7a4
Update shortDNA test-profile
merszym 6c247a6
Merge branch 'nf-core-template-merge-3.4.1' of github.com:nf-core/eag…
merszym cd76da1
update snapshots
merszym 8da6b38
Update tests/shortdna.nf.test
merszym 3659bd5
Update tests/shortdna.nf.test
merszym e05bb08
Merge branch 'dev' into nf-test-conversion-shortDNA
TCLamnidis 36c3da1
Update snapshots after template merge
merszym File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,141 @@ | ||
| nextflow_pipeline { | ||
|
|
||
| name "Test pipeline: NFCORE_EAGER" | ||
| script "main.nf" | ||
| tag "pipeline" | ||
| tag "nfcore_eager" | ||
| tag "test_shortdna" | ||
|
|
||
| test("test_shortdna_profile") { | ||
merszym marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
|
||
|
|
||
| when { | ||
| params { | ||
| outdir = "$outputDir" | ||
| } | ||
| } | ||
|
|
||
| then { | ||
|
|
||
| /////////////////// | ||
| // DOCUMENTATION // | ||
| /////////////////// | ||
|
|
||
| // The contents of each top level results directory should be tested with individually named snapshots. | ||
| // Within each snapshot, there should be two to three distinct variables, that contain the files to be tested. | ||
| // - stable_name_<dir> is for files with variable md5sums (i.e. content) so only names will be compared | ||
| // - stable_content_<dir> is for files with stable md5sums (i.e. content) so md5sums will be compared | ||
| // - bams_<dir> is for BAM files, where the headerMD5 is checked for stability (since the content can be unstable) | ||
| // If a directory is fully stable, you can drop `stable_name_*` | ||
| // If a directory contains no BAMs, you can drop `bams_*` | ||
|
|
||
| // Generate with: nf-test test --tag test --profile docker,test --update-snapshot | ||
| // Test with: nf-test test --tag test --profile docker,test | ||
merszym marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
|
||
| // NOTE: BAMs are always only stable in name, because: | ||
| // a) sharding breaks header since the shard that was first is named in the header (Fixed in https://github.com/nf-core/eager/pull/1112) | ||
| // b) the order of the reads in the BAMs is not stable (sorted, but reads that share a start position can be in any order) | ||
| // point b) also causes BAIs to be unstable. | ||
| // c) Merging of multiple BAMs with duplicate @RG / @PG tags can cause the header to be unstable (particularly in the case of shards/lanes) | ||
|
|
||
| ////////////////////// | ||
| // DEFINE VARIABLES // | ||
| ////////////////////// | ||
|
|
||
| // Define exclusion patterns for files with unstable contents | ||
| // NOTE: When a section needs more than a couple of small patterns, consider adding a variable to store the patterns here | ||
| // This is particularly important if the patterns excluded in the stable content section should be included in the stable name section | ||
| def unstable_patterns_auth = [ | ||
| '**/mapped_reads_gc-content_distribution.txt', | ||
| '**/mapped_reads_nucleotide_content.txt', | ||
| '**/genome_gc_content_per_window.png', | ||
| '**/*.{svg,pdf,html,png,json}', | ||
| '**/DamageProfiler.log', | ||
| '**/{3,5}p_freq_misincorporations.txt', | ||
| '**/DNA_comp_genome.txt', | ||
| '**/DNA_composition_sample.txt', | ||
| '**/misincorporation.txt', | ||
| '**/genome_results.txt', | ||
| '**/3pGtoA_freq.txt', | ||
| '**/5pCtoT_freq.txt', | ||
| '**/lgdistribution.txt', | ||
| '**/*c_curve.txt', | ||
| "**/coverage_across_reference.txt", | ||
| "**/coverage_histogram.txt", | ||
| "**/duplication_rate_histogram.txt", | ||
| "**/genome_fraction_coverage.txt", | ||
| "**/mapping_quality_across_reference.txt", | ||
| "**/mapping_quality_histogram.txt", | ||
| ] | ||
merszym marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
|
||
| // Check that no files are missing/added | ||
| // Command legend: Result directory to index , includeDir: include dirs?, ignore: exclude patterns , ignoreFile: exclude pattern list , include: include patterns | ||
| def stable_name_all = getAllFilesFromDir("$outputDir/" , includeDir: false , ignore: ['pipeline_info/*'] , ignoreFile: null , include: ['*', '**/*'] ) | ||
|
|
||
| // Authentication | ||
| def stable_content_authentication = getAllFilesFromDir("$outputDir/authentication" , includeDir: false , ignore: unstable_patterns_auth , ignoreFile: null , include: ['*', '**/*'] ) | ||
| def stable_name_authentication = getAllFilesFromDir("$outputDir/authentication" , includeDir: false , ignore: null , ignoreFile: null , include: unstable_patterns_auth) | ||
|
|
||
| // Deduplication | ||
| // NOTE: even the flagstats are unstable, so we only check the names | ||
| def stable_name_deduplication = getAllFilesFromDir("$outputDir/deduplication" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*'] ) | ||
|
|
||
| // Final_bams | ||
| // NOTE: BAMs are unstable, since upstream BAMs are unstable. | ||
| def stable_name_final_bams = getAllFilesFromDir("$outputDir/final_bams" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*'] ) | ||
|
|
||
| // Mapping (incl. bam_input flasgstat) | ||
| def stable_content_mapping = getAllFilesFromDir("$outputDir/mapping" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.flagstat'] ) | ||
| def stable_name_mapping = getAllFilesFromDir("$outputDir/mapping" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{bam,bai}'] ) | ||
|
|
||
| // Preprocessing | ||
| // NOTE: FastQC html appears stable, but I worry it might just include a day timestamp instead of a full timestamp. To keep the expression simpler I removed both from checksum testing. | ||
| def stable_content_preprocessing = getAllFilesFromDir("$outputDir/preprocessing" , includeDir: false , ignore: ['**/*.{zip,log,html}'], ignoreFile: null , include: ['**/*'] ) | ||
| def stable_name_preprocessing = getAllFilesFromDir("$outputDir/preprocessing" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{zip,log,html}'] ) | ||
|
|
||
| // Genotyping | ||
| // NOTE: single file is created, with unstable content due to unstable BAM input. | ||
| def stable_name_genotyping = getAllFilesFromDir("$outputDir/genotyping" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.glf.gz'] ) | ||
|
|
||
| // Metagenomics | ||
| def unstable_patterns_meta = [ // contain the creation date | ||
| '**/*.{biom,log,megan,rma6}', | ||
| '**/*table.tsv', | ||
| '**/log.txt' | ||
| ] | ||
|
|
||
| def stable_content_metagenomics = getAllFilesFromDir("$outputDir/metagenomics" , includeDir: false , ignore: unstable_patterns_meta , ignoreFile: null , include: ['**/*'] ) | ||
| def stable_name_metagenomics = getAllFilesFromDir("$outputDir/metagenomics" , includeDir: false , ignore: null , ignoreFile: null , include: unstable_patterns_meta ) | ||
|
|
||
| // MultiQC | ||
| def stable_name_multiqc = getAllFilesFromDir("$outputDir/multiqc" , includeDir: false , ignore: null , ignoreFile: null , include: ['*', '**/*'] ) | ||
|
|
||
| /////////////////////// | ||
| // DEFINE ASSERTIONS // | ||
| /////////////////////// | ||
|
|
||
| assertAll( | ||
| { assert workflow.success }, | ||
| // This checks that there are no missing or additional output files. | ||
| // Also a good starting point to look at all the files in the output folder than need to be checked in subsequent sections. | ||
| { assert snapshot( stable_name_all*.name ).match("all_files") }, | ||
|
|
||
| // Checking changes to contents of each section | ||
| // NOTE: Keep the order of the sections in the alphanumeric order of the output directories. | ||
| // Each section should first check stable_content, stable_name second (if applicable). | ||
| { assert snapshot( stable_content_authentication , stable_name_authentication*.name ).match("authentication") }, | ||
| { assert snapshot( stable_name_deduplication*.name ).match("deduplication") }, | ||
| { assert snapshot( stable_name_final_bams*.name ).match("final_bams") }, | ||
| // NOTE: The snapshot section for mapping cannot be named 'mapping'. See https://github.com/askimed/nf-test/issues/279 | ||
| { assert snapshot( stable_content_mapping , stable_name_mapping*.name ).match("mapping_output") }, | ||
| { assert snapshot( stable_content_preprocessing , stable_name_preprocessing*.name ).match("preprocessing") }, | ||
| // { assert snapshot( stable_content_readfiltering , stable_name_readfiltering*.name ).match("read_filtering") }, | ||
| { assert snapshot( stable_name_genotyping*.name ).match("genotyping") }, | ||
| { assert snapshot( stable_content_metagenomics , stable_name_metagenomics*.name ).match("metagenomics") }, | ||
| { assert snapshot( stable_name_multiqc*.name ).match("multiqc") }, | ||
|
|
||
| // Versions | ||
| { assert new File("$outputDir/pipeline_info/nf_core_eager_software_mqc_versions.yml").exists() }, | ||
|
|
||
| ) | ||
| } | ||
| } | ||
| } | ||
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.