Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in checkGrep(grep(".A.txt", files)) when running maegatk #5

Open
uqjlu8 opened this issue Mar 14, 2022 · 39 comments
Open

Error in checkGrep(grep(".A.txt", files)) when running maegatk #5

uqjlu8 opened this issue Mar 14, 2022 · 39 comments

Comments

@uqjlu8
Copy link

uqjlu8 commented Mar 14, 2022

Hi
I am trying to run maegatk on my dataset. I have installed all the modules required as stated in the tutorial.
java, bwa, bedtools, freebayes, R (4.1.2, with data.table, Matrix, GenomicRanges, SummarizedExperiment). I am running it on python 3.7

I have tried to run the program on both the test dataset, and my own dataset using the commands below:

maegatk bcall --input $bam -o $resul_out -c $ncores -b $barcodes -mr $minReads -z

I keep getting the same error in both instances:

Mon Mar 14 15:46:24 AEST 2022: maegatk v0.1.1
Mon Mar 14 15:46:24 AEST 2022: Found bam file: Data/test_maester.bam for genotyping.
Mon Mar 14 15:46:24 AEST 2022: Will determine barcodes with at least: 100 mitochondrial reads.
Mon Mar 14 15:46:24 AEST 2022: User specified mitochondrial genome matches .bam file
Mon Mar 14 15:46:30 AEST 2022: Finished determining/splitting barcodes for genotyping.
Mon Mar 14 15:46:31 AEST 2022: Genotyping samples with 24 threads
Error in checkGrep(grep(".A.txt", files)) :
Improper folder specification; file missing / extra file present. See documentation
Calls: importMito -> checkGrep
Execution halted

I have attached the a list of all the files generated using (ls -lRh $result_folder), scatter.log, gather.log

test_result_file_list.txt
maegatk.snakemake_scatter.log.txt
maegatk.snakemake_gather.log.txt

Any help would be greatly appreciated.

Thanks

@WenxuLiu
Copy link

i got the same problem..
expecting resolutions.

@andrecossa5
Copy link

Hi,

I am experiencing the same problem on test data.
Thank you for handling this :)

@uqjlu8
Copy link
Author

uqjlu8 commented Apr 2, 2022

Also, the definition of -mr (minimum reads) was used incorrectly. The function used to call -C in freebayes is actually minimum alternative read, which is not the same thing as minimum read.

@caleblareau
Copy link
Owner

Also, the definition of -mr (minimum reads) was used incorrectly. The function used to call -C in freebayes is actually minimum alternative read, which is not the same thing as minimum read.

Here we use -mr only in the context of the fgbio consensus demultiplexing (

fgcalltwo = fgbio + " CallMolecularConsensusReads -t "+umi_barcode+" -i "+temp_bam1+" -o " + temp_bam2 +" -M " + min_reads
)

@caleblareau
Copy link
Owner

Hi @uqjlu8, your error stems from an improperly specified bam file header:

Error in the scatter log (line 83):

ERROR::READ_GROUP_NOT_FOUND:Record 1, Read name NS500239:389:H3F5JBGXC:1:12210:7650:20169_GTCACAATCGGATGGA-2+CTGTCACATT, RG ID on SAMRecord not found in header: Gen22_LG_D22:0:1:H3F5JBGXC:1

This biostars post discusses it a bit more: https://www.biostars.org/p/50338/

In short, this is an error upstream of the maegatk took and something that would require you to check with how you processed your single-cell sequencing data. May I ask what pipeline you used to go from fastq to bam files?

@uqjlu8
Copy link
Author

uqjlu8 commented Apr 9, 2022 via email

@caleblareau
Copy link
Owner

just to clarify-- did the test data work OK for you?

@uqjlu8
Copy link
Author

uqjlu8 commented Apr 23, 2022 via email

@ttab963
Copy link

ttab963 commented Jul 7, 2022

Hello @caleblareau,
I also face same problem.

Test data or my own data cannot proceed at making ~/final/SAMPLE_NAME.A.txt.gz part.
Files are intact and I am quite sure this(https://www.biostars.org/p/50338/) is not the point.

Hope you provide any response.
Thanks.

@caleblareau
Copy link
Owner

caleblareau commented Oct 11, 2022 via email

@showteeth
Copy link

I got the same problem with the test data.

I noticed that the version is 0.1.2 in setup.py, and the PyPi version is 0.1.1. Can the difference between the two versions solve this problem?

YB

@caleblareau
Copy link
Owner

version 0.1.2 was for indel calling, so I don't think it'll impact anything for you. You can try though-- just clone the repo then install it from the local branch:

``
python3 -m pip install -e maegatk

@si3
Copy link

si3 commented Dec 20, 2023

Hello

I am running into a similar issue. It seems like this might be related to this:
caleblareau/mgatk#22

I ran maegatk with --snake-sdout flag and the error stems from the following:

AttributeError in line 54 of ~/venv3/lib/python3.6/site-packages/maegatk/bin/snake/Snakefile.maegatk.Gather
'InputFiles' object has no attribute 'As'

Any idea on how to work around this?

Stef

@noranekonobokkusu
Copy link
Collaborator

Hi @si3 the README file is now slightly expanded and explains what intermediate output files you should expect. If there is no As in Gather, then it means that line 35 here fails and the output of Scatter is incomplete, so I suggest looking into the Scatter log. Scatter should have generated a set of files in temp_bam,ready_bam,sparse_matrices folders. If you are keeping intermediate files, you can check which files are missing, it should also be reflected in Scatter's log.
These are the files that should be present for each CB:

-> l temp_bam/ | head
total 215944
-rw-r--r-- 1 safina vangalenlab  1387286 Jan  6 15:57 CCCGTCGTGGTA-1.temp0.bam
-rw-r--r-- 1 safina vangalenlab  1365836 Jan  6 15:57 CCCGTCGTGGTA-1.temp1.bam
-rw-r--r-- 1 safina vangalenlab 13993678 Jan  6 15:57 CCCGTCGTGGTA-1.temp1.5.sam
-rw-r--r-- 1 safina vangalenlab  1320355 Jan  6 15:57 CCCGTCGTGGTA-1.temp1.5.bam
-rw-r--r-- 1 safina vangalenlab   489313 Jan  6 15:57 CCCGTCGTGGTA-1.temp2.bam
-rw-r--r-- 1 safina vangalenlab  1925330 Jan  6 15:57 CCCGTCGTGGTA-1.temp0.fastq
-rw-r--r-- 1 safina vangalenlab  1123027 Jan  6 15:57 GACCGTGCATTT-1.temp0.bam
-rw-r--r-- 1 safina vangalenlab  1122782 Jan  6 15:57 GACCGTGCATTT-1.temp1.bam
-rw-r--r-- 1 safina vangalenlab 11246101 Jan  6 15:57 GACCGTGCATTT-1.temp1.5.sam
(base) 

-> l ready_bam/ | head
total 6728
-rw-r--r-- 1 safina vangalenlab 216594 Jan  6 15:57 CCCGTCGTGGTA-1.qc.bam
-rw-r--r-- 1 safina vangalenlab     96 Jan  6 15:57 CCCGTCGTGGTA-1.qc.bam.bai
-rw-r--r-- 1 safina vangalenlab 219598 Jan  6 15:57 GACCGTGCATTT-1.qc.bam
-rw-r--r-- 1 safina vangalenlab     96 Jan  6 15:57 GACCGTGCATTT-1.qc.bam.bai
-rw-r--r-- 1 safina vangalenlab 189703 Jan  6 15:58 CCACAAAACATG-1.qc.bam
-rw-r--r-- 1 safina vangalenlab     96 Jan  6 15:58 CCACAAAACATG-1.qc.bam.bai
-rw-r--r-- 1 safina vangalenlab 261193 Jan  6 15:58 GGCGCTAATGAA-1.qc.bam
-rw-r--r-- 1 safina vangalenlab     96 Jan  6 15:58 GGCGCTAATGAA-1.qc.bam.bai
-rw-r--r-- 1 safina vangalenlab 219108 Jan  6 15:59 TTCCTACGCAAT-1.qc.bam
(base) 

-> l sparse_matrices/ | head
total 27904
-rw-r--r-- 1 safina vangalenlab 159767 Jan  6 15:57 CCCGTCGTGGTA-1.A.txt
-rw-r--r-- 1 safina vangalenlab 159806 Jan  6 15:57 CCCGTCGTGGTA-1.C.txt
-rw-r--r-- 1 safina vangalenlab  77507 Jan  6 15:57 CCCGTCGTGGTA-1.G.txt
-rw-r--r-- 1 safina vangalenlab 125697 Jan  6 15:57 CCCGTCGTGGTA-1.T.txt
-rw-r--r-- 1 safina vangalenlab 308142 Jan  6 15:57 CCCGTCGTGGTA-1.coverage.txt
-rw-r--r-- 1 safina vangalenlab 162181 Jan  6 15:57 GACCGTGCATTT-1.A.txt
-rw-r--r-- 1 safina vangalenlab 160649 Jan  6 15:57 GACCGTGCATTT-1.C.txt
-rw-r--r-- 1 safina vangalenlab  77111 Jan  6 15:57 GACCGTGCATTT-1.G.txt
-rw-r--r-- 1 safina vangalenlab 129431 Jan  6 15:57 GACCGTGCATTT-1.T.txt

Do you have all of these files?

@NBurnaevskiy
Copy link

Same error here. Did you have a chance to find a fix?
Thank you.

@noranekonobokkusu
Copy link
Collaborator

Hi @NBurnaevskiy,

were you able to identify which part of the pipeline failed? Try going through the content of intermediate files as described in https://github.com/caleblareau/maegatk?tab=readme-ov-file#output-files

@NBurnaevskiy
Copy link

@noranekonobokkusu ,
Thank you for your reply.
I looked into the temp directory. There is expected number of bam files in the 'barcoded_bams' folder. However, 'temp_bam' and 'ready_bam' are empty. 'sparse_matrices' is also empty.
Would you have ideas how to proceed?

@noranekonobokkusu
Copy link
Collaborator

Yes, try looking into the snakemake-scatter text log file in the logs/ folder, it should have some error messages.

Also note that a recently added option --skip-barcodesplit allows you to skip the barcode splitting step if it already finished successfully. It shouldn't matter if you are running maegatk on a small test dataset.

@NBurnaevskiy
Copy link

I actually don't have that file.
Here is the content of the 'logs'
filterlogs
rmdupslogs
base.maegatk.log
maegatk.parameters.txt
maegatk.snakemake_gather.log

@noranekonobokkusu
Copy link
Collaborator

@NBurnaevskiy this is new. Are you running it on a test dataset? Are there any errors (or any messages) in the output of this run? (like sh.o or sh.e files, or just stdout/err?)

I also noticed that yesterday you opened and then closed a yaml-related issue. Might it be that you introduced a fix that causes snakemake-scatter not to be executed?

@noranekonobokkusu
Copy link
Collaborator

If you are running a test file (which is recommended for debugging), can you delete the entire output directory and re-run it from scratch? And then see what is in logs/base.maegatk.log and whether you have /.internal/parseltongue/snake.scatter.yaml which are instructions for scatter

@NBurnaevskiy
Copy link

Yes, we had to introduce a fix into yaml command. We followed the instruction from error message and yaml documentation.
We had error message:

"AttributeError:
"dump()" has been removed, use

yaml = YAML(typ='unsafe', pure=True)
yaml.dump(...)

instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/cli.py", line 300

yaml.dump(dict1, yaml_file, default_flow_style=False, Dumper=yaml.RoundTripDumper)"

In the cli.py file we changed line 300 from
yaml.dump(dict1, yaml_file, default_flow_style=False, Dumper=yaml.RoundTripDumper)

to
yaml=YAML()
yaml.default_flow_style = False
yaml.dump(dict1, yaml_file)

That allow the script not to crash as that point.
We will now to change to what error message recommended:
yaml = YAML(typ='unsafe', pure=True)
yaml.dump(dict1, yaml_file, default_flow_style=False, Dumper=yaml.RoundTripDumper)

Do you think that could cause current issue with temp files?

@noranekonobokkusu
Copy link
Collaborator

I suspect recent modifications @caleblareau introduced on yaml are not present in the current python package on PyPi.
Can you try replacing your entire cli.py with the code from https://github.com/caleblareau/maegatk/blob/master/maegatk/cli.py, and then try suggestions from my previous comment and report results?

@NBurnaevskiy
Copy link

I just ran a script on the test file (with our current yaml).
Exactly the same result.

@NBurnaevskiy
Copy link

I am waiting for our admin to change yaml file.

@noranekonobokkusu
Copy link
Collaborator

You can try installing maegatk locally which would allow you to modify the code yourself. Old yaml commands got outdated with the current version of yaml. I would try updating the maegatk code rather than downgrading yaml version.

@NBurnaevskiy
Copy link

Just got response from our admin. They did exactly what you suggested, replaced old yaml file with updated one.

Test run failed with identical symptoms.

Yes, we do have have file snake.scatter.yaml in ./internal/parseltongue folder

@NBurnaevskiy
Copy link

Error log from maegatk.snakemake_gather.log

Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.gather.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Job stats:
job count


all 1
make_depth_table 1
make_final_sparse_matrices 1
total 3

Select jobs to execute...

[Thu Mar 21 10:13:43 2024]
rule make_final_sparse_matrices:
output: /home/nburnaevskiy/maegatk_test/output/final/maegatk.A.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.C.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.G.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.T.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.coverage.txt.gz
jobid: 2
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/final/maegatk.A.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.C.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.coverage.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.G.txt.gz, /home/nburnaevskiy/maegatk_test/output/final/maegatk.T.txt.gz
resources: tmpdir=/tmp

[Thu Mar 21 10:13:43 2024]
rule make_depth_table:
output: /home/nburnaevskiy/maegatk_test/output/final/maegatk.depthTable.txt
jobid: 1
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/final/maegatk.depthTable.txt
resources: tmpdir=/tmp

Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.gather.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.gather.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
gzip: /home/nburnaevskiy/maegatk_test/output/final/maegatk.A.txt: No such file or directory
gzip: /home/nburnaevskiy/maegatk_test/output/final/maegatk.C.txt: No such file or directory
gzip: /home/nburnaevskiy/maegatk_test/output/final/maegatk.G.txt: No such file or directory
gzip: /home/nburnaevskiy/maegatk_test/output/final/maegatk.T.txt: No such file or directory
gzip: /home/nburnaevskiy/maegatk_test/output/final/maegatk.coverage.txt: No such file or directory
Waiting at most 5 seconds for missing files.
[Thu Mar 21 10:13:44 2024]
Finished job 1.
1 of 3 steps (33%) done
MissingOutputException in rule make_final_sparse_matrices in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Gather, line 33:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/final/maegatk.A.txt.gz
/home/nburnaevskiy/maegatk_test/output/final/maegatk.C.txt.gz
/home/nburnaevskiy/maegatk_test/output/final/maegatk.G.txt.gz
/home/nburnaevskiy/maegatk_test/output/final/maegatk.T.txt.gz
/home/nburnaevskiy/maegatk_test/output/final/maegatk.coverage.txt.gz
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Complete log: .snakemake/log/2024-03-21T101342.762479.snakemake.log

@NBurnaevskiy
Copy link

Now the tool produced slightly more results but crashed anyway.
We finally the file maegatk.snakemake_scatter.log.
Looks like yaml gives a problem in another file.

Its content
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Job stats:
job count


all 1
make_sample_list 1
process_one_sample 10
total 12

Select jobs to execute...

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/GGGTTCGCCTCC-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGGTTCGCCTCC-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGGTTCGCCTCC-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/GGGTTCGCCTCC-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.coverage.txt
jobid: 8
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/GGGTTCGCCTCC-1.depth.txt
wildcards: sample=GGGTTCGCCTCC-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/TGGTAGTGAACC-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TGGTAGTGAACC-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TGGTAGTGAACC-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/TGGTAGTGAACC-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.coverage.txt
jobid: 10
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/TGGTAGTGAACC-1.depth.txt
wildcards: sample=TGGTAGTGAACC-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/ACAATTAGCACT-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/ACAATTAGCACT-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/ACAATTAGCACT-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/ACAATTAGCACT-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.coverage.txt
jobid: 2
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/ACAATTAGCACT-1.depth.txt
wildcards: sample=ACAATTAGCACT-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/CCACAAAACATG-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCACAAAACATG-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCACAAAACATG-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/CCACAAAACATG-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.coverage.txt
jobid: 3
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/CCACAAAACATG-1.depth.txt
wildcards: sample=CCACAAAACATG-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/CTAACCCGGAAT-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CTAACCCGGAAT-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CTAACCCGGAAT-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/CTAACCCGGAAT-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.coverage.txt
jobid: 5
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/CTAACCCGGAAT-1.depth.txt
wildcards: sample=CTAACCCGGAAT-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/GGCGCTAATGAA-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGCGCTAATGAA-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGCGCTAATGAA-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/GGCGCTAATGAA-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.coverage.txt
jobid: 7
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/GGCGCTAATGAA-1.depth.txt
wildcards: sample=GGCGCTAATGAA-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/GTACCCACAGCC-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GTACCCACAGCC-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GTACCCACAGCC-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/GTACCCACAGCC-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.coverage.txt
jobid: 9
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/GTACCCACAGCC-1.depth.txt
wildcards: sample=GTACCCACAGCC-1
resources: tmpdir=/tmp

[Thu Mar 21 12:11:51 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/TTCCTACGCAAT-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TTCCTACGCAAT-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TTCCTACGCAAT-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/TTCCTACGCAAT-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.coverage.txt
jobid: 11
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/TTCCTACGCAAT-1.depth.txt
wildcards: sample=TTCCTACGCAAT-1
resources: tmpdir=/tmp

Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
[Thu Mar 21 12:11:52 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/CCCGTCGTGGTA-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCCGTCGTGGTA-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCCGTCGTGGTA-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/CCCGTCGTGGTA-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.coverage.txt
jobid: 4
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/CCCGTCGTGGTA-1.depth.txt
wildcards: sample=CCCGTCGTGGTA-1
resources: tmpdir=/tmp

Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
[Thu Mar 21 12:11:52 2024]
rule process_one_sample:
input: /home/nburnaevskiy/maegatk_test/output/.internal/samples/GACCGTGCATTT-1.bam.txt
output: /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GACCGTGCATTT-1.qc.bam, /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GACCGTGCATTT-1.qc.bam.bai, /home/nburnaevskiy/maegatk_test/output/qc/depth/GACCGTGCATTT-1.depth.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.A.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.C.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.G.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.T.txt, /home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.coverage.txt
jobid: 6
reason: Missing output files: /home/nburnaevskiy/maegatk_test/output/qc/depth/GACCGTGCATTT-1.depth.txt
wildcards: sample=GACCGTGCATTT-1
resources: tmpdir=/tmp

Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Waiting at most 5 seconds for missing files.
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Config file /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml is extended by additional config specified via the command line.
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 10
Rules claiming more threads will be scaled down.
Select jobs to execute...
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
Traceback (most recent call last):
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17, in
config = yaml.load(stream, Loader=yaml.Loader)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1085, in load
error_deprecation('load', 'load', arg=_error_dep_arg, comment=_error_dep_comment)
File "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/ruamel/yaml/main.py", line 1039, in error_deprecation
raise AttributeError(s, name=None)
AttributeError:
"load()" has been removed, use

yaml = YAML(typ='rt')
yaml.load(...)

and register any classes that you use, or check the tag attribute on the loaded data,
instead of file "/net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 17

config = yaml.load(stream, Loader=yaml.Loader)

Waiting at most 5 seconds for missing files.
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGGTTCGCCTCC-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGGTTCGCCTCC-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/GGGTTCGCCTCC-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGGTTCGCCTCC-1.coverage.txt
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TGGTAGTGAACC-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TGGTAGTGAACC-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/TGGTAGTGAACC-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TGGTAGTGAACC-1.coverage.txt
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/ACAATTAGCACT-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/ACAATTAGCACT-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/ACAATTAGCACT-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/ACAATTAGCACT-1.coverage.txt
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCACAAAACATG-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCACAAAACATG-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/CCACAAAACATG-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCACAAAACATG-1.coverage.txt
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/GGGTTCGCCTCC-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGGTTCGCCTCC-1.qc.bam GGGTTCGCCTCC-1
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/TGGTAGTGAACC-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TGGTAGTGAACC-1.qc.bam TGGTAGTGAACC-1
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/ACAATTAGCACT-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/ACAATTAGCACT-1.qc.bam ACAATTAGCACT-1
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGCGCTAATGAA-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGCGCTAATGAA-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/GGCGCTAATGAA-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GGCGCTAATGAA-1.coverage.txt
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GTACCCACAGCC-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GTACCCACAGCC-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/GTACCCACAGCC-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GTACCCACAGCC-1.coverage.txt
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/CCACAAAACATG-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCACAAAACATG-1.qc.bam CCACAAAACATG-1
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CTAACCCGGAAT-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CTAACCCGGAAT-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/CTAACCCGGAAT-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CTAACCCGGAAT-1.coverage.txt
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/GGCGCTAATGAA-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GGCGCTAATGAA-1.qc.bam GGCGCTAATGAA-1
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TTCCTACGCAAT-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TTCCTACGCAAT-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/TTCCTACGCAAT-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/TTCCTACGCAAT-1.coverage.txt
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/GTACCCACAGCC-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GTACCCACAGCC-1.qc.bam GTACCCACAGCC-1
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCCGTCGTGGTA-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCCGTCGTGGTA-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/CCCGTCGTGGTA-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/CCCGTCGTGGTA-1.coverage.txt
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/CTAACCCGGAAT-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CTAACCCGGAAT-1.qc.bam CTAACCCGGAAT-1
MissingOutputException in rule process_one_sample in file /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/snake/Snakefile.maegatk.Scatter, line 21:
Job 0 completed successfully, but some output files are missing. Missing files after 5 seconds. This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait:
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GACCGTGCATTT-1.qc.bam
/home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GACCGTGCATTT-1.qc.bam.bai
/home/nburnaevskiy/maegatk_test/output/qc/depth/GACCGTGCATTT-1.depth.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.A.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.C.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.G.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.T.txt
/home/nburnaevskiy/maegatk_test/output/temp/sparse_matrices/GACCGTGCATTT-1.coverage.txt
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/TTCCTACGCAAT-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/TTCCTACGCAAT-1.qc.bam TTCCTACGCAAT-1
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/GACCGTGCATTT-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/GACCGTGCATTT-1.qc.bam GACCGTGCATTT-1
python /net/module/sw/maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py /home/nburnaevskiy/maegatk_test/output/.internal/parseltongue/snake.scatter.yaml /home/nburnaevskiy/maegatk_test/output/temp/barcoded_bams/CCCGTCGTGGTA-1.bam /home/nburnaevskiy/maegatk_test/output/temp/ready_bam/CCCGTCGTGGTA-1.qc.bam CCCGTCGTGGTA-1
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Complete log: .snakemake/log/2024-03-21T121150.050858.snakemake.log

@noranekonobokkusu
Copy link
Collaborator

Did they replace the yaml command or the python file in the installation folder (like, in ~/.local/lib/python3.9/site-packages/maegatk/)? snakemake_gather failed because it didn't have required input files. Are you sure you are not getting any other error messages at any stage of running it, and that you replaced cli.py entirely and didn't change anything else? and that you didn't have any existing (even empty) output directories like temp_bam and ready_bam before rerunning the command? I don't see a way to reproduce the absence of snakemake_scatter output.

@noranekonobokkusu
Copy link
Collaborator

@NBurnaevskiy just saw your new comment, let me take a look.

@NBurnaevskiy
Copy link

Ksenia, thank you for all your responses.

@noranekonobokkusu
Copy link
Collaborator

I see, so yaml syntax in oneSample_maegatk.py got outdated, too. I will recreate in on my computer (I currently have an older yaml so everything seems to work) and try fixing it.

@NBurnaevskiy
Copy link

we are iteratively working through issues. yaml update creates problems in few places.
first we updated file oneSample_maegatk.py, to follow new yaml syntax.
we updated line 16 and 17 to make this
with open(configFile, 'r') as stream:
yaml = YAML(typ='rt')
config = yaml.load(stream, Loader=yaml.Loader)

Then we updated header to change this
from ruamel import yaml
into this
from ruamel.yaml import YAML

Now we got an error:
File ".../maegatk/0.2.0/lib/python3.10/site-packages/maegatk/bin/python/oneSample_maegatk.py", line 18, in
config = yaml.load(stream, Loader=yaml.Loader)
AttributeError: 'YAML' object has no attribute 'Loader'. Did you mean: 'Reader'?

@noranekonobokkusu
Copy link
Collaborator

I am also trying the same - they dropped the load function https://yaml.readthedocs.io/en/latest/, but I am still figuring out what is the correct way to rewrite that

@NBurnaevskiy
Copy link

maybe it should be
config = yaml.load(stream, Loader=Loader)
?

@noranekonobokkusu
Copy link
Collaborator

It seems dropping the argument altogether allows the pipeline to proceed and start generating files in temp_bam/:
yaml = YAML(typ='rt')
config = yaml.load(stream)

@noranekonobokkusu
Copy link
Collaborator

Ok, I can confirm that config = yaml.load(stream) allows the pipeline to produce the final .rds file. @caleblareau can you confirm these changes are enough?

from ruamel.yaml import YAML
...
yaml = YAML(typ='rt')
config = yaml.load(stream)

@NBurnaevskiy
Copy link

I can confirm that the script produced final output files.
There were some complains in the log, like this
samtools: /net/module/sw/maegatk/0.2.0/bin/../lib/libtinfow.so.6: no version information available (required by samtools)

but hopefully they are not critical.

I will now try real data and let you know if it completes.
Thank you very much Ksenia. I hope that this tools will useful for us.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

9 participants