Dear team,
GenomicsDBImport failed with different errors on two file systems, using the same command and dataset.
A summary follows:
1. BeeGFS file system:
$ cat Cq5A.chunk_1.GenomicsDBImport.11705869.err | tail -13
01:08:38.145 INFO GenomicsDBImport - Starting batch input file preload
01:09:38.916 INFO GenomicsDBImport - Finished batch preload
01:09:38.916 INFO GenomicsDBImport - Importing batch 1 with 564 samples
[TileDB::FileSystem] Error: (write_to_file) Cannot write to file; File writing error; path=/ibex/scratch/projects/c2071/1000quinoa/naga/1128_Phase2/RESULTS/gVCF/Cq5A.1/Cq5A$1$8993623/__06ef12cd-3e3e-445b-8a17-78d392d6e35347261215135488_1600294179899/ExcessHet.tdb; errno=121(Remote I/O error)
[TileDB::WriteState] Error: Cannot write segment to file.
02:44:19.847 erro NativeGenomicsDB - pid=208539 tid=208928 VariantStorageManagerException exception : Error while writing to TileDB array
TileDB error message : [TileDB::WriteState] Error: Cannot write segment to file
[TileDB::FileSystem] Error: (write_to_file) Cannot write to file; File writing error; path=/ibex/scratch/projects/c2071/1000quinoa/naga/1128_Phase2/RESULTS/gVCF/Cq5A.1/Cq5A$1$8993623/__06ef12cd-3e3e-445b-8a17-78d392d6e35347261215135488_1600294179899/ExcessHet.tdb; errno=121(Remote I/O error)
terminate called after throwing an instance of 'std::exception'
what(): std::exception
2. NFS file system:
$ cat Cq5A.chunk_1.GenomicsDBImport.11739158.err | tail -13
11:49:12.994 INFO GenomicsDBImport - Starting batch input file preload
11:52:16.216 INFO GenomicsDBImport - Shutting down engine
[September 21, 2020 11:52:16 AM AST] org.broadinstitute.hellbender.tools.genomicsdb.GenomicsDBImport done. Elapsed time: 8.28 minutes.
Runtime.totalMemory()=19555942400
A USER ERROR has occurred: Couldn't read file. Error was: Failure while waiting for FeatureReader to initialize with exception: org.broadinstitute.hellbender.exceptions.UserException: Failed to create reader from file:///ibex/scratch/projects/c2071/1000quinoa/OUT/Sept2020/VCF/S7E1_batch2.snps.indels.g.vcf.gz
Set the system property GATK_STACKTRACE_ON_USER_EXCEPTION (--java-options '-DGATK_STACKTRACE_ON_USER_EXCEPTION=true') to print the stack trace.
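For reference, the rerun suggested by the message above could look roughly like the sketch below. It only prints the command rather than running it. The shell variables are the same placeholders as in the command listed under "Command used", and stacktrace_cmd is a hypothetical helper name, not part of GATK.

```shell
#!/usr/bin/env bash
# Sketch: print (do not run) the import command with the stack-trace
# system property added, as suggested by the GATK error message.
# stacktrace_cmd is a hypothetical helper; the variables are placeholders.
stacktrace_cmd() {
    echo gatk --java-options "'-DGATK_STACKTRACE_ON_USER_EXCEPTION=true'" \
        GenomicsDBImport \
        --variant "$INPUT" \
        --genomicsdb-workspace-path "$gVCF/$ChrName.$size" \
        --intervals "$ChrName:$Start-$End" \
        --reader-threads "$CORES"
}
```

Calling stacktrace_cmd after setting the variables prints the full command line, which can be checked before it is submitted to the scheduler.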
Note:
- The environment variable TILEDB_DISABLE_FILE_LOCKING=1 was exported before running GenomicsDBImport.
- The error shown above for NFS was also observed on BeeGFS when running GenomicsDBImport on other chromosomes/chunks. I used to overcome this issue
Background:
GATK version: 4.1.8
GenomicsDB native library version: 1.3.0-e701905
Command used:
gatk GenomicsDBImport --variant $INPUT --genomicsdb-workspace-path $gVCF/$ChrName.$size --intervals $ChrName:$Start-$End --reader-threads $CORES
Where,
$INPUT is the list of HaplotypeCaller gVCF files from 1128 samples.
$ChrName is Cq5A
$size is 1, 2, 3, ..., 8 (8 chunks)
$ChrName:$Start-$End is taken from the summary below, in which Cq5A is split into 8 parts:
$ cat Chromosome_distribution.txt | grep Cq5A
Cq5A split into 8 parts
Chromosome name:Cq5A, Chunk number: 1, and Interval(Start:1-End:8993623)
Chromosome name:Cq5A, Chunk number: 2, and Interval(Start:8993624-End:17987246)
Chromosome name:Cq5A, Chunk number: 3, and Interval(Start:17987247-End:26980869)
Chromosome name:Cq5A, Chunk number: 4, and Interval(Start:26980870-End:35974492)
Chromosome name:Cq5A, Chunk number: 5, and Interval(Start:35974493-End:44968115)
Chromosome name:Cq5A, Chunk number: 6, and Interval(Start:44968116-End:53961738)
Chromosome name:Cq5A, Chunk number: 7, and Interval(Start:53961739-End:62955361)
Chromosome name:Cq5A, Chunk number: 8, and Interval(Start:62955362-End:64666259)
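The intervals above are consistent with a fixed chunk size of 8993623 bp, with the last chunk clipped to the chromosome end at 64666259. A minimal sketch that reproduces them, assuming that scheme (chunk_intervals is a hypothetical helper and is not the script that produced Chromosome_distribution.txt):

```shell
#!/usr/bin/env bash
# Sketch: print 1-based, inclusive chunk intervals for one chromosome.
# chunk_intervals is a hypothetical helper, not part of GATK.
chunk_intervals() {
    local chr=$1 length=$2 chunk_size=$3
    local start=1 end n=1
    while [ "$start" -le "$length" ]; do
        end=$(( start + chunk_size - 1 ))
        # Clip the last chunk to the chromosome length.
        if [ "$end" -gt "$length" ]; then end=$length; fi
        echo "chunk $n: $chr:$start-$end"
        start=$(( end + 1 ))
        n=$(( n + 1 ))
    done
}

# Chromosome length and chunk size read off the summary above.
chunk_intervals Cq5A 64666259 8993623
```

Each printed interval is in the chr:start-end form passed to --intervals.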
Observations:
- Chunks 1, 2, 3, 7 and 8 failed on BeeGFS (chunks 4, 5 and 6 succeeded).
- Chunks 1, 2, 3, 4 and 7 failed on NFS (chunks 5, 6 and 8 succeeded).
- For example, the output for chunk 5:
On BeeGFS:
$ cat Cq5A.chunk_5.GenomicsDBImport.11705873.out
Tool returned:
true
On NFS:
$ cat Cq5A.chunk_5.GenomicsDBImport.11739162.out
Tool returned:
true
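A quick way to tabulate which chunks finished is to scan the .out files. The sketch below assumes that a successful run writes "Tool returned:" followed by a line containing only "true", as in the chunk 5 examples above, and that the files follow the naming pattern shown there. report_chunks is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Sketch: classify each chunk's .out file as OK or FAILED.
# Assumption: success means "Tool returned:" is followed by a line "true".
# report_chunks is a hypothetical helper, not part of GATK.
report_chunks() {
    local dir=$1 f
    for f in "$dir"/Cq5A.chunk_*.GenomicsDBImport.*.out; do
        [ -e "$f" ] || continue   # skip if the pattern matched nothing
        if grep -A1 '^Tool returned:' "$f" | grep -qx 'true'; then
            echo "OK     $(basename "$f")"
        else
            echo "FAILED $(basename "$f")"
        fi
    done
}

# Scan the current directory.
report_chunks .
```

An empty or truncated .out file is reported as FAILED, so the matching .err file still needs to be read for the actual cause.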
System environment:
OS: CentOS Linux release 7.7.1908 (Core)
Java version: 1.8.0_242
openjdk version "1.8.0_242"
OpenJDK Runtime Environment (build 1.8.0_242-b08)
OpenJDK 64-Bit Server VM (build 25.242-b08, mixed mode)
Thanks and Regards,
Naga