Skip to content

vg haplotypes crashed: top-level chain 7964 is a loop; haplotype sampling cannot be used with this graph #4820

@GooLey1025

Description

@GooLey1025

error log from vg haplotype

$ vg haplotypes -v 2 -t 24 -H targets.hapl targets.gbz 
warning[vg haplotypes] Thread count (-t) is greater than the maximum number of threads available (16), capping to that value
Loading GBZ from targets.gbz
[vg haplotypes] Generating haplotype information
[vg haplotypes] Guessing that distance index is targets.dist
[vg haplotypes] Loading distance index from targets.dist
[vg haplotypes] Building minimizer index
Building MinimizerIndex with k = 29, w = 11, payload = none
1846289 keys (1782180 unique)
Minimizer occurrences: 1973556
Load factor: 0.44019
Construction time: 0.656898 seconds
[vg haplotypes] Built the minimizer index in 0.65691 seconds
[vg haplotypes] Guessing that r-index is targets.ri
Loading r-index from targets.ri
HaplotypePartitioner: 362227 fragments for 2978 haplotype sequences
Partitioning parameters:
- target length 10000 bp
- 32 jobs
Determining construction jobs
error[vg haplotypes] HaplotypePartitioner::partition_haplotypes(): top-level chain 7964 is a loop; haplotype sampling cannot be used with this graph
.This can sometimes be resolved by using the vg index -P option to specify a reference backbone when computing the distance index.

My gbz generated procedure:

vg find -x mc-cactus.xg -R target_regions.paths.bed -c 2000 -L \
| vg combine - > targets.vg

vg view -g targets.vg > targets.gfa

vg gbwt \
  -G targets.gfa \
  --max-node 1024 \
  --num-jobs 16 \
  -p \
  -g targets.gbz

vg index -t 24 -j targets.dist --no-nested-distance targets.gbz -P  FaHC_P8_Nipponbare_TEJ
vg gbwt -p -r targets.ri -Z targets.gbz
vg haplotypes -v 2 -t 24 -H targets.hapl targets.gbz 

What I am doing here is to extract the subgraph of MC-whole-genome-graph, then I can do giraffe mapping on the specific regions to save computation time.

I am not sure whehter my -P FaHC_P8_Nipponbare_TEJ provided is right?
I have generated the relevant backbone-reference paths for debugging:

$ vg paths -L -x targets.gbz | grep "FaHC_P8_Nip" | head -n 10
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[30749]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[163334]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[174338]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[178544]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[183832]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[214569]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[244971]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[299632]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[299783]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr1[300236] 
$ vg paths -L -x targets.gbz | grep "FaHC_P8_Nip" | tail -n 10
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22744737]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22765169]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22814343]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22828158]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22834274]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22855416]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22862931]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22888249]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22901706]
FaHC_P8_Nipponbare_TEJ#0#P8.Nipponbare.TEJ.Chr9[22936364]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions