GraphChainer

GraphChainer is an accurate aligner of long reads to a variation graph, based on co-linear chaining.

Compiling

To compile, run these:

Running

Quickstart: ./bin/GraphChainer -t 4 -f reads.fastq -g graph.gfa -a out.gam

Key parameters:

Parameters related to colinear chaining:

Constructing an (acyclic) variation graph

Use vg and run:

vg construct -t 30 -a -r {ref} -v {vcf} -R 22 -p -m 3000000

Datasets availability

The graphs built for the experiments of GraphChainer can be found in Zenodo at https://doi.org/10.5281/zenodo.7729494 , https://doi.org/10.5281/zenodo.6875064 and at https://doi.org/10.5281/zenodo.6587252

The real read sets can be found in Zenodo ar TODO

The evaluation pipeline used in the paper can be found at https://github.com/algbio/GraphChainer-scripts

Citation

If you use GraphChainer, please cite as:

Jun Ma, Manuel Cáceres, Leena Salmela, Veli Mäkinen, Alexandru I. Tomescu. Chaining for accurate alignment of erroneous long reads to acyclic variation graphs. Bioinformatics, 2023, 39(8), btad460 https://doi.org/10.1093/bioinformatics/btad460.

Credits

GraphChainer is built on the excellent code base of GraphAligner, which is released under MIT License. GraphAligner is described in the paper GraphAligner: Rapid and Versatile Sequence-to-Graph Alignment by Mikko Rautiainen and Tobias Marschall.