Levinson, G. & Gutman, G. A. Slipped-strand mispairing: a significant mechanism for DNA sequence evolution. Mol. Biol. Evol. 4, 203–221 (1987).
Fan, H. & Chu, J.-Y. A short overview of brief tandem repeat mutation. Genom. Proteom. Bioinform. 5, 7–14 (2007).
Shriver, M. D., Jin, L., Chakraborty, R. & Boerwinkle, E. VNTR allele frequency distributions below the stepwise mutation mannequin: a pc simulation method. Genetics 134, 983–993 (1993).
Wright, J. M. Mutation at VNTRs: are minisatellites the evolutionary progeny of microsatellites? Genome 37, 345–347 (1994).
Willems, T. et al. The panorama of human STR variation. Genome Res. 24, 1894–1904 (2014).
Ren, J., Gu, B. & Chaisson, M. J. P. vamos: variable-number tandem repeats annotation utilizing environment friendly motif units. Genome Biol. 24, 175 (2023).
Noyes, M. D. et al. Familial long-read sequencing will increase yield of de novo mutations. Am. J. Hum. Genet. 109, 631–646 (2022).
DeJesus-Hernandez, M. et al. Expanded GGGGCC hexanucleotide repeat in noncoding area of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron 72, 245–256 (2011).
Depienne, C. & Mandel, J.-L. 30 years of repeat growth problems: what have we discovered and what are the remaining challenges? Am. J. Hum. Genet. 108, 764–785 (2021).
Mirceta, M., Shum, N., Schmidt, M. H. M. & Pearson, C. E. Fragile websites, chromosomal lesions, tandem repeats, and illness. Entrance. Genet. 13, 985975 (2022).
Hannan, A. J. Repeat DNA expands our understanding of autism spectrum dysfunction. Nature 589, 200–202 (2021).
Hannan, A. J. Tandem repeats mediating genetic plasticity in well being and illness. Nat. Rev. Genet. 19, 286–298 (2018).
Stanley, U. et al. Forensic DNA profiling: autosomal brief tandem repeat as a outstanding marker in crime investigation. Malays. J. Med. Sci. 27, 22–35 (2020).
Corridor, C. L. et al. Correct profiling of forensic autosomal STRs utilizing the Oxford Nanopore Applied sciences MinION machine. Forensic Sci. Int. Genet. 56, 102629 (2022).
Warner, J. P. et al. A basic methodology for the detection of huge CAG repeat expansions by fluorescent PCR. J. Med. Genet. 33, 1022–1026 (1996).
Jeffreys, A. J., Wilson, V. & Thein, S. L. Hypervariable ‘minisatellite’ areas in human DNA. Nature 314, 67–73 (1985).
Dolzhenko, E. et al. ExpansionHunter: a sequence-graph primarily based device to research variation in brief tandem repeat areas. Bioinformatics 35, 4754–4756 (2019).
Willems, T. et al. Genome-wide profiling of heritable and de novo STR variations. Nat. Strategies 14, 590–592 (2017).
Mousavi, N., Shleizer-Burko, S., Yanicky, R. & Gymrek, M. Profiling the genome-wide panorama of tandem repeat expansions. Nucleic Acids Res. 47, e90 (2019).
Dolzhenko, E. et al. Characterization and visualization of tandem repeats at genome scale. Nat. Biotechnol. https://doi.org/10.1038/s41587-023-02057-3 (2024).
Chiu, R., Rajan-Babu, I.-S., Friedman, J. M. & Birol, I. Straglr: discovering and genotyping tandem repeat expansions utilizing complete genome long-read sequences. Genome Biol. 22, 224 (2021).
Nurk, S. et al. The entire sequence of a human genome. Science 376, 44–53 (2022).
Aganezov, S. et al. A whole reference genome improves evaluation of human genetic variation. Science 376, eabl3533 (2022).
Rhie, A. et al. The entire sequence of a human Y chromosome. Nature 621, 344–354 (2023).
Olson, N. D. et al. Variant calling and benchmarking in an period of full human genome sequences. Nat. Rev. Genet. 24, 464–483 (2023).
Majidian, S., Agustinho, D. P., Chin, C.-S., Sedlazeck, F. J. & Mahmoud, M. Genomic variant benchmark: if you happen to can not measure it, you can not enhance it. Genome Biol. 24, 221 (2023).
Wagner, J. et al. Benchmarking difficult small variants with linked and lengthy reads. Cell Genom. 2, 100128 (2022).
Zook, J. M. et al. A strong benchmark for detection of germline giant deletions and insertions. Nat. Biotechnol. 38, 1347–1355 (2020).
Wagner, J. et al. Curated variation benchmarks for difficult medically related autosomal genes. Nat. Biotechnol. 40, 672–680 (2022).
English, A. C., Menon, V. Ok., Gibbs, R. A., Metcalf, G. A. & Sedlazeck, F. J. Truvari: refined structural variant comparability preserves allelic range. Genome Biol. 23, 271 (2022).
Yang, J. & Chaisson, M. J. P. TT-Mars: structural variants evaluation primarily based on haplotype-resolved assemblies. Genome Biol. 23, 110 (2022).
Audano, P. A. & Beck, C. R. Small polymorphisms are a supply of ancestral bias in structural variant breakpoint placement. Genome Res. 34, 7–19 (2024).
Fu, Y., Mahmoud, M., Muraliraman, V. V., Sedlazeck, F. J. & Treangen, T. J. Vulcan: improved long-read mapping and structural variant calling by way of dual-mode alignment. GigaScience 10, giab063 (2021).
Gelfand, Y., Rodriguez, A. & Benson, G. TRDB—the Tandem Repeats Database. Nucleic Acids Res. 35, D80–D87 (2007).
Halman, A., Dolzhenko, E. & Oshlack, A. STRipy: a graphical software for enhanced genotyping of pathogenic brief tandem repeats in sequencing knowledge. Hum. Mutat. 43, 859–868 (2022).
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
Saini, S., Mitra, I., Mousavi, N., Fotsing, S. F. & Gymrek, M. A reference haplotype panel for genome-wide imputation of brief tandem repeats. Nat. Commun. 9, 4397 (2018).
Benson, G. Tandem Repeats Finder: a program to research DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
Smit, A., Hubley, R. & Inexperienced, P. RepeatMasker. http://www.repeatmasker.org (2013).
Wlodzimierz, P., Hong, M. & Henderson, I. R. TRASH: tandem repeat annotation and structural hierarchy. Bioinformatics 39, btad308 (2023).
Novák, P., Neumann, P. & Macas, J. International evaluation of repetitive DNA from unassembled sequence reads utilizing RepeatExplorer2. Nat. Protoc. 15, 3745–3776 (2020).
Delucchi, M., Näf, P., Bliven, S. & Anisimova, M. TRAL 2.0: tandem repeat detection with round profile hidden Markov fashions and evolutionary aligner. Entrance. Bioinform. 1, 691865 (2021).
El-Sawy, M. & Deininger, P. Tandem insertions of Alu components. Cytogenet. Genome Res. 108, 58–62 (2004).
Moretti, T. R. et al. Inhabitants knowledge on the expanded CODIS core STR loci for eleven populations of significance for forensic DNA analyses in the US. Forensic Sci. Int. Genet. 25, 175–181 (2016).
Collins, R. L. et al. A structural variation reference for medical and inhabitants genetics. Nature 581, 444–451 (2020).
Stevanovski, I. et al. Complete genetic analysis of tandem repeat growth problems with programmable focused nanopore sequencing. Sci. Adv. 8, eabm5386 (2022).
Pellerin, D. et al. Deep intronic FGF14 GAA repeat growth in late-onset cerebellar ataxia. N. Engl. J. Med. 388, 128–141 (2022).
Tan, D. et al. CAG repeat growth in THAP11 is related to a novel spinocerebellar ataxia. Mov. Disord. 38, 1282–1293 (2023).
Mukamel, R. E. et al. Protein-coding repeat polymorphisms strongly form various human phenotypes. Science 373, 1499–1505 (2021).
Liu, Z. et al. Inconsistent genotyping name at DYS389 locus and implications for interpretation. Int. J. Authorized Med. 132, 1043–1048 (2018).
White, P. S., Tatum, O. L., Deaven, L. L. & Longmire, J. L. New, male-specific microsatellite markers from the human Y chromosome. Genomics 57, 433–437 (1999).
Vinces, M. D., Legendre, M., Caldara, M., Hagihara, M. & Verstrepen, Ok. J. Unstable tandem repeats in promoters confer transcriptional evolvability. Science 324, 1213–1216 (2009).
Sulovari, A. et al. Human-specific tandem repeat growth and differential gene expression throughout primate evolution. Proc. Natl Acad. Sci. USA 116, 23243–23253 (2019).
Annear, D. J. et al. Abundancy of polymorphic CGG repeats within the human genome recommend a broad involvement in neurological illness. Sci. Rep. 11, 2515 (2021).
Liao, W.-W. et al. A draft human pangenome reference. Nature 617, 312–324 (2023).
Ebert, P. et al. Haplotype-resolved various human genomes and built-in evaluation of structural variation. Science 372, eabf7117 (2021).
Garg, S. et al. Chromosome-scale, haplotype-resolved meeting of human genomes. Nat. Biotechnol. 39, 309–312 (2021).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo meeting utilizing phased meeting graphs with hifiasm. Nat. Strategies 18, 170–175 (2021).
Jarvis, E. D. et al. Semi-automated meeting of high-quality diploid human reference genomes. Nature 611, 519–531 (2022).
Dunn, T. & Narayanasamy, S. vcfdist: precisely benchmarking phased small variant calls in human genomes. Nat. Commun. 14, 8149 (2023).
Cleary, J. G. et al. Evaluating variant name information for efficiency benchmarking of next-generation sequencing variant calling pipelines. Preprint at bioRxiv https://doi.org/10.1101/023754 (2015).
Tan, A., Abecasis, G. R. & Kang, H. M. Unified illustration of genetic variants. Bioinformatics 31, 2202–2204 (2015).
Marco-Sola, S., Moure, J. C., Moreto, M. & Espinosa, A. Quick gap-affine pairwise alignment utilizing the wavefront algorithm. Bioinformatics 37, btaa777 (2020).
Sedlazeck, F. J. et al. Correct detection of advanced structural variations utilizing single-molecule sequencing. Nat. Strategies 15, 461–468 (2018).
Park, J., Kaufman, E., Valdmanis, P. N. & Bafna, V. TRviz: a Python library for decomposing and visualizing tandem repeat sequences. Bioinform. Adv. 3, vbad058 (2023).
Krause, A. et al. Junctophilin 3 (JPH3) growth mutations inflicting Huntington illness like 2 (HDL2) are frequent in South African sufferers with African ancestry and a Huntington illness phenotype. Am. J. Med. Genet. B 168, 573–585 (2015).
Wieben, E. D. et al. A typical trinucleotide repeat growth inside the transcription issue 4 (TCF4, E2-2) gene predicts Fuchs corneal dystrophy. PLoS ONE 7, e49083 (2012).
Jam, H. Z. et al. A deep inhabitants reference panel of tandem repeat variation. Nat. Commun. 14, 6711 (2023).
Bakhtiari, M., Shleizer-Burko, S., Gymrek, M., Bansal, V. & Bafna, V. Focused genotyping of variable quantity tandem repeats with adVNTR. Genome Res. 28, 1709–1719 (2018).
Sonay, T. B. et al. Tandem repeat variation in human and nice ape populations and its influence on gene expression divergence. Genome Res. 25, 1591–1599 (2015).
Quinlan, A. R. & Corridor, I. M. BEDTools: a versatile suite of utilities for evaluating genomic options. Bioinformatics 26, 841–842 (2010).
Howe, Ok. L. et al. Ensembl 2021. Nucleic Acids Res. 49, D884–D891 (2020).
English, A. Mission Adotto tandem-repeat areas and annotations. Zenodo 10.5281/zenodo.8387564 (2022).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).
English, A. Mission Adotto whole-genome variants. Zenodo 10.5281/zenodo.6975244 (2022).
Li, H. et al. An artificial-diploid benchmark for correct variant-calling analysis. Nat. Strategies 15, 595–597 (2018).
Chin, C.-S. et al. A diploid assembly-based benchmark for variants within the main histocompatibility advanced. Nat. Commun. 11, 4794 (2020).
Wootton, J. C. & Federhen, S. Statistics of native complexity in amino acid sequences and sequence databases. Comput. Chem. 17, 149–163 (1993).
Šošić, M. & Šikić, M. Edlib: a C/C++ library for quick, precise sequence alignment utilizing edit distance. Bioinformatics 33, btw753 (2016).
Bonfield, J. Ok. et al. HTSlib: C library for studying/writing high-throughput sequencing knowledge. GigaScience 10, giab007 (2021).
Katoh, Ok. & Standley, D. M. MAFFT a number of sequence alignment software program model 7: enhancements in efficiency and value. Mol. Biol. Evol. 30, 772–780 (2013).
English, A. et al. GIAB TandemRepeats benchmark v1.0. https://ftp-trace.ncbi.nlm.nih.gov/ReferenceSamples/giab/launch/AshkenazimTrio/HG002_NA24385_son/TandemRepeats_v1.0 (2023).
English, A. et al. GIAB TR comparability VCFs. Zenodo 10.5281/zenodo.10724503 (2024).
English, A. et al. Working area for the GIAB TR benchmarking challenge. GitHub https://github.com/ACEnglish/adotto (2023).
English, A. Structural variant toolkit for VCFs. GitHub https://github.com/ACEnglish/truvari (2023).
English, A. et al. Library for variant benchmarking stratification. GitHub https://github.com/ACEnglish/laytr (2023).
Olson, N. A snakemake primarily based pipeline to construct Adotto TR databases. GitHub https://github.com/nate-d-olson/adotto-smk (2023).
English, A. A rust implementation of regioneR for interval overlap permutation testing. GitHub https://github.com/ACEnglish/regioners (2023).