Papers for stringologist (2022)
Contents
- A Theoretical and Experimental Analysis of BWT Variants for String Collections.
- An FPT-Algorithm for Longest Common Subsequence Parameterized by the Maximum Number of Deletions.
- Arbitrary-Length Analogs to de Bruijn Sequences.
- Back-To-Front Online Lyndon Forest Construction.
- Beyond the Longest Letter-Duplicated Subsequence Problem.
- Bi-Directional r-Indexes.
- Cartesian Tree Subsequence Matching.
- Compact Text Indexing for Advanced Pattern Matching Problems: Parameterized, Order-Isomorphic, 2D, etc. (Invited Talk).
- Efficient Construction of the BWT for Repetitive Text Using String Compression.
- Front Matter, Table of Contents, Preface, Conference Organization.
- Indexable Elastic Founder Graphs of Minimum Height.
- Invitation to Combinatorial Reconfiguration (Invited Talk).
- Linear-Time Computation of Shortest Covers of All Rotations of a String.
- Longest Palindromic Substring in Sublinear Time.
- Making de Bruijn Graphs Eulerian.
- Mechanical Proving with Walnut for Squares and Cubes in Partial Words.
- Minimal Absent Words on Run-Length Encoded Strings.
- On Strings Having the Same Length- k Substrings.
- Parallel Algorithm for Pattern Matching Problems Under Substring Consistent Equivalence Relations.
- Partial Permutations Comparison, Maintenance and Applications.
- Permutation Pattern Matching for Doubly Partially Ordered Patterns.
- Polynomial-Time Equivalences and Refined Algorithms for Longest Common Subsequence Variants.
- Rectangular Tile Covers of 2D-Strings.
- Reduction Ratio of the IS-Algorithm: Worst and Random Cases.
- Reordering a Tree According to an Order on Its Leaves.
- The Dynamic k-Mismatch Problem.
- The Fine-Grained Complexity of Episode Matching.
- The Normalized Edit Distance with Uniform Operation Costs Is a Metric.
- Using Automata and a Decision Procedure to Prove Results in Pattern Matching (Invited Talk).
- {RePair} Grammars Are the Smallest Grammars for Fibonacci Words.
- A Benchmark of Entropy Coders for the Compression of Genome Sequencing Data.
- Burrows-Wheeler Transform on Purely Morphic Words.
- CSTs for Terabyte-Sized Data.
- Compressing the Tree of Canonical Huffman Coding.
- Computing Lexicographic Parsings.
- Computing Matching Statistics on Repetitive Texts.
- Converting RLBWT to LZ77 in smaller space.
- FM-Indexing Grammars Induced by Suffix Sorting for Long Patterns.
- Fast Coding of Haar Wavelet Trees.
- Graphs can be succinctly indexed for pattern matching in $O(\vert E\vert ^{2}+\vert V\vert ^{5/2})$ time.
- HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances.
- Linear-time Minimization of Wheeler DFAs.
- Lower Bounds for Lexicographical DFS Data Structures.
- On Dynamic Bitvector Implementations.
- On different variants of the Burrows-Wheeler-Transform of string collections.
- RLBWT Tricks.
- Selective Weighted Adaptive Coding.
- Simple Worst-Case Optimal Adaptive Prefix-Free Coding.
- Succinct Data Structure for Path Graphs.
- x3: Lossless Data Compressor.
- Logarithmic Equal-Letter Runs for BWT of Purely Morphic Words.
- Prefix Palindromic Length of the Sierpinski Word.
- An Improved Algorithm for Finding the Shortest Synchronizing Words.
- Approximate Circular Pattern Matching.
- Approximating Dynamic Time Warping Distance Between Run-Length Encoded Strings.
- Computing NP-Hard Repetitiveness Measures via MAX-SAT.
- Distinct Elements in Streams: An Algorithm for the (Text) Book.
- Insertion Time of Random Walk Cuckoo Hashing below the Peeling Threshold.
- Lyndon Arrays Simplified.
- Simple Worst-Case Optimal Adaptive Prefix-Free Coding.
- Estimating the Longest Increasing Subsequence in Nearly Optimal Time.
- Faster Pattern Matching under Edit Distance : A Reduction to Dynamic Puzzle Matching and the Seaweed Monoid of Permutation Matrices.
- Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal.
- Online List Labeling: Breaking the log2n Barrier.
- Strong XOR Lemma for Communication with Bounded Rounds : (extended abstract).
- Õ(n+poly(k))-time Algorithm for Bounded Tree Edit Distance.
- An Optimal-Time RLBWT Construction in BWT-Runs Bounded Space.
- Fully Functional Parameterized Suffix Trees in Compact Space.
- Galloping in Fast-Growth Natural Merge Sorts.
- Improved Approximation Algorithms for Dyck Edit Distance and RNA Folding.
- Improved Sublinear-Time Edit Distance for Preprocessed Strings.
- Listing, Verifying and Counting Lowest Common Ancestors in DAGs: Algorithms and Fine-Grained Lower Bounds.
- Computing Palindromes on a Trie in Linear Time.
- External-Memory Dictionaries with Worst-Case Update Cost.
- On the Complexity of Tree Edit Distance with Variables.
- Simple Order-Isomorphic Matching Index with Expected Compact Space.
- Succinct List Indexing in Optimal Time.
- Optimal Sequence Alignment to ED-Strings.
- Lyndon Partial Words and Arrays with Applications.
- Computing Longest (Common) Lyndon Subsequences.
- Linear Time Construction of Indexable Elastic Founder Graphs.
- Practical Space-Efficient Index for Structural Pattern Matching.
- Shortest Unique Palindromic Substring Queries in Semi-dynamic Settings.
- Space-Efficient B Trees via Load-Balancing.
- Elastic-Degenerate String Matching with 1 Error.
- Near-Optimal Search Time in δ-Optimal Space.
- Space-Efficient Data Structure for Next/Previous Larger/Smaller Value Queries.
- String Attractors and Infinite Words.
- On Uniformization in the Full Binary Tree.
- Streaming Word Problems.
- Computing Maximal Unique Matches with the r-Index.
- RLBWT Tricks.
- A Lower Bound for the n-queens Problem.
- An Improved Algorithm for The k-Dyck Edit Distance Problem.
- An Upper Bound and Linear-Space Queries on the LZ-End Parsing.
- Average Sensitivity of Dynamic Programming.
- Enumerating k-SAT functions.
- How Compression and Approximation Affect Efficiency in String Distance Measures.
- How many Clusters? - An algorithmic answer.
- Pattern Matching on Grammar-Compressed Strings in Linear Time.
- Selectable Heaps and Optimal Lazy Search Trees.
- Simulating a stack using queues.
- Splay trees on trees.
- Streaming Regular Expression Membership and Pattern Matching.
- Faster Exponential Algorithm for Permutation Pattern Matching.
- Simpler Adjacency Labeling for Planar Graphs with B-Trees.
- Accessing the Suffix Array via φ -1-Forest.
- Balancing Run-Length Straight-Line Programs.
- Compressed String Dictionaries via Data-Aware Subtrie Compaction.
- Computing All-vs-All MEMs in Run-Length-Encoded Collections of HiFi Reads.
- Computing the Parameterized Burrows-Wheeler Transform Online.
- Engineering Compact Data Structures for Rank and Select Queries on Bit Vectors.
- Genome Comparison on Succinct Colored de Bruijn Graphs.
- How Train-Test Leakage Affects Zero-Shot Retrieval.
- Internal Masked Prefix Sums and Its Connection to Fully Internal Measurement Queries.
- KATKA: A KRAKEN-Like Tool with k Given at Query Time.
- Matching Patterns with Variables Under Edit Distance.
- Maximal Closed Substrings.
- On Representing the Degree Sequences of Sublogarithmic-Degree Wheeler Graphs.
- On the Hardness of Computing the Edit Distance of Shallow Trees.
- On the Optimisation of the GSACA Suffix Array Construction Algorithm.
- Online Algorithms for Finding Distinct Substrings with Length and Multiple Prefix and Suffix Conditions.
- Pattern Matching Under DTW Distance.
- Quantum Time Complexity and Algorithms for Pattern Matching on Labeled Graphs.
- Reconstructing Parameterized Strings from Parameterized Suffix and LCP Arrays.
- Sorting Genomes by Prefix Double-Cut-and-Joins.
- Subsequence Covers of Words.
- Substring Complexities on Run-Length Compressed Strings.
- The Complexity of the Co-occurrence Problem.
- Existential Definability over the Subword Ordering.
- Probabilistic vs Deterministic Gamblers.
- Almost-optimal sublinear-time edit distance in the low distance regime.
- Dynamic suffix array with polylogarithmic queries and updates.
- Explicit binary tree codes with sub-logarithmic size alphabet.
- Improved approximation guarantees for shortest superstrings using cycle classification by overlap to length ratios.
- On the optimal time/space tradeoff for hash tables.
- Predecessor on the Ultra-Wide Word RAM.
- Unit-Disk Range Searching and Applications.
- Eulertigs: Minimum Plain Text Representation of k-mer Sets Without Repetitions in Linear Time.
- Fast Gapped k-mer Counting with Subdivided Multi-Way Bucketed Cuckoo Hash Tables.
- Haplotype Threading Using the Positional Burrows-Wheeler Transform.
- Locality-Sensitive Bucketing Functions for the Edit Distance.
- On Weighted k-mer Dictionaries.
- Prefix-Free Parsing for Building Large Tunnelled Wheeler Graphs.
- Suffix Sorting via Matching Statistics.
- Toward Optimal Fingerprint Indexing for Large Scale Genomics.
- phyBWT: Alignment-Free Phylogeny via eBWT Positional Clustering.
ACM Comput. Surv. 2022
- Indexing Highly Repetitive String Collections, Part I: Repetitiveness Measures.
- Indexing Highly Repetitive String Collections, Part II: Compressed Indexes.
ACM J. Exp. Algorithmics 2022
- Grammar Compression by Induced Suffix Sorting.
ACM Trans. Algorithms 2022
- A Simple Algorithm for Optimal Search Trees with Two-way Comparisons.
Algorithmica 2022
- A Faster Reduction of the Dynamic Time Warping Distance to the Longest Increasing Subsequence Length.
- Adaptive Succinctness.
- Computing Minimal Unique Substrings for a Sliding Window.
- Efficient Computation of Sequence Mappability.
- Fast and Longest Rollercoasters.
- Space Efficient Merging of de Bruijn Graphs and Wheeler Graphs.
- Streaming Dictionary Matching with Mismatches.
Algorithms Mol. Biol. 2022
- Space-efficient representation of genomic k-mer count tables.
Appl. Soft Comput. 2022
- Graph search and variable neighborhood search for finding constrained longest common subsequences in artificial and real gene sequences.
- Fast and compact matching statistics analytics.
CoRR 2022
- A New Class of String Transformations for Compressed Text Indexing.
- A theoretical and experimental analysis of BWT variants for string collections.
- Almost-Optimal Sublinear-Time Edit Distance in the Low Distance Regime.
- An Optimal-Time RLBWT Construction in BWT-runs Bounded Space.
- An n Hk-compressed searchable partial-sums data structure for static sequences of sublogarithmic positive integers.
- Approximate Circular Pattern Matching.
- Cartesian Tree Subsequence Matching.
- Computing Longest (Common) Lyndon Subsequences.
- Computing NP-hard Repetitiveness Measures via MAX-SAT.
- Computing maximal generalized palindromes.
- Dynamic Suffix Array with Polylogarithmic Queries and Updates.
- Efficient Construction of the BWT for Repetitive Text Using String Compression.
- Faster Pattern Matching under Edit Distance.
- Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices.
- LCP-dropout: Compression-based Multiple Subword Segmentation for Neural Machine Translation.
- Linear Time Construction of Indexable Elastic Founder Graphs.
- Logarithmic equal-letter runs for BWT of purely morphic words.
- Longest (Sub-)Periodic Subsequence.
- MONI can find k-MEMs.
- Memory Efficient Tries for Sequential Pattern Mining.
- Minimal Absent Words on Run-Length Encoded Strings.
- Multiple Genome Analytics Framework: The Case of All SARS-CoV-2 Complete Variants.
- Near-Optimal Search Time in δ-Optimal Space.
- Numeric Lyndon-based feature embedding of sequencing reads for machine learning approaches.
- OSM-tree: A Sortedness-Aware Index.
- Online List Labeling: Breaking the log2n Barrier.
- Online algorithms for finding distinct substrings with length and multiple prefix and suffix conditions.
- Õ(n+poly(k))-time Algorithm for Bounded Tree Edit Distance.
- Predecessor on the Ultra-Wide Word RAM.
- RePair Grammars are the Smallest Grammars for Fibonacci Words.
- Safety and Completeness in Flow Decompositions for RNA Assembly.
- Shortest Unique Palindromic Substring Queries in Semi-dynamic Settings.
- Space-Efficient STR-IC-LCS Computation.
- Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software Platform.
- Substring Complexities on Run-length Compressed Strings.
- The Efficiency of the ANS Entropy Encoding.
- What Does Dynamic Optimality Mean in External Memory?
- X3: Lossless Data Compressor.
Commun. ACM 2022
- Resolution of the burrows-wheeler transform conjecture.
- The compression power of the BWT: technical perspective.
IEEE Access 2022
- Compressing and Querying Integer Dictionaries Under Linearities and Repetitions.
Inf. Comput. 2022
- A periodicity lemma for partial words.
- Efficient representation and counting of antipower factors in words.
- LZRR: LZ77 parsing with right reference.
- Order-preserving pattern matching indeterminate strings.
- c-trie++: A dynamic trie tailored for fast prefix searches.
Inf. Process. Lett. 2022
- All-pairs suffix/prefix in optimal time using Aho-Corasick space.
- Palindromic trees for a sliding window and its applications.
Inf. Retr. J. 2022
- Applying burst-tries for error-tolerant prefix search.
J. Comput. Biol. 2022
- Finding Maximal Exact Matches Using the r-Index.
- MONI: A Pangenomic Index for Finding Maximal Exact Matches.
J. Comput. Sci. 2022
- A sorting algorithm based on ordered block insertions.
SN Comput. Sci. 2022
- Combining Forward Compression with PPM.
- Correction to: Graph Compression for Adjacency-Matrix Multiplication.
- Graph Compression for Adjacency-Matrix Multiplication.
Theor. Comput. Sci. 2022
- A data structure for substring-substring LCS length queries.
- An efficient algorithm for the longest common palindromic subsequence problem.
- Combinatorics of minimal absent words for a sliding window.
- Efficient and compact representations of some non-canonical prefix-free codes.
- In-place initializable arrays.
- Internal shortest absent word queries in constant time and linear space.
- Parameterized DAWGs: Efficient constructions and bidirectional pattern searches.
- Partial sums on the ultra-wide word RAM.
Theory Comput. Syst. 2022
- Factorizing Strings into Repetitions.