Stringology for Stringologist
- Towards an Efficient Text Sampling Approach for Exact and Approximate Matching.
- The n-ary Initial Literal and Literal Shuffle.
- Counting Lyndon Subsequences.
- Pitfalls of Algorithm Comparison.
- Refined Upper Bounds on the Size of the Condensed Neighbourhood of Sequences.
- Searching with Extended Guard and Pivot Loop.
- Automata Approach to Inexact Tree Pattern Matching Using 1-degree Edit Distance.
- Computational Substantiation of the d-step Conjecture for Distinct Squares Revisited.
- Reducing Time and Space in Indexed String Matching by Characters Distance Text Sampling.
- New Compression Schemes for Natural Number Sequences.
- Enumerative Data Compression with Non-Uniquely Decodable Codes.
- Conversion of Finite Tree Automata to Regular Tree Expressions By State Elimination.
- On Arithmetically Progressed Suffix Arrays.
- Greedy versus Optimal Analysis of Bounded Size Dictionary Compression and On-the-Fly Distributed Computing.
- Forward Linearised Tree Pattern Matching Using Tree Pattern Border Array.
- Fast Exact Pattern Matching in a Bitstream and 256-ary Strings.
- Left Lyndon Tree Construction.
- Fast Practical Computation of the Longest Common Cartesian Substrings of Two Strings.
- Tune-up for the Dead-Zone Algorithm.
- Simple KMP Pattern-Matching on Indeterminate Strings.
- Pointer-Machine Algorithms for Fully-Online Construction of Suffix Trees and DAWGs on Multiple Strings.
- Re-Pair in Small Space.
- Bidirectional Adaptive Compression.
- Translating Between Wavelet Tree and Wavelet Matrix Construction.
- Online Parameterized Dictionary Matching with One Gap.
- Computing Maximal Palindromes and Distinct Palindromes in a Trie.
- An Improvement of the Franek-Jennings-Smyth Pattern Matching Algorithm.
- Selective Dynamic Compression.
- Optimal Time and Space Construction of Suffix Arrays and LCP Arrays for Integer Alphabets.
- A Fast SIMD-Based Chunking Algorithm.
- Pattern Matching on Weighted Strings.
-
k-Abelian Pattern Matching: Revisited, Corrected, and Extended.
- Algorithms to Compute the Lyndon Array Revisited.
- Lexicalized Syntactic Analysis by Restarting Automata.
- On Baier’s Sort of Maximal Lyndon Substrings.
- Three Strategies for the Dead-Zone String Matching Algorithm.
- Synchronizing Dynamic Huffman Codes.
- Constrained Approximate Subtree Matching by Finite Automata.
- A Faster V-order String Comparison Algorithm.
- Discovery of Regulatory Motifs in DNA.
- O(n log n)-time Text Compression by LZ-style Longest First Substitution.
- Fast and Simple Algorithms for Computing both LCSk and LCSk+.
- Right-to-left Online Construction of Parameterized Position Heaps.
- Fibonacci Based Compressed Suffix Array.
- Parameterized Dictionary Matching with One Gap.
- Dismantling DivSufSort.
- On Reverse Engineering the Lyndon Tree.
- Speeding Up String Matching by Weak Factor Recognition.
- Range Queries Using Huffman Wavelet Trees.
- Faster Batched Range Minimum Queries.
- Counting Mismatches with SIMD.
- A Lempel-Ziv-style Compression Method for Repetitive Texts.
- The Linear Equivalence of the Suffix Array and the Partially Sorted Lyndon Array.
- Trade-offs in Query and Target Indexing for the Selection of Candidates in Protein Homology Searches.
- A Family of Exact Pattern Matching Algorithms with Multiple Adjacent Search Windows.
- Regular Expressions with Backreferences Re-examined.
- Online Recognition of Dictionary with One Gap.
- Dynamic Succinct Data Structures and Compressed Random Access Memory.
- Many-MADFAct: Concurrently Constructing MADFAs.
- A Resource-frugal Probabilistic Dictionary and Applications in (Meta)Genomics.
- Jumbled Matching with SIMD.
- The String Matching Algorithms Research Tool.
- Interpreting the Subset Construction Using Finite Sublanguages.
- Using Human Computation in Dead-zone based 2D Pattern Matching.
- Generating All Minimal Petri Net Unsolvable Binary Words.
- Fast Full Permuted Pattern Matching Algorithms on Multi-track Strings.
- Accelerated Partial Decoding in Wavelet Trees.
- Algorithms to Compute the Lyndon Array.
- Forced Repetitions over Alphabet Lists.
- Computing All Approximate Enhanced Covers with the Hamming Distance.
- Computing Smallest and Largest Repetition Factorizations in O(n log n) Time.
- A Family of Data Compression Codes with Multiple Delimiters.
- Dynamic Index and LZ Factorization in Compressed Space.
- The Use and Usefulness of Fibonacci Codes.
- Computing Left-Right Maximal Generic Words.
- A Formal Framework for Stringology.
- Enhanced Extraction from Huffman Encoded Files.
- Efficient Algorithm for δ-Approximate Jumbled Pattern Matching.
- Refined Tagging of Complex Verbal Phrases for the Italian Language.
- Combinatorics of the Interrupted Period.
- Alternative Algorithms for Order-Preserving Matching.
- Controlling the Chunk-Size in Deduplication Systems.
- Parameterized Matching: Solutions and Extensions.
- Tuning Algorithms for Jumbled Matching.
- A Faster Longest Common Extension Algorithm on Compressed Strings and its Applications.
- An Efficient Skip-Search Approach to the Order-Preserving Pattern Matching Problem.
- Quantum Leap Pattern Matching.
- Closed Factorization.
- A Process-Oriented Implementation of Brzozowski’s DFA Construction Algorithm.
- New Tabulation and Sparse Dynamic Programming Based Techniques for Sequence Similarity Problems.
- Multiple Pattern Matching Revisited.
- Alternative Algorithms for Lyndon Factorization.
- Two Squares Canonical Factorization.
- Speeding up Compressed Matching with SBNDM2.
- Approximation of Greedy Algorithms for Max-ATSP, Maximal Compression, Maximal Cycle Cover, and Shortest Cyclic Cover of Strings.
- Fast Regular Expression Matching Based On Dual Glushkov NFA.
- Metric Preserving Dense SIFT Compression.
- Two Simple Full-Text Indexes Based on the Suffix Array.
- On the Number of Distinct Squares.
- Computing Abelian Covers and Abelian Runs.
- Threshold Approximate Matching in Grammar-Compressed Strings.
- Improved Two-Way Bit-parallel Search.
- Using Correctness-by-Construction to Derive Dead-zone Algorithms.
- Random Access to Fibonacci Codes.
- Efficient Online Abelian Pattern Matching in Strings by Simulating Reactive Multi-Automata.
- Reducing Squares in Suffix Arrays.
- Maximal Palindromic Factorization.
- Optimal Partitioning of Data Chunks in Deduplication Systems.
- Graphs and Automata.
- Crochemore’s String Matching Algorithm: Simplification, Extensions, Applications.
- Degenerate String Reconstruction from Cover Arrays.
- Sorting Suffixes of a Text via its Lyndon Factorization.
- The Sum of Exponents of Maximal Repetitions in Standard Sturmian Words.
- Swap Matching in Strings by Simulating Reactive Automata.
- Towards a Very Fast Multiple String Matching Algorithm for Short Patterns.
- Compact Complete Inverted Files for Texts and Directed Acyclic Graphs Based on Sequence Binary Decision Diagrams .
- On Morphisms Generating Run-Rich Strings.
- Computing Reversed Lempel-Ziv Factorization Online.
- Improved and Self-Tuned Occurrence Heuristics.
- Finding Distinct Subpalindromes Online.
- Deciding the Density Type of a Given Regular Language.
- Weak Factor Automata: Comparing (Failure) Oracles and Storacles.
- Parallel Suffix Array Construction by Accelerated Sampling.
- A Multiobjective Approach to the Weighted Longest Common Subsequence Problem.
- A Computational Framework for Determining Square-maximal Strings.
- Quasi-linear Time Computation of the Abelian Periods of a Word.
- BlastGraph: Intensive Approximate Pattern Matching in Sequence Graphs and de-Bruijn Graphs.
- The Number of Cubes in Sturmian Words.
- Correctness-by-Construction in Stringology.
- An Efficient Parallel Determinisation Algorithm for Finite-state Automata.
- Similarity Based Deduplication with Small Data Chunks.
- New and Efficient Approaches to the Quasiperiodic Characterisation of a String.
- LZW Data Compression on Large Scale and Extreme Distributed Systems.
- Failure Deterministic Finite Automata.
- Computing Longest Common Substring/Subsequence of Non-linear Texts.
- Binary Image Compression via Monochromatic Pattern Substitution: A Sequential Speed-Up.
- 2001-2010: Ten Years of Exact String Matching Algorithms.
- Algorithmics of Posets Generated by Words over Partially Commutative Alphabets.
- Finding Long and Multiple Repeats with Edit Distance.
- Variations of Forward-SBNDM.
- Efficient Eager XPath Filtering over XML Streams.
- Improving Exact Search of Multiple Patterns From a Compressed Suffix Array.
- Computing Abelian Periods in Words.
- On Compile Time Knuth-Morris-Pratt Precomputation.
- Notes on Sequence Binary Decision Diagrams: Relationship to Acyclic Automata and Complexities of Binary Set Operations.
- Computing the Number of Cubic Runs in Standard Sturmian Words.
- Minimization of Acyclic DFAs.
- Observations On Compressed Pattern-Matching with Ranked Variables in Zimin Words.
- Inexact Graph Matching by “Geodesic Hashing” for the Alignment of Pseudoknoted RNA Secondary Structures.
- A Parameterized Formulation for the Maximum Number of Runs Problem.
- Analyzing Edit Distance on Trees: Tree Swap Distance is Intractable.
- Improving Deduplication Techniques by Accelerating Remainder Calculations.
- Inferring Strings from Suffix Trees and Links on a Binary Alphabet.
- An Improved Version of the Runs Algorithm Based on Crochemore’s Partitioning Algorithm.
- Tiling Binary Matrices in Haplotyping: Complexity, Models and Algorithms.
- Inferring Strings from Runs.
- (In)approximability Results for Pattern Matching Problems.
- A Space-Efficient Implementation of the Good-Suffix Heuristic.
- Binary Image Compression via Monochromatic Pattern Substitution: Effectiveness and Scalability.
- The Number of Runs in a Ternary Word.
- Approximate String Matching Allowing for Inversions and Translocations.
- Simple Tree Pattern Matching for Trees in the Prefix Bar Notation.
- Tight and Simple Web Graph Compression.
- Bounded Number of Squares in Infinite Repetition-Constrained Binary Words.
- Formal Characterizations of FA-based String Processors.
- On the Complexity of Variants of the k Best Strings Problem.
- Average Number of Runs and Squares in Necklace.
- Improving Automata Efficiency by Stretching and Jamming.
- Reactive Links to Save Automata States.
- New Simple Efficient Algorithms Computing Powers and Runs in Strings.
- Practical Fixed Length Lempel Ziv Coding.
- An Efficient Algorithm for Approximate Pattern Matching with Swaps.
- An Input Sensitive Online Algorithm for LCS Computation.
- On the Usefulness of Backspace.
- Compressing Bi-Level Images by Block Matching on a Tree Architecture.
- Asymptotic Behaviour of the Maximal Number of Squares in Standard Sturmian Words.
- String Suffix Automata and Subtree Pushdown Automata.
- On Bijective Variants of the Burrows-Wheeler Transform.
- Bit-parallel Algorithms for Computing all the Runs in a String.
- Crochemore’s Repetitions Algorithm Revisited - Computing Runs.
- Constant-memory Iterative Generation of Special Strings Representing Binary Trees.
- Filter Based Fast Matching of Long Patterns by Using SIMD Instructions.
- On-line Construction of a Small Automaton for a Finite Set of Words.
- Finding All Covers of an Indeterminate String in O(n) Time on Average.
- Feature Extraction for Image Pattern Matching with Cellular Automata.
- Taxonomies of Regular Tree Algorithms.
- Reducing Repetitions.
- On Minimizing Deterministic Tree Automata.
- Parallel Algorithms for Degenerate and Weighted Sequences Derived from High Throughput Sequencing Technologies.
- Combining Text Compression and String Matching: The Miracle of Self-Indexing.
- Adapting Boyer-Moore-Like Algorithms for Searching Huffman Encoded Texts.
- Finding Characteristic Substrings from Compressed Texts.
- Delta Encoding in a Compressed Domain.
- Validation and Decomposition of Partially Occluded Images with Holes.
- Searching for Jumbled Patterns in Strings.
- Fast Optimal Algorithms for Computing All the Repeats in a String.
- An Adaptive Hybrid Pattern-Matching Algorithm on Indeterminate Strings.
- Dynamic Burrows-Wheeler Transform.
- Infinite Smooth Lyndon Words.
- Parameterized Suffix Arrays for Binary Strings.
- In-place Update of Suffix Array while Recoding Words.
- Average Value of Sum of Exponents of Runs in Strings.
- The Road Coloring and Cerny Conjecture.
- Usefulness of Directed Acyclic Subword Graphs in Problems Related to Standard Sturmian Words.
- New Efficient Bit-Parallel Algorithms for the delta-Matching Problem with alpha-Bounded Gaps in Musical Sequences.
- Huffman Coding with Non-Sorted Frequencies.
- A Concurrent Specification of an Incremental DFA Minimisation Algorithm.
- Edit Distance with Single-Symbol Combinations and Splits by Manolis.
- Speeding up Lossless Image Compression: Experimental Results on a Parallel Machine.
- On the Uniform Distribution of Strings.
- New Lower Bounds for the Maximum Number of Runs in a String.
- Lossless Image Compression by Block Matching on Practical Massively Parallel Architectures.
- Efficient Variants of the Backward-Oracle-Matching Algorithm.
- On Regular Expression Hashing to Reduce FA Size.
- Conservative String Covering of Indeterminate Strings.
- The Virtual Suffix Tree: An Efficient Data Structure for Suffix Trees and Suffix Arrays.
- A concurrent specification of Brzozowski’s DFA construction algorithm.
- 2D context-free grammars: Mathematical formulae recognition.
- Flipping letters to minimize the support of a string.
- On the problem of deciding if a polyomino tiles the plane by translation.
- Modeling delta encoding of compressed files.
- Sparse compact directed acyclic word graphs.
- The gapped-factor tree.
- Using alignment for multilingual text compression.
- Two-dimensional bitwise memory matrix: A tool for optimal parallel approximate pattern matching.
- On implementation and performance of table-driven DFA-based string processors.
- Can dist tables be merged in linear time - An Open Problem.
- A Markovian approach for the analysis of the gene structure.
- Working with compressed concordances.
- Efficient algorithms for (delta, gamma, alpha)-matching.
- Song classifications for dancing.
- Efficient automata constructions and approximate automata.
- An asymptotic lower bound for the maximal-number-of-runs function.
- On some combinatorial problems concerning the harmonic structure of musical chord sequences.
- FireµSat: An algorithm to detect microsatellites in DNA.
- FM-KZ: An even simpler alphabet-independent FM-index.
- Reachability on suffix tree graphs.
- Reordering finite automata states for fast string recognition.
- A space efficient bit-parallel algorithm for the multiple string matching problem.
- From suffix trees to suffix vectors.
- Alphabets in generic programming.
- Context-dependent stopper encoding.
- A missing link in root-to-frontier tree pattern matching.
- Asynchronous pattern matching - Metrics.
- Bounded size dictionary compression: Relaxing the LRU deletion heuristic.
- Backward pattern matching automaton.
- Bit-parallel computation of local similarity score matrices with unitary weights.
- A taxonomy of suffix array construction algorithms.
- Compressed pattern matching in JPEG images.
- Incremental string correction: Towards correction of XML documents.
- Flexible music retrieval in sublinear time.
- Approximation algorithm for the cyclic swap problem.
- A simple alphabet-independent FM-index.
- General pattern matching on regular collage system.
- Reconstructing a suffix array.
- A Simple Lossless Compression Heuristic for Grey Scale Images.
- Algorithms for the Constrained Longest Common Subsequence Problems.
- A Note on Bit-Parallel Alignment Computation.
- Conditional Inequalities and the Shortest Common Superstring Problem.
- BDD-Based Analysis of Gapped q-Gram Filters.
- A Framework for the Dynamic Implementation of Finite Automata for Performance Enhancement.
- A Fully Compressed Pattern Matching Algorithm for Simple Collage Systems.
- Sorting suffixes of two-pattern strings.
- Theoretical Issues of Searching Aerial Photographs: A Bird’s Eye View.
- Semi-Lossless Text Compression.
- A First Approach to Finding Common Motifs With Gaps.
- Arithmetic Coding in Parallel.
- Efficient Algorithms for the delta-Approximate String Matching Problem in Musical Sequences.
- Combinatorial Characterization of the Language Recognized by Factor and Suffix Oracles.
- Operation L-INSERT on Factor Automaton.
- Matching Numeric Strings under Noise.
- The Transformation Distance Problem Revisited.
- Learning the Morphological Features of a Large Set of Words.
- An Efficient Mapping for Score of String Matching.
- Approximate Seeds of Strings.
- A Linear Algorithm for the Detection of Evolutive Tandem Repeats.
- Forward-Fast-Search: Another Fast Variant of the Boyer-Moore String Matching Algorithm.
- Constructing Factor Oracles.
- Computing the Minimum k-Cover of a String.
- Computing the Repetitions in a Weighted Sequence.
- Image Recognition Using Finite Automata.
- A Note on Crochemore’s Repetitions Algorithm a Fast Space-Efficient Approach.
- A Recursive Function for Calculating the Number of Legal Strings of Parentheses and for Calculating Catalan Numbers.
- Bidirectional Construction of Suffix Trees.
- String Regularities with Don’t Cares.
- A Bit-Vector Algorithm for Computing Levenshtein and Damerau Edit Distances.
- A Work-Optimal Parallel Implementation of Lossless Image Compression by Block Matching.
- String Matching with Gaps for Musical Melodic Recognition.
- A Note on Randomized Algorithm for String Matching with Mismatches.
- Border Array on Bounded Alphabet.
- Split and join for minimizing: Brzozowski’s algorithm.
- Approximate String Matching in Musical Sequences.
- Searching in an Efficiently Stored DNA Text Using a Hardware Solution.
- A linear time string matching algorithm on average with efficient text storage.
- Bioinformatics: tools for analysis of biological sequences.
- Construction of the CDAWG for a Trie.
- Multiple Sequence Alignment as a Facility Location Problem.
- Computing Approximate Repetitions in Musical Sequences.
- The Set-Set Closest Common Subsequence Problem.
- Word-based Compression Method with Direct Access.
- A new family of Commentz-Walter-style multiple-keyword pattern matching algorithms.
- Repetitions in two-pattern strings.
- Condensation Principle.
- A New Practical Linear Space Algorithm for the Longest Common Subsequence Problem.
- Centroid Trees with Application to String Processing.
- On Procedures for Multiple-string Match with Respect to Two Sets.
- The Closest Common Subsequence Problems.
- A Fast String Matching Algorithm and Experimental Results.
- On the All Occurrences of a Word in a Text.
- Approximate String Matching by Fuzzy Automata.
- Validating and Decomposing Partially Occluded Two-Dimensional Images (Extended Abstract).
- Local Prediction for Lossless Image Compression.
- An Early-Retirement Plan for the States.
- Implementation of DAWG.
- A Fast Morphological Analysis Using the Extended AC Machine for Oriental Languages.
- The Factor Automaton.
- Exact String Matching Animation in Java.
- A Highly Parallel Finite State Automaton Processor for Biological Pattern Matching.
- Application of Sequence Alignment Methods to Multiple Structural Alignment and Superposition.
- The Longest Restricted Common Subsequence Problem.
- Dynamic Programming for Reduced NFAs for Approximate String and Sequence Matching.
- Directed Acyclic Subsequence Graph.
- SPARE Parts: A C++ Toolkit for String PAttern REcognition.
- Algebra of Pattern Matching Problems.
- A New Family of String Pattern Matching Algorithms.
- An Efficient Trie Hashing Method Using a Compact Binary Trie.
- 6D Classification of Pattern Matching Problems.
- A Boyer-Moore (or Watson-Watson) Type Algorithm for Regular Tree Pattern Matching.
- Simulation of NFA in Approximate String and Sequence Matching.
- An Efficient Multi-Attribute Pattern Matching Machine.
- Efficiency of AC-Machine and SNFA in Practical String Matching.
- Approximate Regular Expression Matching.
- Space Complexity of Linear Time Approximate String Matching.
- A Collection of New Regular Grammar Pattern Matching Algorithms.
- Fast Full Text Search Using Tree Structured[TS] File.
- Reduced Nondeterministic Finite Automata for Approximate String Matching.