Papers for stringologist (2013)
Contents
- Fast Packed String Matching for Short Patterns.
- Inducing Suffix and Lcp Arrays in External Memory.
- Lempel-Ziv factorization: Simple, fast, practical.
- Compressed Automata for Dictionary Matching.
- Average Optimal String Matching in Packed Strings.
- A Bit-Parallel, General Integer-Scoring Sequence Alignment Algorithm.
- A Constant-Space Comparison-Based Algorithm for Computing the Burrows-Wheeler Transform.
- A Succinct Grammar Compression.
- Approximating Shortest Superstring Problem Using de Bruijn Graphs.
- Approximation of Grammar-Based Compression via Recompression.
- Compact q-Gram Profiling of Compressed Strings.
- Converting SLP to LZ78 in almost Linear Time.
- Data Structure Lower Bounds on Random Access to Grammar-Compressed Strings.
- Discrete Methods for Image Analysis Applied to Molecular Biology.
- Document Listing on Repetitive Collections.
- Efficient All Path Score Computations on Grid Graphs.
- Efficient Lyndon Factorization of Grammar Compressed Text.
- External Memory Generalized Suffix and LCP Arrays Construction.
- Fast Algorithm for Partial Covers in Words.
- Forty Years of Text Indexing.
- LCP Magic.
- Linear Time Lempel-Ziv Factorization: Simple, Fast, Small.
- Local Search for String Problems: Brute Force Is Essentially Optimal.
- Locating All Maximal Approximate Runs in a String.
- New Algorithms for Position Heaps.
- On Minimal and Maximal Suffixes of a Substring.
- Pattern Matching with Variables: A Multivariate Complexity Analysis.
- Space-Efficient Construction Algorithm for the Circular Suffix Tree.
- Time-Space Trade-Offs for the Longest Common Substring Problem.
- Alphabetic Minimax Trees in Linear Time.
- Discovering Hidden Repetitions in Words.
- A Simple Online Competitive Adaptation of Lempel-Ziv Compression with Efficient Random Access Support.
- Algorithms for Compressed Inputs.
- An Adaptive Difference Distribution-Based Coding with Hierarchical Tree Structure for DNA Sequence Compression.
- Compressed Parameterized Pattern Matching.
- Compressing Huffman Models on Large Alphabets.
- Computing Convolution on Grammar-Compressed Text.
- Effective Variable-Length-to-Fixed-Length Coding via a Re-Pair Algorithm.
- Faster Compact Top-k Document Retrieval.
- Faster Compressed Top-k Document Retrieval.
- From Run Length Encoding to LZ78 and Back Again.
- Partition Tree Weighting.
- Practical Parallel Lempel-Ziv Factorization.
- Quadratic Similarity Queries on Compressed Data.
- Random Extraction from Compressed Data - A Practical Study.
- Simpler and Faster Lempel Ziv Factorization.
- Space-Efficient Construction Algorithm for the Circular Suffix Tree.
- Texture Compression.
- The Rightmost Equal-Cost Position Problem.
- Variable-to-Fixed-Length Encoding for Large Texts Using Re-Pair Algorithm with Shared Dictionaries.
- Abelian Repetitions in Sturmian Words.
- On the Number of Abelian Bordered Words.
- Repetition Avoidance in Circular Factors.
- Suffixes, Conjugates and Lyndon Words.
- Binary Jumbled Pattern Matching on Trees and Tree-Like Structures.
- Compressed Cache-Oblivious String B-tree.
- Efficient Indexes for Jumbled Pattern Matching with Constant-Sized Alphabet.
- Encodings for Range Selection and Top-k Queries.
- Optimal Color Range Reporting in One Dimension.
- Parallel String Sample Sort.
- The Encoding Complexity of Two Dimensional Range Minimum Data Structures.
- Versatile Succinct Representations of the Bidirectional Burrows-Wheeler Transform.
- Combining Binary Search Trees.
- Dynamic Compressed Strings with Random Access.
- Sparse Suffix Tree Construction in Small Space.
- Tree Compression with Top Trees.
- n-step FM-Index for Faster Pattern Matching.
- A reconfigurable stream compression hardware based on static symbol-lookup table.
- Beating $\mathcal{O}(nm)$ in Approximate LZW-Compressed Pattern Matching.
- Faster, Space-Efficient Selection Algorithms in Read-Only Memory for Integers.
- Less Space: Indexing for Queries with Wildcards.
- Pattern Matching with Non Overlapping Reversals - Approximation and On-line Algorithms.
- RAM-Efficient External Memory Sorting.
- Single and Multiple Consecutive Permutation Motif Search.
- Sliding Bloom Filters.
- Succinct Data Structures for Representing Equivalence Classes.
- Top-k Document Retrieval in Compact Space and Near-Optimal Time.
- An Optimal Algorithm for Computing All Subtree Repeats in Trees.
- Deciding Representability of Sets of Words of Equal Length in Polynomial Time.
- Motif Matching Using Gapped Patterns.
- Prefix Table Construction and Conversion.
- Suffix Tree of Alignment: An Efficient Index for Similar Data.
- On the Value of Multiple Read/Write Streams for Data Compression.
- Linear-Time Version of Holub’s Algorithm for Morphic Imprimitivity Testing.
- On the Number of Unbordered Factors.
- Detecting Regularities on Grammar-Compressed Strings.
- Decoupled compressed cache: exploiting spatial locality for energy-optimized compressed caching.
- Linearly compressed pages: a low-complexity, low-latency main memory compression framework.
- Accelerating String Matching on MIC Architecture for Motif Extraction.
- Lightweight Lempel-Ziv Parsing.
- Space-Efficient, High-Performance Rank and Select Structures on Uncompressed Bit Sequences.
- Faster and smaller inverted indices with treaps.
- Adaptive and Approximate Orthogonal Range Counting.
- Compressed static functions with applications.
- Lyndon Words and Short Superstrings.
- Near-Optimal Range Reporting Structures for Categorical Data.
- Optimal Dynamic Sequence Representations.
- The Space Complexity of 2-Dimensional Approximate Range Counting.
- Twisted Tabulation Hashing.
- Permuted Pattern Matching on Multi-track Strings.
- A Lempel-Ziv Compressed Structure for Document Listing.
- Accurate Profiling of Microbial Communities from Massively Parallel Sequencing Using Convex Optimization.
- Adaptive Data Structures for Permutations and Binary Relations.
- Adding Compression and Blended Search to a Compact Two-Level Suffix Array.
- Compact Querieable Representations of Raster Data.
- Consolidating and Exploring Information via Textual Inference.
- Discovering Dense Subgraphs in Parallel for Compressing Web and Social Networks.
- Distributed Query Processing on Compressed Graphs Using K2-Trees.
- Document Listing on Versioned Documents.
- Efficient Approximation of Edit Distance.
- Faster Lyndon Factorization Algorithms for SLP and LZ78 Compressed Text.
- Faster Range LCP Queries.
- Faster Top-k Document Retrieval in Optimal Space.
- Fully-Online Grammar Compression.
- Indexes for Jumbled Pattern Matching in Strings, Trees and Graphs.
- Learning URL Normalization Rules Using Multiple Alignment of Sequences.
- Learning to Schedule Webpage Updates Using Genetic Programming.
- Lossless Compression of Rotated Maskless Lithography Images.
- Minimal Discriminating Words Problem Revisited.
- Nowcasting with Google Trends.
- On Two-Dimensional Lyndon Words.
- Order-Preserving Incomplete Suffix Trees and Order-Preserving Indexes.
- Pattern Discovery and Listing in Graphs.
- Position-Restricted Substring Searching over Small Alphabets.
- Query Processing in Highly-Loaded Search Engines.
- Simulation Study of Multi-threading in Web Search Engine Processors.
- Solving Graph Isomorphism Using Parameterized Matching.
- Space-Efficient Construction of the Burrows-Wheeler Transform.
- Suffix Array of Alignment: A Practical Index for Similar Data.
- Top-k Color Queries on Tree Paths.
- Using Mutual Influence to Improve Recommendations.
- You Are What You Eat: Learning User Tastes for Rating Prediction.
- Fast Algorithms for Abelian Periods in Words and Greatest Common Divisor Queries.
- Finding Pseudo-repetitions.
- Parameterized Matching in the Streaming Model.
- Recompression: a simple and powerful technique for word equations.
- Compact Complete Inverted Files for Texts and Directed Acyclic Graphs Based on Sequence Binary Decision Diagrams .
- Computing Reversed Lempel-Ziv Factorization Online.
- Crochemore’s String Matching Algorithm: Simplification, Extensions, Applications.
- Deciding the Density Type of a Given Regular Language.
- Degenerate String Reconstruction from Cover Arrays.
- Finding Distinct Subpalindromes Online.
- Graphs and Automata.
- Improved and Self-Tuned Occurrence Heuristics.
- Maximal Palindromic Factorization.
- On Morphisms Generating Run-Rich Strings.
- Optimal Partitioning of Data Chunks in Deduplication Systems.
- Parallel Suffix Array Construction by Accelerated Sampling.
- Sorting Suffixes of a Text via its Lyndon Factorization.
- Swap Matching in Strings by Simulating Reactive Automata.
- The Sum of Exponents of Maximal Repetitions in Standard Sturmian Words.
- Towards a Very Fast Multiple String Matching Algorithm for Short Patterns.
- Weak Factor Automata: Comparing (Failure) Oracles and Storacles.
- A Fixed-Parameter Algorithm for Minimum Common String Partition with Few Duplications.
- Probabilistic Approaches to Alignment with Tandem Repeats.
- Using Cascading Bloom Filters to Improve the Memory Usage for de Brujin Graphs.
- Better Space Bounds for Parameterized Range Majority and Minority.
- Compressed Persistent Index for Efficient Rank/Select Queries.
- Fingerprints in Compressed Strings.
- On (Dynamic) Range Minimum Queries in External Memory.
ACM Comput. Surv. 2013
- Spaces, Trees, and Colors: The algorithmic landscape of document retrieval on sequences.
ACM J. Exp. Algorithmics 2013
- Compressed suffix trees: Efficient computation and storage of LCP-values.
ACM Trans. Algorithms 2013
- Optimal Pattern Matching in LZW Compressed Strings.
Algorithmica 2013
- Distribution-Aware Compressed Full-Text Indexes.
- Sublinear Algorithms for Approximating String Compressibility.
Algorithms 2013
- Practical Compressed Suffix Trees.
Algorithms Mol. Biol. 2013
- Data compression for sequencing data.
- libgapmis: extending short-read alignments.
CoRR 2013
- 2D Lyndon Words and Applications
- A Dynamic Programming Solution to a Generalized LCS Problem
- A Fixed-Parameter Algorithm for Minimum Common String Partition with Few Duplications.
- A Functional Approach to Standard Binary Heaps.
- A Note on the Longest Common Compatible Prefix Problem for Partial Words.
- A Succinct Grammar Compression
- A general definition of the big-oh notation for algorithm analysis.
- A simple online competitive adaptation of Lempel-Ziv compression with efficient random access support
- AliBI: An Alignment-Based Index for Genomic Datasets.
- Alphabet-Dependent String Searching with Wexponential Search Trees
- An Efficient Dynamic Programming Algorithm for the Generalized LCS Problem with Multiple Substring Exclusion Constrains
- An Elegant Algorithm for the Construction of Suffix Arrays.
- Approximate String Matching using a Bidirectional Index.
- Approximation of grammar-based compression via recompression
- Approximation of smallest linear tree grammar.
- Average Case and Distributional Analysis of Java 7’s Dual Pivot Quicksort
- Beating O(nm) in approximate LZW-compressed pattern matching.
- Bicriteria data compression.
- Bin Packing and Related Problems: General Arc-flow Formulation with Graph Compression.
- Binary Jumbled Pattern Matching on Trees and Tree-Like Structures
- Compact q-gram Profiling of Compressed Strings
- Complexity of the FIFO Stack-Up Problem.
- Compressed Pattern-Matching with Ranked Variables in Zimin Words.
- Compressed Spaced Suffix Arrays.
- Computing convolution on grammar-compressed text
- Computing the Longest Increasing Subsequence of a Sequence Subject to Dynamic Insertion.
- Data Structures in Classical and Quantum Computing.
- Detecting regularities on grammar-compressed strings
- Domain Specific Hierarchical Huffman Encoding.
- Dynamic 2D Dictionary Matching in Small Space
- Dynamic Gomory-Hu Tree Construction - fast and simple.
- ELB-Trees, An Efficient and Lock-free B-tree Derivative.
- Efficient Lyndon factorization of grammar compressed text
- Efficient algorithms for the longest common subsequence in $k$-length substrings.
- Efficient repeat finding via suffix arrays
- Efficiently Computing Edit Distance to Dyck Language.
- Encoding Range Minimum Queries.
- Engineering Small Space Dictionary Matching
- Estimating the longest increasing sequence in polylogarithmic time.
- Faster Compact On-Line Lempel-Ziv Factorization
- Finding Distinct Subpalindromes Online
- Finding small patterns in permutations in linear time.
- Fingerprints in Compressed Strings
- First-Come-First-Served for Online Slot Allocation and Huffman Coding.
- From Theory to Practice: Plug and Play with Succinct Data Structures.
- Full-fledged Real-Time Indexing for Constant Size Alphabets
- GPU Accelerated Multiple Deoxyribose Nucleic Acid Sequence Parallel Matching
- Heaviest Induced Ancestors and Longest Common Substrings
- Hybrid Indexes for Repetitive Datasets.
- Indexes for Jumbled Pattern Matching in Strings, Trees and Graphs
- LZ-Compressed String Dictionaries
- Large-Scale Pattern Search Using Reduced-Space On-Disk Suffix Arrays
- Lempel-Ziv Parsing in External Memory.
- Lightweight LCP Construction for Next-Generation Sequencing Datasets
- Lightweight Lempel-Ziv Parsing
- Minimal Indices for Successor Search.
- Modulated String Searching
- Motif matching using gapped patterns.
- Near-optimal labeling schemes for nearest common ancestors.
- On Updating and Querying Sub-arrays of Multidimensional Arrays.
- On a compact encoding of the swap automaton.
- On string matching with k mismatches.
- One-variable word equations in linear time
- Optimal Color Range Reporting in One Dimension.
- Optimal Partitioning for Dual Pivot Quicksort
- Optimal Top-k Document Retrieval.
- Order Preserving Matching
- Order-Preserving Suffix Trees and Their Algorithmic Applications
- Order-preserving pattern matching with k mismatches.
- Orthogonal Range Searching for Text Indexing.
- Parallel Algorithm for Longest Common Subsequence in a String.
- Parallel String Sample Sort
- Parallel Suffix Array Construction by Accelerated Sampling
- QuickXsort: Efficient Sorting with n log n - 1.399n +o(n) Comparisons on Average.
- RAM-Efficient External Memory Sorting.
- Regular Expression Searching in Sublinear Time.
- Repetition-free longest common subsequence of random sequences
- Set-Difference Range Queries.
- Shortest Unique Substring Query Revisited.
- Simple, compact and robust approximate string dictionary.
- Single and multiple consecutive permutation motif search
- Sorted Range Reporting Revisited.
- Sorting suffixes of a text via its Lyndon Factorization.
- Space Efficient Linear Time Lempel-Ziv Factorization on Constant~Size~Alphabets.
- Statistically-secure ORAM with $\tilde{O}(\log^2 n)$ Overhead.
- Substring Suffix Selection.
- Succinct data structures for representing equivalence classes.
- Succinct representation of labeled trees.
- Suffix Trays and Suffix Trists: Structures for Faster Text Indexing.
- Suffix Tree of Alignment: An Efficient Index for Similar Data
- TRANS outperforms MTF for two special types of request sequences without locality of reference.
- The Swap Matching Problem Revisited.
- The technique of in-place associative sorting.
- Tree Compression with Top Trees
- Tree-based Arithmetic and Compressed Representations of Giant Numbers
- Using cascading Bloom filters to improve the memory usage for de Brujin graphs
- Various improvements to text fingerprinting
- Web graph compression with fast access
- XML Compression via DAGs.
Eur. J. Comb. 2013
- Computing the Longest Previous Factor.
- Minimax trees in linear time with applications.
IEICE Trans. Inf. Syst. 2013
- Scalable Detection of Frequent Substrings by Grammar-Based Compression.
Inf. Comput. 2013
- Compact binary relation representations with rich functionality.
- Range majority in constant time and linear space.
Inf. Process. Manag. 2013
- DACs: Bringing direct access to variable-length codes.
Inf. Syst. 2013
- Space-efficient representations of rectangle datasets supporting orthogonal range querying.
- Succinct nearest neighbor search.
Int. J. Comput. Biol. Drug Des. 2013
- Querying highly similar sequences.
Int. J. Comput. Geom. Appl. 2013
- External Memory orthogonal Range Reporting with Fast Updates.
J. Discrete Algorithms 2013
- Computing the longest common prefix array based on the Burrows-Wheeler transform.
- ESP-index: A compressed index based on edit-sensitive parsing.
- Fast q-gram mining on SLP compressed strings.
- Improved compressed indexes for full-text document retrieval.
- Various improvements to text fingerprinting.
SIAM J. Comput. 2013
- On the Bit-Complexity of Lempel-Ziv Compression.
Theor. Comput. Sci. 2013
- Colored range queries and document retrieval.
- Enhanced string covering.
- On compressing and indexing repetitive sequences.
- On compressing permutations and adaptive sorting.
- On the weak prefix-search problem.
- Palindrome pattern matching.
- Space-efficient data-analysis queries on grids.
- Succinct encoding of arbitrary graphs.