Papers for stringologist (2016)
Contents
- A General Framework for Dynamic Succinct and Compressed Data Structures.
- On-Line Pattern Matching on Uncertain Sequences and Applications.
- A 7/2-Approximation Algorithm for the Maximum Duo-Preservation String Mapping Problem.
- A Linear-Time Algorithm for the Copy Number Transformation Problem.
- Boxed Permutation Pattern Matching.
- Color-Distance Oracles and Snippets.
- Deterministic Sub-Linear Space LCE Data Structures With Efficient Construction.
- Efficient Index for Weighted Sequences.
- Efficient Non-Binary Gene Tree Resolution with Weighted Reconciliation Cost.
- Encoding Two-Dimensional Range Top-k Queries.
- Estimating Statistics on Words Using Ambiguous Descriptions.
- Factorizing a String into Squares in Linear Time.
- Fast Compatibility Testing for Rooted Phylogenetic Trees.
- Faster Longest Common Extension Queries in Strings over General Alphabets.
- Finding Maximal 2-Dimensional Palindromes.
- Front Matter, Table of Contents, Preface.
- Fully-online Construction of Suffix Trees for Multiple Texts.
- Genomic Scaffold Filling Revisited.
- Graph Motif Problems Parameterized by Dual.
- Hardness of RNA Folding Problem With Four Symbols.
- Linear-time Suffix Sorting - A New Approach for Suffix Array Construction.
- Longest Common Substring with Approximately k Mismatches.
- Minimal Suffix and Rotation of a Substring in Optimal Time.
- On Almost Monge All Scores Matrices.
- On the Benefit of Merging Suffix Array Intervals for Parallel Pattern Matching.
- Optimal Prefix Free Codes with Partial Sorting.
- Reconstruction of Trees from Jumbled and Weighted Subtrees.
- Space-Efficient Dictionaries for Parameterized and Order-Preserving Pattern Matching.
- Succinct Online Dictionary Matching with Improved Worst-Case Guarantees.
- The Nearest Colored Node in a Tree.
- Tight Tradeoffs for Real-Time Approximation of Longest Palindromes in Streams.
- Truly Subquadratic-Time Extension Queries and Periodicity Detection in Strings with Uncertainties.
- A Simple and Efficient Approach for Adaptive Entropy Coding over Large Alphabets.
- A Space Efficient Direct Access Data Structure.
- Analysis of a Rewriting Compression System for Flash Memory.
- Approximate String Matching for Self-Indexes.
- Burrows-Wheeler Transform for Terabases.
- CS2A: A Compressed Suffix Array-Based Method for Short Read Alignment.
- Compressing Combinatorial Objects.
- Computing LZ77 in Run-Compressed Space.
- Efficient Compression of Genomic Sequences.
- Efficient Environmental Temperature Monitoring Using Compressed Sensing.
- Faster, Minuter.
- Grammatical Ziv-Lempel Compression: Achieving PPM-Class Text Compression Ratios with LZ-Class Decompression Speed.
- Hardware Based Compression in Big Data.
- Improved Range Minimum Queries.
- Induced Suffix Sorting for String Collections.
- Lempel-Ziv Computation in Compressed Space (LZ-CICS).
- Linear Time Succinct Indexable Dictionary Construction with Applications.
- Lossy Compression of Unordered Rooted Trees.
- Online Grammar Transformation Based on Re-Pair Algorithm.
- Parallel Lightweight Wavelet Tree, Suffix Array and FM-Index Construction.
- Positional Inverted Self-index.
- Quick Access to Compressed Data in Storage Systems.
- Self-Indexing RDF Archives.
- Shortest DNA Cyclic Cover in Compressed Space.
- Small Polygon Compression.
- Timeliness in Lossless Block Coding.
- When Less is More - Using Restricted Repetition Search in Fast Compressors.
- Ternary Square-Free Partial Words with Many Wildcards.
- BlockQuicksort: Avoiding Branch Mispredictions in Quicksort.
- Faster External Memory LCP Array Construction.
- Streaming Pattern Matching with d Wildcards.
- Edit Distance: Sketching, Streaming, and Document Exchange.
- Finger Search in Grammar-Compressed Strings.
- LZ77 Factorisation of Trees.
- Approximate Hamming Distance in a Stream.
- Data Structure Lower Bounds for Document Indexing Problems.
- Towards Tight Lower Bounds for Range Reporting on the RAM.
- Dynamic Relative Compression, Dynamic Partial Sums, and Substring Concatenation.
- Mind the Gap: Essentially Optimal Algorithms for Online Dictionary Matching with One Gap.
- Pattern Matching and Consensus Problems on Weighted Sequences and Profiles.
- Space-Time Trade-Offs for the Shortest Unique Substring Problem.
- Is There an Oblivious RAM Lower Bound?
- Finding Gapped Palindromes Online.
- Packed Compact Tries: A Fast and Efficient Data Structure for Online String Processing.
- Partial Covering Arrays: Algorithms and Asymptotics.
- Scalable Partial Least Squares Regression on Grammar-Compressed Data Matrices.
- On Del-Robust Primitive Partial Words with One Hole.
- Optimal Bounds for Computing \alpha α -gapped Repeats.
- Bidirectional Variable-Order de Bruijn Graphs.
- Compressing Bounded Degree Graphs.
- Deterministic Sparse Suffix Sorting on Rewritable Texts.
- Linear-Time Sequence Comparison Using Minimal Absent Words & Applications.
- The Grandmama de Bruijn Sequence for Binary Strings.
- Tree Compression Using String Grammars.
- Computing DAWGs and Minimal Absent Words in Linear Time for Integer Alphabets.
- Dividing by Zero - How Bad Is It, Really?.
- Fully Dynamic Data Structure for LCE Queries in Compressed Space.
- Shortest Unique Substring Queries on Run-Length Encoded Strings.
- CHICO: A Compressed Hybrid Index for Repetitive Collections.
- Fast Scalable Construction of (Minimal Perfect Hash) Functions.
- Lempel-Ziv Decoding in External Memory.
- Practical Dynamic Entropy-Compressed Bitvectors with Applications.
- Practical Variable Length Gap Pattern Matching.
- Worst-Case-Efficient Dynamic Arrays in Practice.
- Fast and Compact Hamming Distance Index.
- Succinct Data Structures in Information Retrieval: Theory and Practice.
- Range Predecessor and Lempel-Ziv Parsing.
- The k-mismatch problem revisited.
- Weighted dynamic finger in binary search trees.
- Compacting a Dynamic Edit Distance Table by RLE Compression.
- Subsequence Automata with Default Transitions.
- A Linear-Space Algorithm for the Substring Constrained Alignment Problem.
- AC-Automaton Update Algorithm for Semi-dynamic Dictionary Matching.
- Analyzing Relative Lempel-Ziv Reference Construction.
- Bookmarks in Grammar-Compressed Strings.
- Compact Trip Representation over Networks.
- Dynamic and Approximate Pattern Matching in 2D.
- Efficient Representation of Multidimensional Data over Hierarchical Domains.
- Efficient and Compact Representations of Some Non-canonical Prefix-Free Codes.
- Fast Classification of Protein Structures by an Alignment-Free Kernel.
- Fragmented BWT: An Extended BWT for Full-Text Indexing.
- Fully Dynamic de Bruijn Graphs.
- GraCT: A Grammar Based Compressed Representation of Trajectories.
- Inverse Range Selection Queries.
- LCP Array Construction Using O(sort(n)) (or Less) I/Os.
- Lexical Matching of Queries and Ads Bid Terms in Sponsored Search.
- Longest Common Abelian Factors and Large Alphabets.
- Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array.
- Maximal Unbordered Factors of Random Strings.
- Near-Optimal Computation of Runs over General Alphabet via Non-Crossing LCE Queries.
- Parallel Computation for the All-Pairs Suffix-Prefix Problem.
- Parallel Lookups in String Indexes.
- Pattern Matching for Separable Permutations.
- RLZAP: Relative Lempel-Ziv with Adaptive Pointers.
- The Smallest Grammar Problem Revisited.
- XBWT Tricks.
- Efficiently Finding All Maximal alpha-gapped Repeats.
- External Memory Three-Sided Range Reporting and Top-k Queries with Sublogarithmic Updates.
- Periods and Borders of Random Words.
- Streaming algorithms for embedding and computing edit distance in the low distance regime.
- A Framework for Dynamic Parameterized Dictionary Matching.
- A Simple Mergeable Dictionary.
- Cuckoo Filter: Simplification and Analysis.
- Lower Bounds for Approximation Schemes for Closest String.
- A Family of Data Compression Codes with Multiple Delimiters.
- A Resource-frugal Probabilistic Dictionary and Applications in (Meta)Genomics.
- Accelerated Partial Decoding in Wavelet Trees.
- Algorithms to Compute the Lyndon Array.
- Computing All Approximate Enhanced Covers with the Hamming Distance.
- Computing Smallest and Largest Repetition Factorizations in O(n log n) Time.
- Dynamic Index and LZ Factorization in Compressed Space.
- Fast Full Permuted Pattern Matching Algorithms on Multi-track Strings.
- Forced Repetitions over Alphabet Lists.
- Generating All Minimal Petri Net Unsolvable Binary Words.
- Interpreting the Subset Construction Using Finite Sublanguages.
- Jumbled Matching with SIMD.
- The String Matching Algorithms Research Tool.
- The Use and Usefulness of Fibonacci Codes.
- Using Human Computation in Dead-zone based 2D Pattern Matching.
- A Graph Extension of the Positional Burrows-Wheeler Transform and Its Applications.
- A Natural Encoding of Genetic Variation in a Burrows-Wheeler Transform to Enable Mapping and Genome Inference.
- Optimal Computation of Avoided Words.
ACM J. Exp. Algorithmics 2016
- Faster Compressed Suffix Trees for Repetitive Collections.
- Inducing Suffix and LCP Arrays in External Memory.
- LCP Array Construction in External Memory.
- Lazy Lempel-Ziv Factorization Algorithms.
ACM Trans. Algorithms 2016
- Compressed Cache-Oblivious String B-Tree.
- Data Structures for Path Queries.
- Sparse Text Indexing in Small Space.
ACM Trans. Archit. Code Optim. 2016
- Yet Another Compressed Cache: A Low-Cost Yet Effective Compressed Cache.
Algorithmica 2016
- Compressed String Dictionary Search with Edit Distance One.
- Optimal Encodings for Range Majority Queries.
Algorithms 2016
- siEDM: An Efficient String Index and Search Algorithm for Edit Distance with Moves.
Algorithms Mol. Biol. 2016
- Circular sequence comparison: algorithms and applications.
- Erratum to: Circular sequence comparison: algorithms and applications.
- libFLASM: a software library for fixed-length approximate string matching.
CoRR 2016
- A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs.
- A New Lightweight Algorithm to compute the BWT and the LCP array of a Set of Strings.
- A Self-Index on Block Trees.
- A Simpler Bit-parallel Algorithm for Swap Matching.
- A family of fast exact pattern matching algorithms.
- A hardness result and new algorithm for the longest common palindromic subsequence problem.
- A loopless and branchless $O(1)$ algorithm to generate the next Dyck word.
- Access Time Tradeoffs in Archive Compression.
- Aggregated 2D Range Queries on Clustered Points.
- Algorithms to Compute the Lyndon Array.
- An Estimation of the Size of Non-Compact Suffix Trees.
- An Optimal Algorithm for Range Search on Multidimensional Points.
- Approximate Hamming distance in a stream.
- Asymmetric Rényi Problem and PATRICIA Tries.
- Average Size of a Suffix Tree for Markov Sources.
- Bidirectional Conditional Insertion Sort algorithm; An efficient progress on the classical insertion sort.
- Burrows-Wheeler transform and LCP array construction in constant space.
- CSA++: Fast Pattern Search for Large Alphabets.
- Comments on Dumitrescu’s “A Selectable Sloppy Heap”.
- Compressed Dynamic Range Majority Data Structures.
- Compressing Graphs and Indexes with Recursive Graph Bisection.
- Compressing and Indexing Stock Market Data.
- Computing All Distinct Squares in Linear Time for Integer Alphabets.
- Computing Longest Increasing Subsequence Over Sequential Data Streams.
- Computing longest single-arm-gapped palindromes in a string.
- Data Structure Lower Bounds for Document Indexing Problems.
- Designing optimal- and fast-on-average pattern matching algorithms.
- Detecting Unary Patterns.
- Deterministic Indexing for Packed Strings.
- Deterministic sub-linear space LCE data structures with efficient construction.
- Distortion-Resistant Hashing for rapid search of similar DNA subsequence.
- Document Retrieval on Repetitive String Collections.
- Dual-Pivot Quicksort: Optimality, Analysis and Zeros of Associated Lattice Paths.
- Dynamic Time Warping: Breaking the Quadratic Barrier.
- Dynamic index and LZ factorization in compressed space.
- Edit Distance: Sketching, Streaming and Document Exchange.
- Efficient Index Maintenance Under Dynamic Genome Modification.
- Efficient Index for Weighted Sequences.
- Efficient Pattern Matching in Elastic-Degenerate Strings.
- Efficient Representation of Multidimensional Data over Hierarchical Domains.
- Efficient and Compact Representations of Some Non-Canonical Prefix-Free Codes.
- Encoding Arguments.
- Energy-Efficient Algorithms.
- Engineering a Distributed Full-Text Index.
- Fast Longest Common Extensions in Small Space.
- Faster Longest Common Extension Queries in Strings over General Alphabets.
- From H&M to Gap for Lightweight BWT Merging.
- Fully Dynamic de Bruijn Graphs.
- Fully dynamic data structure for LCE queries in compressed space.
- Games from Basic Data Structures.
- GateKeeper: Enabling Fast Pre-Alignment in DNA Short Read Mapping with a New Streaming Accelerator Architecture.
- GraCT: A Grammar based Compressed representation of Trajectories.
- Hardness of Permutation Pattern Matching.
- Improved Space efficient algorithms for BFS, DFS and applications.
- In-Place Longest Common Extensions.
- LZ-End Parsing in Compressed Space.
- Lempel-Ziv Decoding in External Memory.
- Lightweight LCP Construction for Very Large Collections of Strings.
- Linear-time string indexing and analysis in small space.
- Longest Common Extensions with Recompression.
- Longest Common Subsequence in at Least k Length Order-isomorphic Substrings.
- Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array.
- Minimal Suffix and Rotation of a Substring in Optimal Time.
- Monte Carlo Sort for unreliable human comparisons.
- Multi-view pattern matching.
- Near-Optimal Computation of Runs over General Alphabet via Non-Crossing LCE Queries.
- New Error Tolerant Method to Search Long Repeats in Symbol Sequences.
- Oblivious Sorting and Queues.
- On pattern matching with k mismatches and few don’t cares.
- On the Benefit of Merging Suffix Array Intervals for Parallel Pattern Matching.
- On the Size of Lempel-Ziv and Lyndon Factorizations.
- On the circuit complexity of the standard and the Karatsuba methods of multiplying integers.
- Online Grammar Compression for Frequent Pattern Discovery.
- Optimal Computation of Avoided Words.
- Optimal In-Place Suffix Sorting.
- Optimal Prefix Free Codes With Partial Sorting.
- Optimizing run-length algorithm using octonary repetition tree.
- Pachinko.
- Packed Compact Tries: A Fast and Efficient Data Structure for Online String Processing.
- Parameterized Pattern Matching - Succinctly.
- Practical Data Compression for Modern Memory Hierarchies.
- Practical combinations of repetition-aware data structures.
- RLZAP: Relative Lempel-Ziv with Adaptive Pointers.
- Randomized Ternary Search Tries.
- Range Majorities and Minorities in Arrays.
- Rank and select: Another lesson learned.
- Representing Pattern Matching Algorithms by Polynomial-Size Automata.
- Scalable Construction of Text Indexes.
- Scalable Partial Least Squares Regression on Grammar-Compressed Data Matrices.
- Shortest unique palindromic substring queries in optimal time.
- Simple and Efficient Fully-Functional Succinct Trees.
- Sort Race.
- Sort well with energy-constrained comparisons.
- Sorting Discrete i.i.d. Inputs: Quicksort is Optimal.
- Space-Efficient Construction of Compressed Indexes in Deterministic Linear Time.
- Space-Efficient Re-Pair Compression.
- Sparse Suffix Tree Construction in Optimal Time and Space.
- Streaming Algorithms For Computing Edit Distance Without Exploiting Suffix Trees.
- Streaming k-mismatch with data recovery and applications.
- String Cadences.
- String Inference from the LCP Array.
- Succinct Choice Dictionaries.
- Succinct data-structure for nearest colored node in a tree.
- Suffix arrays with a twist.
- The Generalized Smallest Grammar Problem.
- The complexity of bit retrieval.
- The landscape of bounds for binary search trees.
- Tight Lower Bounds for the Longest Common Extension Problem.
- Tight Tradeoffs for Real-Time Approximation of Longest Palindromes in Streams.
- Tight bound on the maximum number of shortest unique substrings.
- Toward a Succinct Index for Order-Preserving Pattern Matching.
- Twenty (simple) questions.
- Two-stage algorithms for covering array construction.
- TwoPaCo: An efficient algorithm to build the compacted de Bruijn graph from many complete genomes.
- Universal Indexes for Highly Repetitive Document Collections.
- Variance of the Internal Profile in Suffix Trees.
- Word Existence Algorithm.
- siEDM: an efficient string index and search algorithm for edit distance with moves.
Dagstuhl Reports 2016
- Computation over Compressed Structured Data (Dagstuhl Seminar 16431).
Discret. Appl. Math. 2016
- A computational substantiation of the d-step approach to the number of distinct squares problem.
- A multiobjective optimization algorithm for the weighted LCS.
- A note on easy and efficient computation of full abelian periods of a word.
- Closed factorization.
- Computing covers using prefix tables.
- Editorial: Stringology Algorithms.
- New tabulation and sparse dynamic programming based techniques for sequence similarity problems.
- Optimal partitioning of data chunks in deduplication systems.
- Random access to Fibonacci encoded files.
- Sequence binary decision diagram: Minimization, relationship to acyclic automata, and complexities of Boolean set operations.
- Similarity based deduplication with small data chunks.
- The New Periodicity Lemma revisited.
- The power of greedy algorithms for approximating Max-ATSP, Cyclic Cover, and superstrings.
- Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data.
IEEE Trans. Parallel Distributed Syst. 2016
- A Survey Of Architectural Approaches for Data Compression in Cache and Main Memory Systems.
Inf. 2016
- Lazy Management for Frequency Table on Hardware-Based Stream Lossless Data Compression.
Inf. Process. Lett. 2016
- Computing runs on a general alphabet.
- On the greedy algorithm for the Shortest Common Superstring problem with reversals.
Inf. Syst. 2016
- Aggregated 2D range queries on clustered points.
- New dynamic metric indices for secondary memory.
- Practical compressed string dictionaries.
- Universal indexes for highly repetitive document collections.
J. Discrete Algorithms 2016
- An improved algorithm for the all-pairs suffix-prefix problem.
SIAM J. Discret. Math. 2016
- Efficient Ranking of Lyndon Words and Decoding Lexicographically Minimal de Bruijn Sequence.
Softw. Pract. Exp. 2016
- Improving a lightweight LZ77 computation algorithm for running faster.
Theor. Comput. Sci. 2016
- A really simple approximation of smallest grammar.
- Computing minimal and maximal suffixes of a substring.
- Document retrieval with one wildcard.
- Dynamic range majority data structures.
- Fast computation of abelian runs.
- Fast construction of wavelet trees.
- Faster Lyndon factorization algorithms for SLP and LZ78 compressed text.
- Finding the leftmost critical factorization on unordered alphabet.
- Generalized pattern matching and periodicity under substring consistent equivalence relations.
- Linear-time computation of prefix table for weighted strings & applications.
- Linear-time superbubble identification algorithm for genome assembly.
- Longest common extensions in trees.
- Maximum number of distinct and nonequivalent nonstandard squares in a word.
- Order-preserving indexing.
- Order-preserving pattern matching with k mismatches.
- Permuted scaled matching.
- Reporting consecutive substring occurrences under bounded gap constraints.
- Simple and efficient fully-functional succinct trees.
- Tighter bounds for the sum of irreducible LCP values.