Papers for stringologist (2015)
Contents
- A Data-Aware FM-index.
- Faster Linear-space Orthogonal Range Searching in Arbitrary Dimensions.
- Improved Single-Term Top-k Document Retrieval.
- Fast and efficient compression of high-throughput sequencing reads.
- Stream-Based Lossless Data Compression Hardware Using Adaptive Frequency Table Management.
- An Opportunistic Text Indexing Structure Based on Run Length Encoding.
- A Framework for Space-Efficient String Kernels.
- A Probabilistic Analysis of the Reduction Ratio in the Suffix-Array IS-Algorithm.
- Alphabet-Dependent String Searching with Wexponential Search Trees.
- Combinatorial RNA Design: Designability and Structure-Approximating Algorithm.
- Compact Indexes for Flexible Top- k k Retrieval.
- Composite Repetition-Aware Data Structures.
- Dictionary Matching with Uneven Gaps.
- Efficient Construction of a Compressed de Bruijn Graph for Pan-Genome Analysis.
- Encoding Nearest Larger Values.
- Encodings of Range Maximum-Sum Segment Queries and Applications.
- Fast String Dictionary Lookup with One Error.
- Greedy Conjecture for Strings of Length 4.
- Improved Algorithms for the Boxed-Mesh Permutation Pattern Matching Problem.
- LZD Factorization: Simple and Practical Online Grammar Compression with Variable-to-Fixed Encoding.
- Lempel Ziv Computation in Small Space (LZ-CISS).
- Longest Common Extensions in Sublinear Space.
- Longest Common Extensions in Trees.
- On Maximal Unbordered Factors.
- On the Fixed Parameter Tractability and Approximability of the Minimum Error Correction Problem.
- On the Hardness of Optimal Vertex Relabeling and Restricted Vertex Relabeling.
- On the Readability of Overlap Digraphs.
- Online Detection of Repetitions with Backtracking.
- Parallel External Memory Suffix Sorting.
- Parameterized Complexity of Superstring Problems.
- Partition into Heapable Sequences, Heap Tableaux and a Multiset Extension of Hammersley’s Process.
- Range Minimum Query Indexes in Higher Dimensions.
- Ranked Document Retrieval with Forbidden Pattern.
- Reporting Consecutive Substring Occurrences Under Bounded Gap Constraints.
- Semi-dynamic Compact Index for Short Patterns and Succinct van Emde Boas Tree.
- Sorting by Cuts, Joins and Whole Chromosome Duplications.
- String Powers in Trees.
- Succinct Non-overlapping Indexing.
- The Approximability of Maximum Rooted Triplets Consistency with Fan Triplets and Forbidden Triplets.
- Tighter Bounds for the Sum of Irreducible LCP Values.
- Bi-Directional Context Modeling with Combinatorial Structuring for Genome Sequence Compression.
- Compressing Yahoo Mail.
- Compression for Similarity Identification: Computing the Error Exponent.
- Compression of Next Generation Sequencing Data.
- Compression-Aware Algorithms for Massive Datasets.
- Data Compression Cost Optimization.
- Document Counting in Compressed Space.
- Efficient Set Operations over k2-Trees.
- Enhanced Direct Access to Huffman Encoded Files.
- Faster Compressed Quadtrees.
- Geometric Compression of Orientation Signals for Fast Gesture Analysis.
- Improving PPM with Dynamic Parameter Updates.
- Incremental Locality and Clustering-Based Compression.
- On Probability Estimation via Relative Frequencies and Discount.
- OnlineRePair: A Recompressor for XML Structures.
- Parallel Wavelet Tree Construction.
- Queries on LZ-Bounded Encodings.
- Range Selection Queries in Data Aware Space and Time.
- Serializing RDF in Compressed Space.
- Smaller and Faster: Parallel Processing of Compressed Graphs with Ligra+.
- Universal Compression of Memoryless Sources over Large Alphabets via Independent Component Analysis.
- Variable-Order de Bruijn Graphs.
- Longest Previous Non-overlapping Factors Computation.
- Diverse Palindromic Factorization Is NP-complete.
- Grammar-Based Tree Compression.
- Squareable Words.
- State Complexity of Neighbourhoods and Approximate Pattern Matching.
- Transfinite Lyndon Words.
- Unary Patterns with Permutations.
- Access, Rank, and Select in Grammar-compressed Strings.
- Approximating LZ77 via Small-Space Multiple-Pattern Matching.
- Compressed Data Structures for Dynamic Sequences.
- Dictionary Matching in a Stream.
- Longest α-Gapped Repeat and Palindrome.
- Pattern-Avoiding Access in Binary Search Trees.
- Quadratic Conditional Lower Bounds for String Problems and Dynamic Time Warping.
- Tight Hardness Results for LCS and Other Sequence Similarity Measures.
- Hollow Heaps.
- Optimal Encodings for Range Top- k k , Selection, and Min-Max.
- Replacing Mark Bits with Randomness in Fibonacci Heaps.
- An In-place Framework for Exact and Approximate Shortest Unique Substring Queries.
- Inferring Strings from Full Abelian Periods.
- Multidimensional Range Selection.
- On the Succinct Representation of Unlabeled Permutations.
- Optimal Search Trees with 2-Way Comparisons.
- Computing the BWT and the LCP Array in Constant Space.
- EERTREE: An Efficient Data Structure for Processing Palindromes in Strings.
- Longest Common Extensions in Partial Words.
- Average Linear Time and Compressed Space Construction of the Burrows-Wheeler Transform.
- Average-Case Optimal Approximate Circular String Matching.
- Backward Linearised Tree Pattern Matching.
- Compressed Data Structures for Range Searching.
- Construction of a de Bruijn Graph for Assembly from a Truncated Suffix Tree.
- Coverability in Two Dimensions.
- Equation x^iy^jx^k=u^iv^ju^k in Words.
- On the Language of Primitive Partial Words.
- On the Number of Closed Factors in a Word.
- Online Computation of Abelian Runs.
- Square-Free Words over Partially Commutative Alphabets.
- Longest Gapped Repeats and Palindromes.
- Strong Inapproximability of the Shortest Reset Word.
- Faster Lightweight Lempel-Ziv Parsing.
- Parallelising the Computation of Minimal Absent Words.
- A Bulk-Parallel Priority Queue in External Memory with STXXL.
- Huffman Codes versus Augmented Non-Prefix-Free Codes.
- Parallel Construction of Succinct Trees.
- Tree Compression with Top Trees Revisited.
- A new characterization of maximal repetitions by Lyndon trees.
- Approximate Range Emptiness in Constant Time and Optimal Space.
- Cell-probe bounds for online edit distance and other pattern matching problems.
- Internal Pattern Matching Queries in a Text and Applications.
- The amortized cost of finding the minimum.
- Wavelet Trees Meet Suffix Trees.
- A Compact RDF Store Using Suffix Arrays.
- A Faster Algorithm for Computing Maximal \alpha -gapped Repeats in a String.
- Adaptive Computation of the Swap-Insert Correction Distance.
- Assessing the Efficiency of Suffix Stripping Approaches for Portuguese Stemming.
- Beyond the Runs Theorem.
- Chaining Fragments in Sequences: to Sweep or Not (Extended Abstract).
- Computing the Longest Unbordered Substring.
- DeShaTo: Describing the Shape of Cumulative Topic Distributions to Rank Retrieval Systems Without Relevance Judgments.
- Efficient Algorithms for Longest Closed Factor Array.
- Efficient Term Set Prediction Using the Bell-Wigner Inequality.
- Evaluating Geographical Knowledge Re-Ranking, Linguistic Processing and Query Expansion Techniques for Geographical Information Retrieval.
- Fast Online Lempel-Ziv Factorization in Compressed Space.
- Faster Exact Search Using Document Clustering.
- Feasibility of Word Difficulty Prediction.
- Filtration Algorithms for Approximate Order-Preserving Matching.
- Fishing in Read Collections: Memory Efficient Indexing for Sequence Assembly.
- How Big is that Genome? Estimating Genome Size and Coverage from k-mer Abundance Spectra.
- Improved Practical Compact Dynamic Tries.
- Induced Sorting Suffixes in External Memory with Better Design and Less Space.
- Longest Common Prefix with Mismatches.
- On Prefix/Suffix-Square Free Words.
- Online Self-Indexed Grammar Compression.
- Parallel Construction of Succinct Representations of Suffix Tree Topologies.
- Prefix and Suffix Reversals on Strings.
- Range LCP Queries Revisited.
- Relative Select.
- Sampling the Suffix Array with Minimizers.
- Selective Labeling and Incomplete Label Mitigation for Low-Cost Evaluation.
- ShRkC: Shard Rank Cutoff Prediction for Selective Search.
- Space-Efficient Detection of Unusual Words.
- Temporal Analysis of CHAVE Collection.
- Temporal Query Classification at Different Granularities.
- Tight Bound for the Number of Distinct Palindromes in a Tree.
- Transforming XML Streams with References.
- Lempel-Ziv Factorization May Be Harder Than Computing All Runs.
- Pattern Matching with Variables: Fast Algorithms and New Hardness Results.
- Space-efficient Basic Graph Algorithms.
- Edit Distance Cannot Be Computed in Strongly Subquadratic Time (unless SETH is false).
- A Faster Longest Common Extension Algorithm on Compressed Strings and its Applications.
- A Formal Framework for Stringology.
- Alternative Algorithms for Order-Preserving Matching.
- An Efficient Skip-Search Approach to the Order-Preserving Pattern Matching Problem.
- Combinatorics of the Interrupted Period.
- Computing Left-Right Maximal Generic Words.
- Controlling the Chunk-Size in Deduplication Systems.
- Efficient Algorithm for δ-Approximate Jumbled Pattern Matching.
- Enhanced Extraction from Huffman Encoded Files.
- Parameterized Matching: Solutions and Extensions.
- Quantum Leap Pattern Matching.
- Refined Tagging of Complex Verbal Phrases for the Italian Language.
- Tuning Algorithms for Jumbled Matching.
- Bloom Filter Trie - A Data Structure for Pan-Genome Storage.
- Circular Sequence Comparison with q-grams.
- Optimizing Read Reversals for Sequence Compression - (Extended Abstract).
- Universal Reconstruction of a String.
- A Practical Succinct Data Structure for Tree-Like Graphs.
- Non-repetitive Strings over Alphabet Lists.
- Arithmetics on Suffix Arrays of Fibonacci Words.
- Linear-Time Computation of Prefix Table for Weighted Strings.
- Compressed Indexes for String Searching in Labeled Graphs.
ACM Trans. Algorithms 2015
- Optimal Lower and Upper Bounds for Representing Sequences.
Algorithmica 2015
- Binary Jumbled Pattern Matching on Trees and Tree-Like Structures.
- Fast Algorithm for Partial Covers in Words.
- Improved Space-Time Tradeoffs for Approximate Full-Text Indexing with One Edit Error.
- Linked Dynamic Tries with Applications to LZ-Compression in Sublinear Time and Space.
- Space-Time Trade-offs for Stack-Based Algorithms.
- Fast randomized approximate string matching with succinct hash data structures.
- MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.
- Ultrafast SNP analysis using the Burrows-Wheeler transform of short-read data.
CoRR 2015
- A Bloom filter based semi-index on q-grams.
- A Compressed-Gap Data-Aware Measure for Indexable Dictionaries.
- A Fast Heuristic for Exact String Matching.
- A Lower Bound on Supporting Predecessor Search in k sorted Arrays.
- A Note on Easy and Efficient Computation of Full Abelian Periods of a Word.
- A Practical O(Rlog log n+n) time Algorithm for Computing the Longest Common Subsequence.
- A Quadratic Assignment Formulation of the Graph Edit Distance.
- A Review on the Tree Edit Distance Problem and Related Path-Decomposition Algorithms.
- A Study on Splay Trees.
- A framework for space-efficient string kernels.
- A note on the longest common Abelian factor problem.
- A numerical analysis of Quicksort: How many cases are bad cases?
- Adaptive Search over Sorted Sets.
- Algorithms for Longest Common Abelian Factors.
- Amortized Rotation Cost in AVL Trees.
- An Efficient Dynamic Programming Algorithm for STR-IC-SEQ-EC-LCS Problem.
- An efficient dynamic programming algorithm for the generalized LCS problem with multiple substring inclusive constraints.
- Approximating LZ77 in Small Space.
- Approximating LZ77 via Small-Space Multiple-Pattern Matching.
- Binary Coding in Stream.
- Burrows-Wheeler transform for terabases.
- Communication Complexity (for Algorithm Designers).
- Composite repetition-aware data structures.
- Compressed Data Structures for Dynamic Sequences.
- Compressed Tree Canonization.
- Computing LZ77 in Run-Compressed Space.
- Computing Runs on a General Alphabet.
- Constructing LZ78 Tries and Position Heaps in Linear Time for Large Alphabets.
- Deterministic Sparse Suffix Sorting on Rewritable Texts.
- Dictionary matching in a stream.
- Diverse Palindromic Factorization is NP-Complete.
- Dual pivot Quicksort.
- Dynamic Data Structures for Document Collections and Graphs.
- Dynamic Relative Compression.
- Dynamic concurrent van Emde Boas array.
- Dynamic index, LZ factorization, and LCE queries in compressed space.
- Efficient Algorithms for the Order Preserving Pattern Matching Problem.
- Efficient Deterministic Single Round Document Exchange for Edit Distance.
- Efficient Ranking of Lyndon Words and Decoding Lexicographically Minimal de Bruijn Sequence.
- Efficiently Finding All Maximal $α$-gapped Repeats.
- Enhanced Covers of Regular & Indeterminate Strings using Prefix Tables.
- Error Tree: A Tree Structure for Hamming & Edit Distances & Wildcards Matching.
- External Memory Three-Sided Range Reporting and Top-$k$ Queries with Sublogarithmic Updates.
- FM-index for dummies.
- Fast Algorithms for Exact String Matching.
- Fast Algorithms for Finding Pattern Avoiders and Counting Pattern Occurrences in Permutations.
- Fast Average-Case Pattern Matching on Weighted Sequences.
- Fast Computation of Abelian Runs.
- Fast and Powerful Hashing using Tabulation.
- Fast and Vectorizable Alternatives to Binary Search.
- Faster Lightweight Lempel-Ziv Parsing.
- Finding the Leftmost Critical Factorization on Unordered Alphabet.
- Finger Search, Random Access, and Longest Common Extensions in Grammar-Compressed Strings.
- Full-text and Keyword Indexes for String Searching.
- Fully-online construction of suffix trees and DAWGs for multiple texts.
- How Good is Multi-Pivot Quicksort?
- Implementation of BT-trees.
- Layered Heaps Beating Standard and Fibonacci Heaps in Practice.
- Lempel Ziv Computation In Compressed Space (LZ-CICS).
- Lempel Ziv Computation In Small Space (LZ-CISS).
- Linear Algorithm for Conservative Degenerate Pattern Matching.
- Linear Algorithms for Computing the Lyndon Border Array and the Lyndon Suffix Array.
- Linear-Time Sequence Comparison Using Minimal Absent Words & Applications.
- Linear-Time Superbubble Identification Algorithm.
- Longest Common Extensions in Sublinear Space.
- Longest Gapped Repeats and Palindromes.
- Lower bounds for approximation schemes for Closest String.
- Mespotine-RLE-basic v0.9 - An overhead-reduced and improved Run-Length-Encoding Method.
- Multiple sequence alignment for short sequences.
- On Longest Repeat Queries.
- On Maximal Unbordered Factors.
- On The Average-Case Complexity of Shellsort.
- On the Greedy Algorithm for the Shortest Common Superstring Problem with Reversals.
- Online Computation of Abelian Runs.
- Online Dictionary Matching with One Gap.
- Online Self-Indexed Grammar Compression.
- Optimal Dynamic Strings.
- Optimal search trees with equality tests.
- Parallel Query in the Suffix Tree.
- Parameterized Complexity of Superstring Problems.
- Pattern-avoiding access in binary search trees.
- Permutations sortable by two stacks in series.
- Practical Concurrent Priority Queues.
- Probabilistic Threshold Indexing for Uncertain Strings.
- Quadratic Conditional Lower Bounds for String Problems and Dynamic Time Warping.
- Quadratic-Time Hardness of LCS and other Sequence Similarity Measures.
- Range Predecessor and Lempel-Ziv Parsing.
- Read Mapping on de Bruijn graph.
- Relative Compressed Suffix Trees.
- Relative Select.
- Space-efficient detection of unusual words.
- Strictly Implicit Priority Queues: On the Number of Moves and Worst-Case Time.
- Subsequence Automata with Default Transitions.
- Testing k-binomial equivalence.
- The Complexity of Pattern Matching for 321-Avoiding and Skew-Merged Permutations.
- The complexity of computation in bit streams.
- The k-mismatch problem revisited.
- Traversing Grammar-Compressed Trees with Constant Delay.
- Tree Compression with Top Trees Revisited.
- Tree compression using string grammars.
- Triple State QuickSort, A replacement for the C/C++ library qsort.
IEEE Trans. Inf. Theory 2015
- Efficient and Compact Representations of Prefix Codes.
Inf. Comput. 2015
- Approximate periodicity.
- Detecting regularities on grammar-compressed strings.
- Tree compression with top trees.
Inf. Process. Lett. 2015
- Constructing LZ78 tries and position heaps in linear time for large alphabets.
Inf. Syst. 2015
- The wavelet matrix: An efficient wavelet tree for large alphabets.
J. Discrete Algorithms 2015
- A suffix tree or not a suffix tree?
- An efficient Variable-to-Fixed length encoding using multiplexed parse trees.
- Approximate pattern matching in LZ77-compressed texts.
- Bottom-k document retrieval.
- Computing the Burrows-Wheeler transform in place and in small space.
- Dynamic edit distance table under a general weighted cost function.
- Improved and extended locating functionality on compressed suffix arrays.
Knowl. Inf. Syst. 2015
- Compressed vertical partitioning for efficient RDF management.
SIAM J. Comput. 2015
- Random Access to Grammar-Compressed Strings and Trees.
SIAM J. Discret. Math. 2015
- String Reconstruction from Substring Compositions.
Softw. Pract. Exp. 2015
- Fast in-memory XPath search using compressed indexes.
Theor. Comput. Sci. 2015
- Compressed automata for dictionary matching.
- Dictionary matching with a few gaps.
- Global and local sequence alignment with a bounded number of gaps.
- On hardness of several string indexing problems.