Papers for stringologist (2010)
Contents
- Succinct Trees in Practice.
- A Parallel Algorithm for Fixed-Length Approximate String-Matching with k-mismatches.
- Extended Compact Web Graph Representations.
- Unified View of Backward Backtracking in Short Read Mapping.
- Compressed q-Gram Indexing for Highly Repetitive Biological Sequences.
- An algorithm for mapping short reads to a dynamically changing genomic sequence.
- A Compact Representation of Nondeterministic (Suffix) Automata for the Bit-Parallel Approach.
- A Minimal Periods Algorithm with Applications.
- Affine Image Matching Is Uniform TC0-Complete.
- Algorithms for Forest Pattern Matching.
- Algorithms for Three Versions of the Shortest Common Superstring Problem.
- Approximate All-Pairs Suffix/Prefix Overlaps.
- Bidirectional Search in a String with Wavelet Trees.
- Bounds on the Minimum Mosaic of Population Sequences under Recombination.
- Breakpoint Distance and PQ-Trees.
- Building the Minimal Automaton of A*X in Linear Time, When X Is of Bounded Cardinality.
- Compression, Indexing, and Retrieval for Massive String Data.
- Cover Array String Reconstruction.
- Extended Islands of Tractability for Parsimony Haplotyping.
- Extension and Faster Implementation of the GRP Transform for Lossless Compression.
- Faster Computation of the Robinson-Foulds Distance between Phylogenetic Networks.
- Finding Optimal Alignment and Consensus of Circular Strings.
- Implicit Hitting Set Problems and Multi-genome Alignment.
- Mod/Resc Parsimony Inference.
- Old and New in Stringology.
- On the Parameterized Complexity of Some Optimization Problems Related to Multiple-Interval Graphs.
- Optimizing Restriction Site Placement for Synthetic Genomes.
- Parallel and Distributed Compressed Indexes.
- Phylogeny- and Parsimony-Based Haplotype Inference with Constraints.
- Pseudo-realtime Pattern Matching: Closing the Gap.
- Sampled Longest Common Prefix Array.
- Small-Space 2D Compressed Dictionary Matching.
- Succinct Dictionary Matching with No Slowdown.
- Succinct Representations of Separable Graphs.
- The Highest Expected Reward Decoding for HMMs with Application to Recombination Detection.
- The Property Suffix Tree with Dynamic Properties.
- Verifying a Parameterized Border Array in O(n1.5) Time.
- Validating the Knuth-Morris-Pratt Failure Function, Fast and Online.
- An Improved Algorithm for Extracting Research Communities from Bibliographic Data.
- A New Searchable Variable-to-Variable Compressor.
- A Pseudo-Random Number Generator Based on LZSS.
- A Similarity Measure Using Smallest Context-Free Grammars.
- Advantages of Shared Data Structures for Sequences of Balanced Parentheses.
- Bidirectional Delta Files.
- Efficient Algorithms for Constructing Optimal Bi-directional Context Sets.
- File-Size Preserving LZ Encoding for Reversible Data Embedding.
- I/O-Efficient Compressed Text Indexes: From Theory to Practice.
- LZ77-Like Compression with Fast Random Access.
- Local Modeling for WebGraph Compression.
- Lossless Compression of Maps, Charts, and Graphs via Color Separation.
- Lossless Data Compression via Substring Enumeration.
- Modelling Parallel Texts for Boosting Compression.
- Neural Markovian Predictive Compression: An Algorithm for Online Lossless Data Compression.
- On Computation of Performance Bounds of Optimal Index Assignment.
- Optimum String Match Choices in LZSS.
- Xampling: Analog Data Compression.
- gFPC: A Self-Tuning Compression Algorithm.
- Sparse Substring Pattern Set Discovery Using Linear Programming Boosting.
- Range Queries over a Compact Representation of Minimum Bounding Rectangles.
- A Fully Compressed Algorithm for Computing the Edit Distance of Run-Length Encoded Strings.
- Fast Prefix Search in Little Space, with Applications.
- Medium-Space Algorithms for Inverse BWT.
- On the Huffman and Alphabetic Tree Problem with General Cost Functions.
- Data Structures: Time, I/Os, Entropy, Joules!
- On Space Efficient Two Dimensional Range Minimum Data Structures.
- Top-k Ranked Document Search in General Text Databases.
- A Lower Bound for Dynamic Approximate Membership Data Structures.
- A Fun Application of Compact Data Structures to Indexing Geographic Data.
- Interval Sorting.
- Mergeable Dictionaries.
- Optimal Trade-Offs for Succinct String Indexes.
- Fast in-memory XPath search using compressed indexes.
- Priority Range Trees.
- Should Static Search Trees Ever Be Unbalanced?
- Alphabet Partitioning for Compressed Rank/Select and Applications.
- Dynamic Range Reporting in External Memory.
- Efficient Indexes for the Positional Pattern Matching Problem and Two Related Problems over Small Alphabets.
- Entropy-Bounded Representation of Point Grids.
- Identifying Approximate Palindromes in Run-Length Encoded Strings.
- Dictionary-Symbolwise Flexible Parsing.
- Minimum Number of Holes in Unavoidable Sets of Partial Words of Size Three.
- On the Maximal Sum of Exponents of Runsin a String.
- Skip Lift: A Probabilistic Alternative to Red-Black Trees.
- Worst Case Efficient Single and Multiple String Matching in the RAM Model.
- A Fast Longest Common Subsequence Algorithm for Similar Strings.
- Abelian Square-Free Partial Words.
- Avoidable Binary Patterns in Partial Words.
- Choosing Word Occurrences for the Smallest Grammar Problem.
- Extending Stochastic Context-Free Grammars for an Application in Bioinformatics.
- Grammar-Based Compression in a Streaming Model.
- Hard Counting Problems for Partial Words.
- Compact Rich-Functional Binary Relation Representations.
- Fast Set Intersection and Two-Patterns Matching.
- Lightweight Data Indexing and Compression in External Memory.
- Optimal Succinctness for Range Minimum Queries.
- Sharp Separation and Applications to Exact and Parameterized Algorithms.
- Counting Dependent and Independent Strings.
- Bit-Parallel Search Algorithms for Long Patterns.
- Experiments on Union-Find Algorithms for the Disjoint-Set Data Structure.
- Practical Compressed Suffix Trees.
- Cache-Oblivious Dynamic Dictionaries with Update/Query Tradeoffs.
- Cell-Probe Lower Bounds for Succinct Partial Sums.
- Counting Inversions, Offline Orthogonal Range Counting, and Related Problems.
- Data Structures for Range Minimum Queries in Multidimensional Arrays.
- Data-Specific Analysis of String Sorting.
- Deletion Without Rebalancing in Balanced Binary Trees.
- Fully-Functional Succinct Trees.
- Lower Bounds for Edit Distance and Product Metrics via Poincaré-Type Inequalities.
- On the Cell Probe Complexity of Dynamic Membership.
- Randomized Shellsort: A Simple Oblivious Sorting Algorithm.
- Regular Expression Matching with Multi-Strings and Intervals.
- Dynamic Edit Distance Table under a General Weighted Cost Function.
- Efficient Algorithms for Two Extensions of LPF Table: The Power of Suffix Arrays.
- Fast Arc-Annotated Subsequence Matching in Linear Space.
- Fast and Compact Prefix Codes.
- A PTAS for the Square Tiling Problem.
- A Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles.
- Algorithms for Finding a Minimum Repetition Representation of a String.
- Approximate String Matching with Stuck Address Bits.
- CST++.
- Colored Range Queries and Document Retrieval.
- Compressed Self-indices Supporting Conjunctive Queries on Document Collections.
- Computing Matching Statistics and Maximal Exact Matches on Compressed Full-Text Indexes.
- Counting and Verifying Maximal Palindromes.
- Dual-Sorted Inverted Lists.
- Dynamic Z-Fast Tries.
- Evaluation of Query Performance Prediction Methods by Range.
- Extracting Powers and Periods in a String from Its Runs Structure.
- Fast Bit-Parallel Matching for Network and Regular Expressions.
- Faster Compressed Dictionary Matching.
- Fingerprinting Ratings for Collaborative Filtering - Theoretical and Empirical Analysis.
- Finite Automata Based Algorithms for the Generalized Constrained Longest Common Subsequence Problems.
- Hypergeometric Language Model and Zipf-Like Scoring Function for Web Document Similarity Retrieval.
- Identifying SNPs without a Reference Genome by Comparing Raw Reads.
- Improved Fast Similarity Search in Dictionaries.
- Incremental Algorithms for Effective and Efficient Query Recommendation.
- Mining Large Query Induced Graphs towards a Hierarchical Query Folksonomy.
- Multiplication Algorithms for Monge Matrices.
- On Shortest Common Superstring and Swap Permutations.
- On Tag Spell Checking.
- On the Hardness of Counting and Sampling Center Strings.
- Parameterized Searching with Mismatches for Run-Length Encoded Strings - (Extended Abstract).
- Querying the Web Graph - (Invited Talk).
- Range Queries over Untangled Chains.
- Relative Lempel-Ziv Compression of Genomes for Large-Scale Storage and Retrieval.
- Restricted LCS.
- Standard Deviation as a Query Hardness Estimator.
- String Matching with Variable Length Gaps.
- String Retrieval for Multi-pattern Queries.
- Succinct Representations of Dynamic Strings.
- Temporal Analysis of Document Collections: Framework and Applications.
- Text Comparison Using Soft Cardinality.
- The Gapped Suffix Array: A New Index Structure for Fast Approximate Matching.
- Training Parse Trees for Efficient VF Coding.
- Using Related Queries to Improve Web Search Results Ranking.
- Why Large Closest String Instances Are Easy to Solve in Practice.
- On Equations over Sets of Integers.
- An O(log log n)-Competitive Binary Search Tree with Optimal Worst-Case Access Times.
- (In)approximability Results for Pattern Matching Problems.
- A Space-Efficient Implementation of the Good-Suffix Heuristic.
- Approximate String Matching Allowing for Inversions and Translocations.
- Average Number of Runs and Squares in Necklace.
- Binary Image Compression via Monochromatic Pattern Substitution: Effectiveness and Scalability.
- Bounded Number of Squares in Infinite Repetition-Constrained Binary Words.
- Formal Characterizations of FA-based String Processors.
- Improving Automata Efficiency by Stretching and Jamming.
- Inferring Strings from Runs.
- New Simple Efficient Algorithms Computing Powers and Runs in Strings.
- On the Complexity of Variants of the k Best Strings Problem.
- Practical Fixed Length Lempel Ziv Coding.
- Reactive Links to Save Automata States.
- Simple Tree Pattern Matching for Trees in the Prefix Bar Notation.
- The Number of Runs in a Ternary Word.
- Tight and Simple Web Graph Compression.
- Tiling Binary Matrices in Haplotyping: Complexity, Models and Algorithms.
- Swiftly Computing Center Strings.
- On compressing the textual web.
ACM J. Exp. Algorithmics 2010
- Practical approaches to reduce the space requirement of lempel-ziv-based compressed text indices.
ACM Trans. Algorithms 2010
- The compressed permuterm index.
ACM Trans. Embed. Comput. Syst. 2010
- Online memory compression for embedded systems.
ACM Trans. Inf. Syst. 2010
- Dynamic lightweight text compression.
ACM Trans. Web 2010
- Fast and Compact Web Graph Representations.
Algorithmica 2010
- On Sorting, Heaps, and Minimum Spanning Trees.
Algorithms Mol. Biol. 2010
- Linear-time protein 3-D structure searching with insertions and deletions.
- Efficient construction of an assembly string graph using the FM-index.
CoRR 2010
- An Efficient Algorithm For Chinese Postman Walk on Bi-directed de Bruijn Graphs
- Compressed random access memory
- Counting Colours in Compressed Strings
- Document Clustering with K-tree
- Dynamic Range Reporting in External Memory
- Efficient Parallel and Out of Core Algorithms for Constructing Large Bi-directed de Bruijn Graphs
- Exact Analysis of Pattern Matching Algorithms with Probabilistic Arithmetic Automata
- Fully Dynamic Data Structure for Top-k Queries on Uncertain Data
- K-tree: Large Scale Document Clustering
- LRM-Trees: Compressed Indices, Adaptive Sorting, and Compressed Permutations
- Lightweight LCP-Array Construction in Linear Time
- Mergeable Dictionaries
- New Algorithms on Wavelet Trees and Applications to Information Retrieval
- On Finding Frequent Patterns in Directed Acyclic Graphs
- On the Border Length Minimization Problem (BLMP) on a Square Array
- Optimal Trade-Off for Succinct String Indexes
- Parallelization of Weighted Sequence Comparison by using EBWT
- Pattern Kits
- Permuted Common Supersequence
- Random Access to Grammar Compressed Strings
- Random Indexing K-tree
- Range Reporting for Moving Points on a Grid
- Sampled Longest Common Prefix Array
- Should Static Search Trees Ever Be Unbalanced?
- Some long-period random number generators using shifts and xors
- Succinct Data Structures for Assembling Large Genomes
- Succinct Dictionary Matching With No Slowdown
- Succinct Representations of Dynamic Strings
- The B-Skip-List: A Simpler Uniquely Represented Alternative to B-Trees
- The universality of iterated hashing over variable-length strings
- Tight and simple Web graph compression
- Top-K Color Queries for Document Retrieval
- Tree structure compression with RePair
- Unified Compression-Based Acceleration of Edit-Distance Computation
- Uses of randomness in computation
- Worst case efficient single and multiple string matching in the Word-RAM model
Discret. Appl. Math. 2010
- Sorting with networks of data structures.
IEICE Trans. Inf. Syst. 2010
- Context-Sensitive Grammar Transform: Compression and Pattern Matching.
Inf. Process. Lett. 2010
- Efficient algorithms for the all-pairs suffix-prefix problem and the all-pairs substring-prefix problem.
J. Comput. Biol. 2010
- Storage and Retrieval of Highly Repetitive Sequence Collections.
J. Discrete Algorithms 2010
- The longest common extension problem revisited and applications to approximate string searching.
Math. Comput. Sci. 2010
- Fast, Practical Algorithms for Computing All the Repeats in a String.
Theor. Comput. Sci. 2010
- Move-to-Front, Distance Coding, and Inversion Frequencies revisited.
- On compact representations of All-Pairs-Shortest-Path-Distance matrices.