CMU-CS-09-159
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-09-159

Fast Cache for Your Text: Accelerating Exact
Pattern matching with Feed-Forward Bloom Filters

Iulian Moraru, David G. Andersen

September 2009

CMU-CS-09-159.pdf


Keywords: Feed-forward Bloom filter, text scanning, cache efficient

This paper presents an algorithm for exact pattern matching based on a new type of Bloom filter that we call a feed-forward Bloom filter. Besides filtering the input corpus, a feed-forward Bloom filter is also able to reduce the set of patterns needed for the exact matching phase. We show that this technique, along with a CPU architecture aware design of the Bloom filter, can provide speedups between 2x and 30x, and memory consumption reductions as large as 50x when compared with grep, while the filtering speed can be as much as 5x higher than that of a normal Bloom filters. 26 pages


Return to: SCS Technical Report Collection
School of Computer Science

This page maintained by reports@cs.cmu.edu