bloom-filter-new

1.0.2 • Public • Published

bloom-filter-new

A simple Bloom Filter implementation in JavaScript (ES6). Bloom Filters are probabilistic data structures that allow you to test if an element is in a set with a small chance of false positives but no false negatives.


Table of Contents

  1. Introduction
  2. Installation
  3. Usage
  4. API Reference
  5. How It Works
  6. License

Introduction

Bloom Filters are useful when you need to:

  • Check for set membership quickly.
  • Optimize memory usage for large datasets where traditional data structures (like sets) are too memory-intensive.
  • Tolerate a small rate of false positives (incorrectly indicating that an element is present) while ensuring no false negatives (correctly indicating that an element is definitely not present).

This implementation allows you to:

  • Create a Bloom Filter with a customizable false positive rate.
  • Add elements to the filter.
  • Check if an element might be present.
  • Load elements from a .txt file asynchronously.

Installation

Install the package via npm:

npm install bloom-filter-new

Usage

Importing the Bloom Filter

import BloomFilter from 'bloom-filter-new';

Creating a Bloom Filter

Create a new Bloom Filter with a specified number of expected items and a false positive rate.

const bloom = new BloomFilter(1000, 0.01); // 1000 expected items, 1% false positive rate

Adding Elements

Add elements to the Bloom Filter using the add method.

bloom.add('apple');
bloom.add('banana');

Checking for Elements

Check if an element might be in the filter using the contains method.

console.log(bloom.contains('apple'));    // true (might be present)
console.log(bloom.contains('grape'));    // false (definitely not present)

Loading Words from a File

If you have a .txt file with one word per line, you can load these words asynchronously.

Example words.txt:

apple
banana
cherry
(async () => {
    const bloom = await BloomFilter.fromFile('words.txt', 1000, 0.01);
    console.log(bloom.contains('apple'));  // true
    console.log(bloom.contains('grape'));  // false
})();

Getting Bloom Filter Info

Retrieve information about the Bloom Filter, such as size, number of hash functions, and fill percentage.

console.log(bloom.getInfo());

Example Output:

{
  "expectedItems": 1000,
  "itemsAdded": 3,
  "size": 9586,
  "hashCount": 7,
  "falsePositiveRate": 0.01,
  "filledBits": 21,
  "fillPercentage": "0.22%"
}

API Reference

constructor(expectedItems = 1000, falsePositiveRate = 0.01)

Creates a new Bloom Filter.

  • expectedItems: Number of items expected to be stored.
  • falsePositiveRate: Desired false positive rate (e.g., 0.01 for 1%).

add(value)

Adds a string value to the Bloom Filter.

bloom.add('example');

contains(value)

Checks if a string value might be in the Bloom Filter.

Returns true if the value might be present, otherwise false.

bloom.contains('example'); // true or false

async loadFromFile(filePath)

Loads words from a .txt file and adds them to the Bloom Filter.

await bloom.loadFromFile('words.txt');

getInfo()

Returns detailed information about the Bloom Filter.

console.log(bloom.getInfo());

static async fromFile(filePath, expectedItems = 1000, falsePositiveRate = 0.01)

Creates and loads a Bloom Filter from a .txt file.

const bloom = await BloomFilter.fromFile('words.txt');

How It Works

A Bloom Filter uses multiple hash functions to set bits in a fixed-size bit array. When checking for an element, the same hash functions are used to verify if all the corresponding bits are set:

  • False Positives: The filter may return true for elements not in the set due to hash collisions.
  • False Negatives: The filter never returns false for elements that were added.

Formulae Used

  1. Optimal Bit Array Size (m):

    m = -(n * log(p)) / (log(2) ^ 2)
    

    Where:

    • n = Expected number of items
    • p = False positive rate
  2. Optimal Number of Hash Functions (k):

    k = (m / n) * log(2)
    

License

This project is licensed under the MIT License.


Happy filtering! 🚀

Dependents (0)

Package Sidebar

Install

npm i bloom-filter-new

Weekly Downloads

4

Version

1.0.2

License

MIT

Unpacked Size

11.1 kB

Total Files

5

Last publish

Collaborators

  • ravipawar2024