Aleph-w 3.0
A C++ Library for Data Structures and Algorithms
simhash.H File Reference

SimHash for estimating cosine similarity between sets of features.

#include <algorithm>
#include <cmath>
#include <cstdint>
#include <bit>
#include <ah-errors.H>
#include <tpl_array.H>
#include <hash-fct.H>


Classes

class  Aleph::SimHash< T >
 SimHash fingerprint generator.
 

Namespaces

namespace  Aleph
 Main namespace for Aleph-w library functions.
 

Detailed Description

SimHash for estimating cosine similarity between sets of features.

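Provides the SimHash class for computing fixed-width bit fingerprints of weighted feature sets. Feature sets with high cosine similarity tend to produce fingerprints that differ in only a few bits, so a small Hamming distance between two fingerprints indicates likely near-duplicates. This makes SimHash particularly useful for detecting near-duplicate documents in large datasets.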
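The general technique can be illustrated with a minimal, standard-library-only sketch. This is not the Aleph::SimHash interface, which is not shown on this page; the names fnv1a64, WeightedFeatures, simhash_sketch, hamming_distance and estimated_cosine are hypothetical, and the 64-bit width and the FNV-1a hash are assumptions made only to keep the example deterministic. Each feature's hash votes on every bit position with the feature's weight, the sign of each per-bit total becomes one fingerprint bit, and the cosine is estimated from the Hamming distance d as cos(pi * d / 64).

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Hypothetical helper: 64-bit FNV-1a, used here only because it is
// deterministic. It is an assumption, not the hash used by Aleph-w.
inline std::uint64_t fnv1a64(std::string_view s)
{
  std::uint64_t h = 14695981039346656037ULL; // FNV offset basis
  for (unsigned char c : s)
    {
      h ^= c;
      h *= 1099511628211ULL; // FNV prime
    }
  return h;
}

// Hypothetical input type: (feature, weight) pairs.
using WeightedFeatures = std::vector<std::pair<std::string, double>>;

// Builds a 64-bit fingerprint: every feature adds +weight to the bit
// positions set in its hash and -weight to the others; a bit of the
// result is 1 when its accumulated total is strictly positive.
inline std::uint64_t simhash_sketch(const WeightedFeatures & features)
{
  std::array<double, 64> acc{};
  for (const auto & [feature, weight] : features)
    {
      assert(std::isfinite(weight));
      const std::uint64_t h = fnv1a64(feature);
      for (int bit = 0; bit < 64; ++bit)
        acc[bit] += ((h >> bit) & 1ULL) ? weight : -weight;
    }

  std::uint64_t fingerprint = 0;
  for (int bit = 0; bit < 64; ++bit)
    if (acc[bit] > 0.0)
      fingerprint |= (1ULL << bit);
  return fingerprint;
}

// Number of differing bits. C++20's std::popcount(a ^ b) from <bit>
// computes the same value; the loop keeps the sketch portable.
inline int hamming_distance(std::uint64_t a, std::uint64_t b)
{
  int count = 0;
  for (std::uint64_t x = a ^ b; x != 0; x &= x - 1)
    ++count;
  return count;
}

// Estimates cosine similarity from the fraction of differing bits,
// treating that fraction as angle / pi.
inline double estimated_cosine(std::uint64_t a, std::uint64_t b)
{
  const double pi = std::acos(-1.0);
  return std::cos(pi * hamming_distance(a, b) / 64.0);
}
```

Under these assumptions, two documents would be flagged as near-duplicates when hamming_distance of their fingerprints falls below a threshold chosen by the caller. Multiplying all weights by the same positive constant leaves the fingerprint unchanged, because only the sign of each per-bit total matters.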

Author
Leandro Rabindranath León

Definition in file simhash.H.