Aleph-w 3.0
A C++ Library for Data Structures and Algorithms
SimHash for estimating cosine similarity between sets of features.
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <bit>
#include <ah-errors.H>
#include <tpl_array.H>
#include <hash-fct.H>
Classes
class Aleph::SimHash< T >
    SimHash fingerprint generator.
Namespaces
namespace Aleph
    Main namespace for Aleph-w library functions.
Provides the SimHash class for computing bitwise fingerprints of weighted feature sets. SimHash is particularly useful for detecting near-duplicates in documents or large datasets.
Definition in file simhash.H.