MinHash

MinHash

MinHash is a technique used to quickly estimate the similarity between two sets. It was initially used in AltaVista search engine to detect duplicate web pages and has since been applied to large-scale clustering problems. It was invented by Andrei Broder in 1997.

1 courses cover this concept

CS 168: The Modern Algorithmic Toolbox

Stanford University

Spring 2022

CS 168 provides a comprehensive introduction to modern algorithm concepts, covering hashing, dimension reduction, programming, gradient descent, and regression. It emphasizes both theoretical understanding and practical application, with each topic complemented by a mini-project. It's suitable for those who have taken CS107 and CS161.

No concepts data

+ 57 more concepts