MinHash is a technique used to quickly estimate the similarity between two sets. It was initially used in AltaVista search engine to detect duplicate web pages and has since been applied to large-scale clustering problems. It was invented by Andrei Broder in 1997.
Stanford University
Spring 2022
CS 168 provides a comprehensive introduction to modern algorithm concepts, covering hashing, dimension reduction, programming, gradient descent, and regression. It emphasizes both theoretical understanding and practical application, with each topic complemented by a mini-project. It's suitable for those who have taken CS107 and CS161.
No concepts data
+ 57 more concepts