A Nearly Tight Analysis of Greedy k-means++  @GoogleTechTalks
A Nearly Tight Analysis of Greedy k-means++  @GoogleTechTalks
Google TechTalks | A Nearly Tight Analysis of Greedy k-means++ @GoogleTechTalks | Uploaded April 2023 | Updated October 2024, 1 week ago.
A Google TechTalk, presented by Václav Rozhoň, 2023-04-13
Abstract: The famous k-means++ algorithm of Arthur and Vassilvitskii is the most popular practical algorithm for solving the k-means problem. The algorithm is very simple and computes the k output centers as follows: it samples the first center as a uniformly random point in the dataset and each of the following k−1 centers is then always sampled with probability proportional to the squared distance to the currently closest center. Amazingly, the k-means++ algorithm is known to return a Θ(log k) approximate solution in expectation.
In their seminal work, Arthur and Vassilvitskii asked about the guarantees of its following greedy variant: in every step, we sample ℓ candidate centers instead of one and then pick the one that minimizes the new cost. This is also how k-means++ is implemented in e.g. the popular Scikit-learn library. We analyze greedy k-means++: We prove that it is an O(ℓ^3 * log^3 k)-approximation algorithm and provide a near-matching lower bound.

Joint work with Christoph Grunau, Ahmet Alper Özüdoğru, Jakub Tětek
arxiv: arxiv.org/abs/2207.07949

Bio: Vaclav Rozhon is a PhD student at ETH Zurich advised by Mohsen Ghaffari. He works mostly on distributed and parallel algorithms; he also creates YouTube videos about algorithms (channel name: polylog). He has a young child and thus no hobbies.

A Google Talk Series on Algorithms, Theory, and Optimization
A Nearly Tight Analysis of Greedy k-means++Auto-bidding in Online Advertising: Campaign Management and FairnessSupply Chain Security with GoSergey Nazarov | Co-Founder Chainlink | web3 talks | Mar 16 2023 | MC: Marlon RuizThe Data Minimization Principle in Machine Learning2022 Blockly Developers Summit: SerializationChris Nunes, Scott Clark & BC Biermann | IMMUSE Founders | web3 talks | June 9th 2022 | Raphael HydeImproved Feature Importance Computation for Tree Models Based on the Banzhaf ValueAcademic Keynote: Differentially Private Covariance-Adaptive Mean Estimation, Adam Smith (BU)Foundation Models and Fair UseTree Learning: Optimal Algorithms and Sample Complexity2023 Blockly Developer Summit Day 1-8: Blocks in Docs

A Nearly Tight Analysis of Greedy k-means++ @GoogleTechTalks

SHARE TO X SHARE TO REDDIT SHARE TO FACEBOOK WALLPAPER