filmov
tv
CountMin sketch, part 2: proving error bound

Показать описание
We frame and then prove the bound that says the count estimate produced by a CountMin point query is "probably not too far" from the correct count. The proof uses universality, Markov's inequality, and linearity of expectation, among other ideas. Finally, we consider how large the sketch must be to achieve particular bounds.
These materials are also openly available on figshare. Please cite this work; this ensures that funding agencies see the impact and importance of these open learning materials.
Channel: @BenLangmead
These materials are also openly available on figshare. Please cite this work; this ensures that funding agencies see the impact and importance of these open learning materials.
Channel: @BenLangmead
CountMin sketch, part 2: proving error bound
CountMin sketch, part 3
CountMin sketch, part 1
Count Min Sketch | Probabilistic Data Structure | Space Efficient | Frequency Mapping
Count min sketch | Efficient algorithm for counting stream of data | system design components
Count-min Sketch to Infinity - Steve Lorello - NDC Oslo 2022
MAT1841 - Lec 13 - Majority item and frequency sketches + MinHash set similarity sketch
Advanced Data Structures: Designing an Optimal Count-Min Sketch
Lecture 6.1 Count-min sketches, Sparse recovery
Counting Distinct Elements in the Message Passing Model
Lecture 14 - Count Min Sketch, Counter Braids, Locality Sensitive Hashing
Count-Min and Count Sketches
Sketching Part 1
Applications of Count-Min and Count Sketches: Range andPoint Queries
Redis Monthly Live: Count-min Sketch to Infinity with Stephen Lorello - Episode 6
NDSS 2023 - A Robust Counting Sketch for Data Plane Intrusion Detection
Sketching as a Tool for Fast Algorithm Design
Lecture 12, Video 4: Applications of List Recovery
Lecture 8 (a) Streaming Algorithm: Count Sketch
Sketching, Sampling and Sublinear Time Algorithms
A problem so hard even Google relies on Random Chance
Advanced Data Structures: Inverting the BWT
Streaming, Sketching and Sufficient Statistics I
Randomization or Condensation? LinearCost Matrix Sketching Via Cascaded Compression Sampling
Комментарии