Zhao et al., 2023 - Google Patents
Double-Anonymous Sketch: Achieving Top-K-fairness for Finding Global Top-K Frequent ItemsZhao et al., 2023
View PDF- Document ID
- 10260680745650505775
- Author
- Zhao Y
- Han W
- Zhong Z
- Zhang Y
- Yang T
- Cui B
- Publication year
- Publication venue
- Proceedings of the ACM on Management of Data
External Links
Snippet
Finding top-K frequent items has been a hot topic in data stream processing in recent years, which has a wide range of applications. However, most of existing sketch algorithms focuses on finding local top-K in a single data stream. In this paper, we work on finding global top-K …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30533—Other types of queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30477—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/12—Replacement control
- G06F12/121—Replacement control using replacement algorithms
- G06F12/122—Replacement control using replacement algorithms of the least frequently used [LFU] type, e.g. with individual count value
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30312—Storage and indexing structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0806—Multiuser, multiprocessor or multiprocessing cache systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G06F17/30864—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems
- G06F17/30867—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems with filtering and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Li et al. | Wavingsketch: An unbiased and generic sketch for finding top-k items in data streams | |
Harmouch et al. | Cardinality estimation: An experimental survey | |
Dimitropoulos et al. | Probabilistic lossy counting: An efficient algorithm for finding heavy hitters | |
Zhong et al. | Burstsketch: Finding bursts in data streams | |
Shin et al. | D-cube: Dense-block detection in terabyte-scale tensors | |
Zhao et al. | Double-Anonymous Sketch: Achieving Top-K-fairness for Finding Global Top-K Frequent Items | |
Zhang et al. | On-off sketch: A fast and accurate sketch on persistence | |
Liu et al. | Methods for mining frequent items in data streams: an overview | |
Chen et al. | Mining frequent items in data stream using time fading model | |
Han et al. | Applications of sketches in network traffic measurement: A survey | |
Chen et al. | Out of many we are one: Measuring item batch with clock-sketch | |
Jia et al. | Loglog filter: Filtering cold items within a large range over high speed data streams | |
Qi et al. | Cuckoo counter: A novel framework for accurate per-flow frequency estimation in network measurement | |
Shi et al. | Cuckoo counter: Adaptive structure of counters for accurate frequency and top-k estimation | |
Wang et al. | Joinsketch: A sketch algorithm for accurate and unbiased inner-product estimation | |
Li et al. | Ladderfilter: Filtering infrequent items with small memory and time overhead | |
Zhang et al. | CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models | |
Basat et al. | Efficient summing over sliding windows | |
Cheng et al. | LTC: a fast algorithm to accurately find significant items in data streams | |
Wu et al. | MicroscopeSketch: Accurate Sliding Estimation Using Adaptive Zooming | |
Liu et al. | XY-sketch: On sketching data streams at web scale | |
Li et al. | Tight-sketch: A high-performance sketch for heavy item-oriented data stream mining with limited memory size | |
Dutta et al. | Towards" intelligent compression" in streams: a biased reservoir sampling based bloom filter approach | |
Wang et al. | Real-time Spread Burst Detection in Data Streaming | |
Xiao et al. | Accurate and o (1)-time query of per-flow cardinality in high-speed networks |