Cited By
View all- Peng ZWang ZDeng D(2023)Near-Duplicate Sequence Search at Scale for Large Language Model Memorization EvaluationProceedings of the ACM on Management of Data10.1145/35893241:2(1-18)Online publication date: 20-Jun-2023
- Azeroual OJha MNikiforova ASha KAlsmirat MJha S(2022)A Record Linkage-Based Data Deduplication Framework with DataCleaner ExtensionMultimodal Technologies and Interaction10.3390/mti60400276:4(27)Online publication date: 11-Apr-2022
- Roegiest ALee EPiwowarski BChevalier MGaussier EMaarek YNie JScholer F(2019)On Tradeoffs Between Document Signature Methods for a Legal Due Diligence CorpusProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331311(1001-1004)Online publication date: 18-Jul-2019
- Show More Cited By