Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1835804.1835929acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Mass estimation and its applications

Published: 25 July 2010 Publication History

Abstract

This paper introduces mass estimation--a base modelling mechanism in data mining. It provides the theoretical basis of mass and an efficient method to estimate mass. We show that it solves problems very effectively in tasks such as information retrieval, regression and anomaly detection. The models, which use mass in these three tasks, perform at least as good as and often better than a total of eight state-of-the-art methods in terms of task-specific performance measures. In addition, mass estimation has constant time and space complexities.

Supplementary Material

JPG File (kdd2010_ting_mea_01.jpg)
MOV File (kdd2010_ting_mea_01.mov)

References

[1]
G. Aloupis. Geometric measures of data depth. DIMACS Series in Discrete Math and Theoretical Computer Science, 72:147--158, 2006.
[2]
A. Asuncion and D. Newman. UCI machine learning repository, 2007.
[3]
S. D. Bay and M. Schwabacher. Mining distance-based outliers in near linear time with randomization and a simple pruning rule. In Proceedings of SIGKDD, pages 29--38, 2003.
[4]
M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. LOF: Identifying density-based local outliers. In Proceedings of SIGKDD, pages 93--104, 2000.
[5]
B. Caputo, K. Sim, F. Furesjo, and A. Smola. Appearance-based object recognition using svms: which kernel should i use? In NIPS workshop on Statitsical methods for computational experiments in visual processing and computer vision, 2002.
[6]
C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2001.
[7]
R. Duda, P. Hart, and D. Stork. Pattern Classification. Second Edition. John Wiley, 2001.
[8]
G. Giacinto and F. Roli. Instance-based relevance feedback for image retrieval. In Advances in NIPS, pages 489--496, 2005.
[9]
J. He, M. Li, H. Zhang, H. Tong, and C. Zhang. Manifold-ranking based image retrieval. In Proceedings of ACM Multimedia, pages 9--16, 2004.
[10]
F. T. Liu, K. M. Ting, and Z.-H. Zhou. Isolation forest. In Proceedings of ICDM, pages 413--422, 2008.
[11]
R. Liu, J. M. Parelius, and K. Singh. Multivariate analysis by data depth. The Annals of Statistics, 27(3):783--840, 1999.
[12]
D. M. Rocke and D. L. Woodruff. Identification of outliers in multivariate data. Journal of the American Statistical Association, 91(435):1047--1061, 1996.
[13]
B. Sch-olkopf, R. C. Williamson, A. J. Smola, J. Shawe-Taylor, and J. C. Platt. Support vector method for novelty detection. In Advances in NIPS, pages 582--588, 2000.
[14]
J. S. Simonoff. Smoothing Methods in Statistics. Springer-Verlag, 1996.
[15]
K. M. Ting, S. C. Tan, and F. T. Liu. Mass: A new ranking measure for anomaly detection. Gippsland School of Information Technology, Monash University, Technical Report TR2009/1, 2009.
[16]
V. N. Vapnik. The Nature of Statistical Learning Theory. Second Edition. Springer, 2000.
[17]
G.-T. Zhou, K. M. Ting, F. T. Liu, and Y. Yin. Relevance feature mapping for content-based image retrieval. In Proceedings of Multimedia Data Mining Workshop at KDD, 2010.
[18]
Z.-H. Zhou, K.-J. Chen, and H.-B. Dai. Enhancing relevance feedback in image retrieval using unlabeled data. ACM Transactions on Information Systems, 24(2):219--244, 2006.
[19]
Z.-H. Zhou and H.-B. Dai. Query-sensitive similarity measure for content-based image retrieval. In Proceedings of ICDM, pages 1211--1215, 2006.

Cited By

View all
  • (2024)Layered isolation forest: A multi-level subspace algorithm for improving isolation forestNeurocomputing10.1016/j.neucom.2024.127525581(127525)Online publication date: May-2024
  • (2024)Random clustering-based outlier detectorInformation Sciences: an International Journal10.1016/j.ins.2024.120498667:COnline publication date: 1-May-2024
  • (2023)MFGAD-INT: in-band network telemetry data-driven anomaly detection using multi-feature fusion graph deep learningJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-023-00492-w12:1Online publication date: 28-Aug-2023
  • Show More Cited By

Index Terms

  1. Mass estimation and its applications

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 July 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tag

      1. mass estimation

      Qualifiers

      • Research-article

      Conference

      KDD '10
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Upcoming Conference

      KDD '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)13
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 16 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Layered isolation forest: A multi-level subspace algorithm for improving isolation forestNeurocomputing10.1016/j.neucom.2024.127525581(127525)Online publication date: May-2024
      • (2024)Random clustering-based outlier detectorInformation Sciences: an International Journal10.1016/j.ins.2024.120498667:COnline publication date: 1-May-2024
      • (2023)MFGAD-INT: in-band network telemetry data-driven anomaly detection using multi-feature fusion graph deep learningJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-023-00492-w12:1Online publication date: 28-Aug-2023
      • (2023)A self-training algorithm based on the two-stage data editing method with mass-based dissimilarityNeural Networks10.1016/j.neunet.2023.09.046168(431-449)Online publication date: Nov-2023
      • (2023) S: Adaptive anomaly detection on sporadic data streams Computer Communications10.1016/j.comcom.2023.06.027209(151-162)Online publication date: Sep-2023
      • (2023)kNN Classification: a reviewAnnals of Mathematics and Artificial Intelligence10.1007/s10472-023-09882-xOnline publication date: 1-Sep-2023
      • (2022)Ensemble-Based Information Retrieval With Mass Estimation for Hyperspectral Target DetectionIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2021.307558360(1-23)Online publication date: 2022
      • (2022)Detection and Classification of Anomalies in Large Datasets on the Basis of Information GranulesIEEE Transactions on Fuzzy Systems10.1109/TFUZZ.2021.307626530:8(2850-2860)Online publication date: 1-Aug-2022
      • (2022)Enhanced Tree-Based Anomaly Detection2022 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)10.1109/FUZZ-IEEE55066.2022.9882557(1-7)Online publication date: 18-Jul-2022
      • (2022)Fuzzy C-Means-based Isolation ForestApplied Soft Computing10.1016/j.asoc.2021.107354106:COnline publication date: 9-Apr-2022
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media