Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1007/978-3-319-96136-1_31guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Enhancing Outlier Detection by an Outlier Indicator

Published: 15 July 2018 Publication History

Abstract

Outlier detection is an important task in data mining and has high practical value in numerous applications such as astronomical observation, text detection, fraud detection and so on. At present, a large number of popular outlier detection algorithms are available, including distribution-based, distance-based, density-based, and clustering-based approaches and so on. However, traditional outlier detection algorithms face some challenges. For one example, most distance-based and density-based outlier detection methods are based on k-nearest neighbors and therefore, are very sensitive to the value of k. For another example, some methods can only detect global outliers, but fail to detect local outliers. Last but not the least, most outlier detection algorithms do not accurately distinguish between boundary points and outliers. To partially solve these problems, in this paper, we propose to augment some boundary indicators to classical outlier detection algorithms. Experiments performed on both synthetic and real data sets demonstrate the efficacy of enhanced outlier detection algorithms.

References

[1]
Hawkins DM Identification of Outliers 1980 London Chapman and Hall
[2]
Barnett V and Lewis T Outliers in Statistical Data 1994 New York Wiley
[3]
Knorr, E.M., Ng, R.T.: Algorithms for mining distance-based outliers in large datasets. In: Proceedings of the 24th VLDB Conference, New York, USA, pp. 392–403 (1998)
[4]
Breuning, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 93–104 (2000)
[5]
Jiang MF, Tseng SS, and Su CM Two-phase clustering process for outliers detection Pattern Recogn. Lett. 2001 22 691-700
[6]
Angiulli F and Pizzuti C Elomaa T, Mannila H, and Toivonen H Fast outlier detection in high dimensional spaces Principles of Data Mining and Knowledge Discovery 2002 Heidelberg Springer 15-27
[7]
Ramaswamy Sridhar, Rastogi Rajeev, and Shim Kyuseok Efficient algorithms for mining outliers from large data sets ACM SIGMOD Record 2000 29 2 427-438
[8]
Jin W, Tung AKH, Han J, and Wang W Ng W-K, Kitsuregawa M, Li J, and Chang K Ranking outliers using symmetric neighborhood relationship Advances in Knowledge Discovery and Data Mining 2006 Heidelberg Springer 577-593
[9]
Huang H, Mehrotra K, and Mohan CK Rank-based outlier detection J. Stat. Comput. Simul. 2013 83 3 1-14
[10]
UCI: The UCI KDD Archive, University of California, Irvine, CA. http://kdd.ics.uci.edu/
[11]
Aggarwal, C., Yu, P.: Outlier detection for high-dimensional data. In: Proceedings of the 2001 ACM SIGMOD Conference (SIGMOD 2001), Santa Barbara, CA, USA, pp. 37–46 (2001)

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
Machine Learning and Data Mining in Pattern Recognition: 14th International Conference, MLDM 2018, New York, NY, USA, July 15-19, 2018, Proceedings, Part I
Jul 2018
469 pages
ISBN:978-3-319-96135-4
DOI:10.1007/978-3-319-96136-1
  • Editor:
  • Petra Perner

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 15 July 2018

Author Tags

  1. Outlier detection
  2. Distance-based outlier detection
  3. Density-based outlier detection
  4. Boundary detection
  5. k-Nearest neighbors

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media