Nothing Special   »   [go: up one dir, main page]

Skip to main content

A Semi-naive Bayesian Learning Method for Utilizing Unlabeled Data

  • Conference paper
Knowledge-Based Intelligent Information and Engineering Systems (KES 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4251))

  • 1280 Accesses

Abstract

In many applications, an enormous amount of unlabeled data is available with little cost. Therefore, it is natural to ask whether we can take advantage of these unlabeled data in classification learning. In this paper, we analyzed the role of unlabeled data in the context of naive Bayesian learning. Experimental results show that including unlabeled data as part of training data can significantly improve the performance of classification accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Castelli, V., Cover, T.M.: On the Exponential Value of Labeled Samples. Pattern Recognition Letters 16, 105–111 (1995)

    Article  Google Scholar 

  2. Cohn, D., et al.: Active Learning with Statistical Models. Journal of Artificial Intelligence Research 4, 129–145 (1996)

    MATH  Google Scholar 

  3. De Comite, F., et al.: Positive and Unlabeled Examples Help Learning. In: Tenth Int’l Conf. on Algorithmic Learning Theory, pp. 219–230 (1999)

    Google Scholar 

  4. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of Royal Statistical Society 39, 1–38 (1977)

    MATH  MathSciNet  Google Scholar 

  5. Duda, R., et al.: Pattern Classification, 2nd edn. John Wiley, Chichester (2001)

    MATH  Google Scholar 

  6. Goldman, S., Zhou, Y.: Enhancing Supervised Learning with Unlabeled Data. In: Int’l Conf. on Machine Leaning (2000)

    Google Scholar 

  7. Hofmann, T.: Text Categorization with Labeled and Unlabeled Data: A Generative Model Approach. In: NIPS 1999 Workshop on Using Unlabeled Data for Supervised Learning (1999)

    Google Scholar 

  8. Liere, R., Tadepalli, P.: Active Learning with Committees for Text Categorization. In: 14th National Conf. on Artificial Intelligence, pp. 591–596 (1997)

    Google Scholar 

  9. Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)

    MATH  Google Scholar 

  10. Mitchell, T.: The Role of Unlabeled Data in Supervised Learning. In: 6th Int’l Colloquium on Cognitive Science (1999)

    Google Scholar 

  11. Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.: Text Classification from Labeled and Unlabeled Documents Using EM. Machine Learning 39, 103–134 (2000)

    Article  MATH  Google Scholar 

  12. Nigam, K., Ghani, R.: Analyzing the Effectiveness and Applicability of Co-training. In: CIKM 2000 (2000)

    Google Scholar 

  13. Zhang, T.: Some Asymptotic Results Concerning the Value of Unlabeled Data. In: NIPS 1999 Workshop on Using Unlabeled Data for Supervised Learning (1999)

    Google Scholar 

  14. Zhou, Y., Goldman, S.: Enhancing Supervised Learning with Unlabeled Data. In: 17th Int’l Conf. On Machine Learning, pp. 327–334 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, CH. (2006). A Semi-naive Bayesian Learning Method for Utilizing Unlabeled Data. In: Gabrys, B., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2006. Lecture Notes in Computer Science(), vol 4251. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11892960_23

Download citation

  • DOI: https://doi.org/10.1007/11892960_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-46535-5

  • Online ISBN: 978-3-540-46536-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics