Imbalanced Learning: Foundations, Algorithms, and Applications | Guide books

Imbalanced Learning: Foundations, Algorithms, and ApplicationsJuly 2013

July 2013

Authors:
Haibo He,
Yunqian Ma

Publisher:

Wiley-IEEE Press

ISBN:978-1-118-07462-6

Published:01 July 2013

Pages:

216

Available at Amazon

Bibliometrics

Abstract

The first book of its kind to review the current status and future direction of the exciting new branch of machine learning/data mining called imbalanced learningImbalanced learning focuses on how an intelligent system can learn when it is provided with imbalanced data. Solving imbalanced learning problems is critical in numerous data-intensive networked systems, including surveillance, security, Internet, finance, biomedical, defense, and more. Due to the inherent complex characteristics of imbalanced data sets, learning from such data requires new understandings, principles, algorithms, and tools to transform vast amounts of raw data efficiently into information and knowledge representation. The first comprehensive look at this new branch of machine learning, this book offers a critical review of the problem of imbalanced learning, covering the state of the art in techniques, principles, and real-world applications. Featuring contributions from experts in both academia and industry, Imbalanced Learning: Foundations, Algorithms, and Applications provides chapter coverage on:Foundations of Imbalanced LearningImbalanced Datasets: From Sampling to ClassifiersEnsemble Methods for Class Imbalance LearningClass Imbalance Learning Methods for Support Vector MachinesClass Imbalance and Active LearningNonstationary Stream Data Learning with Imbalanced Class DistributionAssessment Metrics for Imbalanced LearningImbalanced Learning: Foundations, Algorithms, and Applications will help scientists and engineers learn how to tackle the problem of learning from imbalanced datasets, and gain insight into current developments in the field as well as future research directions.

Cited By

Contributors

Haibo He
The University of Rhode Island
- Publication Years2006 - 2021
- Publication counts76
- Citation count1,974
- Available for Download1
- Downloads (cumulative)2,706
- Downloads (12 months)54
- Downloads (6 weeks)4
- Average Downloads per Article2,706
- Average Citation per Article26
View Full Profile
Yunqian Ma
- Publication Years2013 - 2013
- Publication counts1
- Citation count44
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article44
View Full Profile

Index Terms

Imbalanced Learning: Foundations, Algorithms, and Applications
1. Computing methodologies
  1. Machine learning

Reviews

Reviewer: CK Raju

Imagine an imbalanced dataset of cancer patients with highly skewed data having only 0.01 percent positive cancer cases. A naive or dumb machine that calls out "no cancer" to all queries would appear to be 99.99 percent accurate, and could even be misconstrued as a good prediction model over a competing machine learning algorithm. Additionally, the consequences of such a misclassification could be disastrous for patients with cancer. A comprehensive knowledge of machine learning, therefore, would be incomplete without a fair understanding of such predicaments and how to resolve them. This book promises to engage the reader by providing a vivid picture of the problems associated with imbalanced datasets, specific aspects and approaches to solve the problems, and assessment metrics. The narrative is ordered and easy to understand. A dozen authors contribute to the book's eight chapters: "Introduction," "Foundations of Imbalanced Learning," "Imbalanced Datasets: From Sampling to Classifiers," "Ensemble Methods for Class Imbalance Learning," "Class Imbalance Learning Methods for Support Vector Machines," "Class Imbalance and Active Learning," "Nonstationary Stream Data Learning with Imbalanced Class Distribution," and "Assessment Metrics for Imbalanced Learning." There aren't any competing books on imbalanced learning. Leaving aside the usual issues associated with multiple contributors-for example, the high chance for repetition or the difficulty of maintaining uniformity in presentation style-the book does justice to machine learning by bringing out issues related to imbalanced datasets. The significance of precision and recall is introduced or explained in multiple chapters, but presented as it is from varying perspectives, it doesn't affect the interest of the reader. The editors have succeeded in maintaining coherency and consistency while presenting content. For instance, while the terms F-score or F1 score could also have been used, the consistent use of F-measure throughout the book is noteworthy. Consistency is also visible in the illustrations involving precision and recall. With different authors assigned to different chapters, it is extremely difficult to trace out errors. Only one instance was detected: welding flaw was introduced as an example for imbalanced datasets in chapter 2. The case associated with welding flaws is a more apt example for discussions on anomaly detection and outliers. Anomalies, by definition, do not constitute a class or cluster by themselves, even if skewness is present as an attribute on the data. This book certainly qualifies as a reference for graduate studies in machine learning. Research students are sure to find it highly valuable and a prized possession, especially taking into account the wealth of supporting literature that the authors have brought to the fore. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Learning from Imbalanced Data

With the continuous expansion of data availability in many large-scale, complex, and networked systems, such as surveillance, security, Internet, and finance, it becomes critical to advance the fundamental understanding of knowledge discovery and ...
Multiset feature learning for highly imbalanced data classification
AAAI'17: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence

With the expansion of data, increasing imbalanced data has emerged. When the imbalance ratio of data is high, most existing imbalanced learning methods decline in classification performance. To address this problem, a few highly imbalanced learning ...
Imbalanced Sentiment Classification with Multi-Task Learning
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management

Supervised learning methods are widely used in sentiment classification. However, when sentiment distribution is imbalanced, the performance of these methods declines. In this paper, we propose an effective approach for imbalanced sentiment ...

Browse Books

Sections

Cited By

Index Terms

Reviews

Access critical reviews of Computing literature here

Learning from Imbalanced Data

Multiset feature learning for highly imbalanced data classification

Imbalanced Sentiment Classification with Multi-Task Learning

Save to Binder

Sections

Cited By

Save to Binder

Index Terms

Reviews

Access critical reviews of Computing literature here

Recommendations

Learning from Imbalanced Data

Multiset feature learning for highly imbalanced data classification

Imbalanced Sentiment Classification with Multi-Task Learning