Software testing and debugging

research-article

FRIES: Fuzzing Rust Library Interactions via Efficient Ecosystem-Guided Target Generation

ISSTA 2024: Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and AnalysisPages 1137–1148https://doi.org/10.1145/3650212.3680348

Rust has been extensively used in software development in the past decades due to its memory safety mechanisms and gradually matured ecosystems. Enhancing the quality of Rust libraries is critical to Rust ecosystems as the libraries are often the core ...

research-article

ObjTest: Object-Level Mutation for Testing Object Detection Systems

Internetware '24: Proceedings of the 15th Asia-Pacific Symposium on InternetwarePages 61–70https://doi.org/10.1145/3671016.3671400

With the tremendous advancement of deep learning techniques, object detection (OD) systems have achieved significant development. These systems, powered by deep neural networks, are now widely employed in diverse applications, including autonomous ...

research-article

Hybrid mutation driven testing for natural language inference

Journal of Software: Evolution and Process (WSMR), Volume 36, Issue 10https://doi.org/10.1002/smr.2694

Summary

Natural language inference (NLI) is a task to infer the relationship between the premise and hypothesis sentences, whose models have essential applications in the many natural language processing (NLP) fields, for example, machine reading ...

We propose a novel automatic testing method, hybrid mutation driven testing (HMT), which extends the mutation idea in natural language inference (NLI). We apply four mutation operators to achieve the hybrid mutation strategy, mutating the premise and the ...

research-article

Enumerating Valid Non-Alpha-Equivalent Programs for Interpreter Testing

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 5Article No.: 118, Pages 1–31https://doi.org/10.1145/3647994

Skeletal program enumeration (SPE) can generate a great number of test programs for validating the correctness of compilers or interpreters. The classic SPE generates programs by exhaustively enumerating all possible variable usage patterns into a given ...

research-article

MultiTest: Physical-Aware Object Insertion for Testing Multi-sensor Fusion Perception Systems

ICSE '24: Proceedings of the IEEE/ACM 46th International Conference on Software EngineeringArticle No.: 139, Pages 1–13https://doi.org/10.1145/3597503.3639191

Multi-sensor fusion stands as a pivotal technique in addressing numerous safety-critical tasks and applications, e.g., self-driving cars and automated robotic arms. With the continuous advancement in data-driven artificial intelligence (AI), MSF's ...

research-article

COPS: An improved information retrieval-based bug localization technique using context-aware program simplification

Journal of Systems and Software (JSSO), Volume 207, Issue Chttps://doi.org/10.1016/j.jss.2023.111868

Abstract

Information Retrieval Based Bug Localization (IRBL) techniques are well suited for large-scale software debugging with fewer external dependencies and lower execution costs. However, existing IRBL techniques have several challenges, including ...

Highlights

A statement-level information retrieval-based bug localization technique.
Setting the retrieval scope can improve the effectiveness of bug localization.
The first two-thirds of the stack trace are more advantageous for bug ...

research-article

Crowdsourced test case generation for android applications via static program analysis

Automated Software Engineering (KLU-AUSE), Volume 30, Issue 2https://doi.org/10.1007/s10515-023-00394-w

Abstract

The testing of Android applications(apps) is a challenging task due to the serious fragmentation issues and diverse usage environments. To improve the testing efficiency and collect the feedbacks from real usage scenarios, crowdsourcing has been ...

research-article

Towards understanding bugs in Python interpreters

Empirical Software Engineering (KLU-EMSE), Volume 28, Issue 1https://doi.org/10.1007/s10664-022-10239-x

Abstract

Python has been widely used to develop large-scale software systems such as distributed systems, cloud computing, artificial intelligence, and Web platforms due to its flexibility and versatility. As a kind of complex software, Python interpreter ...

research-article

Distinguished Paper

QATest: A Uniform Fuzzing Framework for Question Answering Systems

ASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software EngineeringArticle No.: 81, Pages 1–12https://doi.org/10.1145/3551349.3556929

The tremendous advancements in deep learning techniques have empowered question answering(QA) systems with the capability of dealing with various tasks. Many commercial QA systems, such as Siri, Google Home, and Alexa, have been deployed to assist ...

research-article

How higher order mutant testing performs for deep learning models: A fine-grained evaluation of test effectiveness and efficiency improved from second-order mutant-classification tuples

Information and Software Technology (INST), Volume 150, Issue Chttps://doi.org/10.1016/j.infsof.2022.106954

Abstract Context:

Given the prevalence of Deep Learning (DL) models in daily life, it is crucial to guarantee their reliability by DL testing. Recently, researchers have adapted mutation testing into DL testing to measure the test power of test sets. The ...

research-article

Aligned metric representation based balanced multiset ensemble learning for heterogeneous defect prediction

Information and Software Technology (INST), Volume 147, Issue Chttps://doi.org/10.1016/j.infsof.2022.106892

Abstract Context:

Heterogeneous defect prediction (HDP) refers to the defect prediction across projects with different metrics. Most existing HDP methods map source and target data into a common metric space where each dimension has ...

research-article

Adaptive test selection for deep neural networks

ICSE '22: Proceedings of the 44th International Conference on Software EngineeringPages 73–85https://doi.org/10.1145/3510003.3510232

Deep neural networks (DNN) have achieved tremendous development in the past decade. While many DNN-driven software applications have been deployed to solve various tasks, they could also produce incorrect behaviors and result in massive losses. To reveal ...

research-article

Classifying crowdsourced mobile test reports with image features: An empirical study

Journal of Systems and Software (JSSO), Volume 184, Issue Chttps://doi.org/10.1016/j.jss.2021.111121

Abstract

Crowdsourced testing has become a popular mobile application testing method, and it is capable of simulating real usage scenarios and detecting various bugs with a large workforce. However, inspecting and classifying the overwhelming ...

Graphical abstract

Display Omitted

Highlights

An empirical study of crowdsourced mobile test report classification.
...

research-article

An Empirical Study on Dynamic Typing Related Practices in Python Systems

ICPC '20: Proceedings of the 28th International Conference on Program ComprehensionPages 83–93https://doi.org/10.1145/3387904.3389253

The dynamic typing discipline of Python allows developers to program at a high level of abstraction. However, type related bugs are commonly encountered in Python systems due to the lack of type declaration and static type checking. Especially, the ...

research-article

Debugging with intelligence via probabilistic inference

ICSE '18: Proceedings of the 40th International Conference on Software EngineeringPages 1171–1181https://doi.org/10.1145/3180155.3180237

We aim to debug a single failing execution without the assistance from other passing/failing runs. In our context, debugging is a process with substantial uncertainty - lots of decisions have to be made such as what variables shall be inspected first. To ...

research-article

Public Access

Python predictive analysis for bug detection

FSE 2016: Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software EngineeringPages 121–132https://doi.org/10.1145/2950290.2950357

Python is a popular dynamic language that allows quick software development. However, Python program analysis engines are largely lacking. In this paper, we present a Python predictive analysis. It first collects the trace of an execution, and then ...

research-article

Revisit of automatic debugging via human focus-tracking analysis

ICSE '16: Proceedings of the 38th International Conference on Software EngineeringPages 808–819https://doi.org/10.1145/2884781.2884834

In many fields of software engineering, studies on human behavior have attracted a lot of attention; however, few such studies exist in automated debugging. Parnin and Orso conducted a pioneering study comparing the performance of programmers in ...

research-article

Verifying Synchronization for Atomicity Violation Fixing

IEEE Transactions on Software Engineering (ISOF), Volume 42, Issue 3Pages 280–296https://doi.org/10.1109/TSE.2015.2477820

Atomicity is a fundamental property to guarantee the isolation of a work unit (<italic>i.e.</italic>, a sequence of related events in a thread) from concurrent threads. However, ensuring atomicity is often very challenging due to complex thread ...

research-article

Test report prioritization to assist crowdsourced testing

ESEC/FSE 2015: Proceedings of the 2015 10th Joint Meeting on Foundations of Software EngineeringPages 225–236https://doi.org/10.1145/2786805.2786862

In crowdsourced testing, users can be incentivized to perform testing tasks and report their results, and because crowdsourced workers are often paid per task, there is a financial incentive to complete tasks quickly rather than well. These reports of ...

research-article

Heterogeneous cross-company defect prediction by unified metric representation and CCA-based transfer learning

ESEC/FSE 2015: Proceedings of the 2015 10th Joint Meeting on Foundations of Software EngineeringPages 496–507https://doi.org/10.1145/2786805.2786813

Cross-company defect prediction (CCDP) learns a prediction model by using training data from one or multiple projects of a source company and then applies the model to the target company data. Existing CCDP methods are based on the assumption that the ...

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

FRIES: Fuzzing Rust Library Interactions via Efficient Ecosystem-Guided Target Generation

ObjTest: Object-Level Mutation for Testing Object Detection Systems

Hybrid mutation driven testing for natural language inference

Enumerating Valid Non-Alpha-Equivalent Programs for Interpreter Testing

MultiTest: Physical-Aware Object Insertion for Testing Multi-sensor Fusion Perception Systems

COPS: An improved information retrieval-based bug localization technique using context-aware program simplification

Crowdsourced test case generation for android applications via static program analysis

Towards understanding bugs in Python interpreters

QATest: A Uniform Fuzzing Framework for Question Answering Systems

How higher order mutant testing performs for deep learning models: A fine-grained evaluation of test effectiveness and efficiency improved from second-order mutant-classification tuples

Aligned metric representation based balanced multiset ensemble learning for heterogeneous defect prediction

Adaptive test selection for deep neural networks

Classifying crowdsourced mobile test reports with image features: An empirical study

An Empirical Study on Dynamic Typing Related Practices in Python Systems

Debugging with intelligence via probabilistic inference

Python predictive analysis for bug detection

Revisit of automatic debugging via human focus-tracking analysis

Verifying Synchronization for Atomicity Violation Fixing

Test report prioritization to assist crowdsourced testing

Heterogeneous cross-company defect prediction by unified metric representation and CCA-based transfer learning

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder