Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleSeptember 2024
FRIES: Fuzzing Rust Library Interactions via Efficient Ecosystem-Guided Target Generation
ISSTA 2024: Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and AnalysisPages 1137–1148https://doi.org/10.1145/3650212.3680348Rust has been extensively used in software development in the past decades due to its memory safety mechanisms and gradually matured ecosystems. Enhancing the quality of Rust libraries is critical to Rust ecosystems as the libraries are often the core ...
- research-articleJuly 2024
ObjTest: Object-Level Mutation for Testing Object Detection Systems
Internetware '24: Proceedings of the 15th Asia-Pacific Symposium on InternetwarePages 61–70https://doi.org/10.1145/3671016.3671400With the tremendous advancement of deep learning techniques, object detection (OD) systems have achieved significant development. These systems, powered by deep neural networks, are now widely employed in diverse applications, including autonomous ...
- research-articleJune 2024
Hybrid mutation driven testing for natural language inference
Journal of Software: Evolution and Process (WSMR), Volume 36, Issue 10https://doi.org/10.1002/smr.2694SummaryNatural language inference (NLI) is a task to infer the relationship between the premise and hypothesis sentences, whose models have essential applications in the many natural language processing (NLP) fields, for example, machine reading ...
We propose a novel automatic testing method, hybrid mutation driven testing (HMT), which extends the mutation idea in natural language inference (NLI). We apply four mutation operators to achieve the hybrid mutation strategy, mutating the premise and the ...
- research-articleJune 2024
Enumerating Valid Non-Alpha-Equivalent Programs for Interpreter Testing
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 5Article No.: 118, Pages 1–31https://doi.org/10.1145/3647994Skeletal program enumeration (SPE) can generate a great number of test programs for validating the correctness of compilers or interpreters. The classic SPE generates programs by exhaustively enumerating all possible variable usage patterns into a given ...
- research-articleApril 2024
MultiTest: Physical-Aware Object Insertion for Testing Multi-sensor Fusion Perception Systems
ICSE '24: Proceedings of the IEEE/ACM 46th International Conference on Software EngineeringArticle No.: 139, Pages 1–13https://doi.org/10.1145/3597503.3639191Multi-sensor fusion stands as a pivotal technique in addressing numerous safety-critical tasks and applications, e.g., self-driving cars and automated robotic arms. With the continuous advancement in data-driven artificial intelligence (AI), MSF's ...
-
- research-articleJanuary 2024
COPS: An improved information retrieval-based bug localization technique using context-aware program simplification
Journal of Systems and Software (JSSO), Volume 207, Issue Chttps://doi.org/10.1016/j.jss.2023.111868AbstractInformation Retrieval Based Bug Localization (IRBL) techniques are well suited for large-scale software debugging with fewer external dependencies and lower execution costs. However, existing IRBL techniques have several challenges, including ...
Highlights- A statement-level information retrieval-based bug localization technique.
- Setting the retrieval scope can improve the effectiveness of bug localization.
- The first two-thirds of the stack trace are more advantageous for bug ...
- research-articleAugust 2023
Crowdsourced test case generation for android applications via static program analysis
Automated Software Engineering (KLU-AUSE), Volume 30, Issue 2https://doi.org/10.1007/s10515-023-00394-wAbstractThe testing of Android applications(apps) is a challenging task due to the serious fragmentation issues and diverse usage environments. To improve the testing efficiency and collect the feedbacks from real usage scenarios, crowdsourcing has been ...
- research-articleJanuary 2023
Towards understanding bugs in Python interpreters
Empirical Software Engineering (KLU-EMSE), Volume 28, Issue 1https://doi.org/10.1007/s10664-022-10239-xAbstractPython has been widely used to develop large-scale software systems such as distributed systems, cloud computing, artificial intelligence, and Web platforms due to its flexibility and versatility. As a kind of complex software, Python interpreter ...
- research-articleJanuary 2023Distinguished Paper
QATest: A Uniform Fuzzing Framework for Question Answering Systems
ASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software EngineeringArticle No.: 81, Pages 1–12https://doi.org/10.1145/3551349.3556929The tremendous advancements in deep learning techniques have empowered question answering(QA) systems with the capability of dealing with various tasks. Many commercial QA systems, such as Siri, Google Home, and Alexa, have been deployed to assist ...
- research-articleOctober 2022
How higher order mutant testing performs for deep learning models: A fine-grained evaluation of test effectiveness and efficiency improved from second-order mutant-classification tuples
Information and Software Technology (INST), Volume 150, Issue Chttps://doi.org/10.1016/j.infsof.2022.106954Abstract Context:Given the prevalence of Deep Learning (DL) models in daily life, it is crucial to guarantee their reliability by DL testing. Recently, researchers have adapted mutation testing into DL testing to measure the test power of test sets. The ...
- research-articleJuly 2022
Aligned metric representation based balanced multiset ensemble learning for heterogeneous defect prediction
Information and Software Technology (INST), Volume 147, Issue Chttps://doi.org/10.1016/j.infsof.2022.106892Abstract Context:Heterogeneous defect prediction (HDP) refers to the defect prediction across projects with different metrics. Most existing HDP methods map source and target data into a common metric space where each dimension has ...
- research-articleJuly 2022
Adaptive test selection for deep neural networks
ICSE '22: Proceedings of the 44th International Conference on Software EngineeringPages 73–85https://doi.org/10.1145/3510003.3510232Deep neural networks (DNN) have achieved tremendous development in the past decade. While many DNN-driven software applications have been deployed to solve various tasks, they could also produce incorrect behaviors and result in massive losses. To reveal ...
- research-articleFebruary 2022
Classifying crowdsourced mobile test reports with image features: An empirical study
Journal of Systems and Software (JSSO), Volume 184, Issue Chttps://doi.org/10.1016/j.jss.2021.111121AbstractCrowdsourced testing has become a popular mobile application testing method, and it is capable of simulating real usage scenarios and detecting various bugs with a large workforce. However, inspecting and classifying the overwhelming ...
Graphical abstractDisplay Omitted
Highlights- An empirical study of crowdsourced mobile test report classification.
- ...
- research-articleSeptember 2020
An Empirical Study on Dynamic Typing Related Practices in Python Systems
ICPC '20: Proceedings of the 28th International Conference on Program ComprehensionPages 83–93https://doi.org/10.1145/3387904.3389253The dynamic typing discipline of Python allows developers to program at a high level of abstraction. However, type related bugs are commonly encountered in Python systems due to the lack of type declaration and static type checking. Especially, the ...
- research-articleMay 2018
Debugging with intelligence via probabilistic inference
ICSE '18: Proceedings of the 40th International Conference on Software EngineeringPages 1171–1181https://doi.org/10.1145/3180155.3180237We aim to debug a single failing execution without the assistance from other passing/failing runs. In our context, debugging is a process with substantial uncertainty - lots of decisions have to be made such as what variables shall be inspected first. To ...
- research-articleNovember 2016
Python predictive analysis for bug detection
FSE 2016: Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software EngineeringPages 121–132https://doi.org/10.1145/2950290.2950357Python is a popular dynamic language that allows quick software development. However, Python program analysis engines are largely lacking. In this paper, we present a Python predictive analysis. It first collects the trace of an execution, and then ...
- research-articleMay 2016
Revisit of automatic debugging via human focus-tracking analysis
ICSE '16: Proceedings of the 38th International Conference on Software EngineeringPages 808–819https://doi.org/10.1145/2884781.2884834In many fields of software engineering, studies on human behavior have attracted a lot of attention; however, few such studies exist in automated debugging. Parnin and Orso conducted a pioneering study comparing the performance of programmers in ...
- research-articleMarch 2016
Verifying Synchronization for Atomicity Violation Fixing
IEEE Transactions on Software Engineering (ISOF), Volume 42, Issue 3Pages 280–296https://doi.org/10.1109/TSE.2015.2477820Atomicity is a fundamental property to guarantee the isolation of a work unit (<italic>i.e.</italic>, a sequence of related events in a thread) from concurrent threads. However, ensuring atomicity is often very challenging due to complex thread ...
- research-articleAugust 2015
Test report prioritization to assist crowdsourced testing
ESEC/FSE 2015: Proceedings of the 2015 10th Joint Meeting on Foundations of Software EngineeringPages 225–236https://doi.org/10.1145/2786805.2786862In crowdsourced testing, users can be incentivized to perform testing tasks and report their results, and because crowdsourced workers are often paid per task, there is a financial incentive to complete tasks quickly rather than well. These reports of ...
- research-articleAugust 2015
Heterogeneous cross-company defect prediction by unified metric representation and CCA-based transfer learning
ESEC/FSE 2015: Proceedings of the 2015 10th Joint Meeting on Foundations of Software EngineeringPages 496–507https://doi.org/10.1145/2786805.2786813Cross-company defect prediction (CCDP) learns a prediction model by using training data from one or multiple projects of a source company and then applies the model to the target company data. Existing CCDP methods are based on the assumption that the ...