default search action
19th MSR 2022: Pittsburgh, PA, USA
- 19th IEEE/ACM International Conference on Mining Software Repositories, MSR 2022, Pittsburgh, PA, USA, May 23-24, 2022. ACM 2022, ISBN 978-1-4503-9303-4
- Nhan Nguyen, Sarah Nadi:
An Empirical Evaluation of GitHub Copilot's Code Suggestions. 1-5 - Sohil Lal Shrestha, Shafiul Azam Chowdhury, Christoph Csallner:
SLNET: A Redistributable Corpus of 3rd-party Simulink Models. 1-5 - Fatemeh Khoshnoud, Ali Rezaei Nasab, Zahra Toudeji, Ashkan Sami:
Which bugs are missed in code reviews: An empirical study on SmartSHARK dataset. 1-5 - Ahmad Abdellatif, Mairieli Santos Wessel, Igor Steinmacher, Marco Aurélio Gerosa, Emad Shihab:
BotHunter: An Approach to Detect Software Bots in GitHub. 6-17 - Nikitha Rao, Jason Tsay, Martin Hirzel, Vincent J. Hellendoorn:
Comments on Comments: Where Code Review and Documentation Meet. 18-22 - Akalanka Galappaththi, Sarah Nadi, Christoph Treude:
Does This Apply to Me? An Empirical Study of Technical Context in Stack Overflow. 23-34 - Jirat Pasuksmit, Patanamon Thongtanunam, Shanika Karunasekera:
Towards Reliable Agile Iterative Planning via Predicting Documentation Changes of Work Items. 35-47 - Clara Marie Lüders, Abir Bouraffa, Walid Maalej:
Beyond Duplicates: Towards Understanding and Predicting Link Types in Issue Tracking Systems. 48-60 - Ruben Opdebeeck, Ahmed Zerouali, Coen De Roover:
Smelly Variables in Ansible Infrastructure Code: Detection, Prevalence, and Lifetime. 61-72 - Lloyd Montgomery, Clara Marie Lüders, Walid Maalej:
An Alternative Issue Tracking Dataset of Public Jira Repositories. 73-77 - Qinyun Wu, Huan Song, Ping Yang:
Real-World Clone-Detection in Go. 78-79 - Davide Rossi, Stefano Zacchiroli:
Geographic Diversity in Public Code Contributions: An Exploratory Large-Scale Study Over 50 Years. 80-85 - Johannes Härtel, Ralf Lämmel:
Operationalizing Threats to MSR Studies by Simulation-Based Testing. 86-97 - Zeinab Abou Khalil, Stefano Zacchiroli:
The General Index of Software Engineering Papers. 98-102 - Masanari Kondo, Shinobu Saito, Yukako Iimura, Eunjong Choi, Osamu Mizuno, Yasutaka Kamei, Naoyasu Ubayashi:
Challenges and Future Research Direction for Microtask Programming in Industry. 103-104 - Daniel Izquierdo-Cortazar, Jesús Alonso-Gutiérrez, Alberto Pérez García-Plaza, Gregorio Robles, Jesús M. González-Barahona:
Starting the InnerSource Journey: Key Goals and Metrics to Measure Collaboration. 105-106 - Eman Abdullah AlOmar, Anthony Peruma, Mohamed Wiem Mkaouer, Christian D. Newman, Ali Ouni:
An Exploratory Study on Refactoring Documentation in Issues Handling. 107-111 - Ambarish Moharil, Dmitrii Orlov, Samar Jameel, Tristan Trouwen, Nathan Cassee, Alexander Serebrenik:
Between JIRA and GitHub: ASFBot and its Influence on Human Comments in Issue Trackers. 112-116 - Amirreza Bagheri, Péter Hegedüs:
Is Refactoring Always a Good Egg? Exploring the Interconnection Between Bugs and Refactorings. 117-121 - Nicholas Alexandre Nagy, Rabe Abdalkareem:
On the Co-Occurrence of Refactoring of Test and Source Code. 122-126 - Anthony Peruma, Eman Abdullah AlOmar, Christian D. Newman, Mohamed Wiem Mkaouer, Ali Ouni:
Refactoring Debt: Myth or Reality? An Exploratory Study on the Relationship Between Technical Debt and Refactoring. 127-131 - Carlos Diego Andrade de Almeida, Diego N. Feijó, Lincoln S. Rocha:
Studying the Impact of Continuous Delivery Adoption on Bug-Fixing Time in Apache's Open-Source Projects. 132-136 - Preetha Chatterjee, Tushar Sharma, Paul Ralph:
Empirical Standards for Repository Mining. 142-143 - Rui Shu, Tianpei Xia, Laurie A. Williams, Tim Menzies:
Dazzle: Using Optimized Generative Adversarial Networks to Address Security Data Class Imbalance Issue. 144-155 - Rahul Yedida, Tim Menzies:
How to Improve Deep Learning for Software Analytics (a case study with code smell detection). 156-166 - Matteo Ciniselli, Luca Pascarella, Gabriele Bavota:
To What Extent do Deep Learning-based Code Recommenders Generate Predictions by Cloning Code from the Training Set? 167-178 - Harshitha Menon, Konstantinos Parasyris, Tom Scogland, Todd Gamblin:
Searching for High-Fidelity Builds Using Active Learning. 179-190 - Hossein Keshavarz, Meiyappan Nagappan:
ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction. 191-195 - Francesco Altiero, Anna Corazza, Sergio Di Martino, Adriano Peron, Luigi L. L. Starace:
ReCover: a Curated Dataset for Regression Testing Research. 196-200 - Gustavo Ansaldi Oliva:
Mining the Ethereum Blockchain Platform: Best Practices and Pitfalls (MSR 2022 Tutorial). 201-202 - Carlos Zimmerle, Kiev Gama, Fernando Castor, José Murilo Mota Filho:
Mining the Usage of Reactive Programming APIs: A Study on GitHub and Stack Overflow. 203-214 - Sangeeth Kochanthara, Yanja Dajsuren, Loek Cleophas, Mark van den Brand:
Painting the Landscape of Automotive Software in GitHub. 215-226 - Keerthana Muthu Subash, Lakshmi Prasanna Kumar, Sri Lakshmi Vadlamani, Preetha Chatterjee, Olga Baysal:
DISCO: A Dataset of Discord Chat Conversations for Software Engineering Research. 227-231 - Rosa Filgueira, Daniel Garijo:
Inspect4py: A Knowledge Extraction Framework for Python Code Repositories. 232-236 - Murali Sridharan, Mika Mäntylä, Maëlick Claes, Leevi Rantala:
SoCCMiner: A Source Code-Comments and Comment-Context Miner. 242-246 - Bonan Kou, Yifeng Di, Muhao Chen, Tianyi Zhang:
SOSum: A Dataset of Stack Overflow Post Summaries. 247-251 - Shaiful Alam Chowdhury, Gias Uddin, Reid Holmes:
An Empirical Study on Maintainable Method Size in Java. 252-264 - Victor Veloso, André C. Hora:
Characterizing High-Quality Test Methods: A First Empirical Study. 265-269 - Mohammad Reza Taesiri, Finlay Macklon, Cor-Paul Bezemer:
CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot transfer learning. 270-281 - Yi Yang, Ana L. Milanova, Martin Hirzel:
Complex Python Features in the Wild. 282-293 - Kevin Jesse, Premkumar T. Devanbu:
ManyTypes4TypeScript: A Comprehensive TypeScript Dataset for Sequence-Based Type Inference. 294-298 - Michele Tufano, Shao Kun Deng, Neel Sundaresan, Alexey Svyatkovskiy:
METHODS2TEST: A dataset of focal methods mapped to test cases. 299-303 - Ellen Arteca, Alexi Turcotte:
npm-filter: Automating the mining of dynamic information from npm packages. 304-308 - Isabella Ferreira, Bram Adams, Jinghui Cheng:
How heated is it? Understanding GitHub locked issues. 309-320 - Humphrey O. Obie, Idowu Ilekura, Hung Du, Mojtaba Shahin, John C. Grundy, Li Li, Jon Whittle, Burak Turhan:
On the Violation of Honesty in Mobile Apps: Automated Detection and Categories. 321-332 - Anirudh Ramchandran, Likang Yin, Vladimir Filkov:
Exploring Apache Incubator Project Trajectories with APEX. 333-337 - Melanie Warrick, Samuel F. Rosenblatt, Jean-Gabriel Young, Amanda Casari, Laurent Hébert-Dufresne, James P. Bagrow:
The OCEAN mailing list data set: Network analysis spanning mailing lists and code repositories. 338-342 - Gunnar Kudrjavets, Nachiappan Nagappan, Ayushi Rastogi:
The Unexplored Treasure Trove of Phabricator Code Reviews. 343-347 - Kimberly Truong, Courtney Miller, Bogdan Vasilescu, Christian Kästner:
The Unsolvable Problem or the Unheard Answer? A Dataset of 24, 669 Open-Source Software Conference Talks. 348-352 - Konstantin Grotov, Sergey Titov, Vladimir Sotnikov, Yaroslav Golubev, Timofey Bryksin:
A Large-Scale Comparison of Python Code in Jupyter Notebooks and Scripts. 353-364 - Adem Ait, Javier Luis Cánovas Izquierdo, Jordi Cabot:
An Empirical Study on the Survival Rate of GitHub Projects. 365-375 - Pei Liu, Mattia Fazzini, John C. Grundy, Li Li:
Do Customized Android Frameworks Keep Pace with Android? 376-387 - Petya Buchkova, Joakim Hey Hinnerskov, Kasper Olsen, Rolf-Helge Pfeiffer:
DaSEA - A Dataset for Software Ecosystem Analysis. 388-392 - Kristiina Rahkema, Dietmar Pfahl:
Dataset: Dependency Networks of Open Source Libraries Available Through CocoaPods, Carthage and Swift PM. 393-397 - Anna Vlasova, Maria Tigina, Ilya Vlasov, Anastasiia Birillo, Yaroslav Golubev, Timofey Bryksin:
Lupa: A Framework for Large Scale Analysis of the Programming Language Usage. 398-402 - Nicolas Riquet, Xavier Devroey, Benoît Vanderose:
GitDelver Enterprise Dataset (GDED): An Industrial Closed-source Dataset for Socio-Technical Research. 403-407 - K. P. Arun, Saurabh Kumar, Debadatta Mishra, Biswabandan Panda:
SniP: An Efficient Stack Tracing Framework for Multi-threaded Programs. 408-412 - Fabian Heseding, Willy Scheibel, Jürgen Döllner:
Tooling for Time- and Space-efficient git Repository Mining. 413-417 - Cedric Richter, Heike Wehrheim:
TSSB-3M: Mining single statement bugs at massive scale. 418-422 - Wei Tang, Yanlin Wang, Hongyu Zhang, Shi Han, Ping Luo, Dongmei Zhang:
LibDB: An Effective and Efficient Framework for Detecting Third-Party Libraries in Binaries. 423-434 - Roland Croft, Muhammad Ali Babar, Huaming Chen:
Noisy Label Learning for Security Defects. 435-447 - Barbara Russo, Matteo Camilli, Moritz Mock:
WeakSATD: Detecting Weak Self-admitted Technical Debt. 448-453 - Saurabh Kumar, Debadatta Mishra, Biswabandan Panda, Sandeep Kumar Shukla:
AndroOBFS: Time-tagged Obfuscated Android Malware Dataset with Family Information. 454-458 - Jordan Samhi, Tegawendé F. Bissyandé, Jacques Klein:
TriggerZoo: A Dataset of Android Applications Automatically Infected with Logic Bombs. 459-463 - Quang-Cuong Bui, Riccardo Scandariato, Nicolás E. Díaz Ferreyra:
Vul4J: A Dataset of Reproducible Java Vulnerabilities Geared Towards the Study of Program Repair Techniques. 464-468 - Tatiana Castro Vélez, Raffi Khatchadourian, Mehdi Bagherzadeh, Anita Raja:
Challenges in Migrating Imperative Deep Learning Programs to Graph Execution: An Empirical Study. 469-481 - Jingzhi Gong, Tao Chen:
Does Configuration Encoding Matter in Learning Software Performance? An Empirical Study on Encoding Schemes. 482-494 - Ekaterina Koshchenko, Egor Klimov, Vladimir Kovalenko:
Multimodal Recommendation of Messenger Channels. 495-505 - Rajeswari Hita Kambhamettu, John Billos, Tomi Oluwaseun-Apo, Benjamin Gafford, Rohan Padhye, Vincent J. Hellendoorn:
On the Naturalness of Fuzzer-Generated Code. 506-510 - Fran Silavong, Sean J. Moran, Antonios Georgiadis, Rohan Saphal, Robert Otter:
Senatus - A Fast and Accurate Code-to-Code Recommendation Engine. 511-523 - Wei Ma, Mengjie Zhao, Ezekiel O. Soremekun, Qiang Hu, Jie M. Zhang, Mike Papadakis, Maxime Cordy, Xiaofei Xie, Yves Le Traon:
GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses. 524-536 - Gunnar Kudrjavets, Nachiappan Nagappan, Ayushi Rastogi:
Do Small Code Changes Merge Faster? A Multi-Language Empirical Investigation. 537-548 - Irving Muller Rodrigues, Daniel Aloise, Eraldo Rezende Fernandes:
FaST: A linear time stack trace alignment heuristic for crash report deduplication. 549-560 - Julius Musseau, John Speed Meyers, George P. Sieniawski, C. Albert Thompson, Daniel M. Germán:
Is Open Source Eating the World's Software? Measuring the Proportion of Open Source in Proprietary Software Using Java Binaries. 561-565 - Suvodeep Majumder, Tianpei Xia, Rahul Krishna, Tim Menzies:
Methods for Stabilizing Models Across Large Samples of Projects (with case studies on Predicting Defect and Project Health). 566-578 - Gunnar Kudrjavets, Aditya Kumar, Nachiappan Nagappan, Ayushi Rastogi:
Mining Code Review Data to Understand Waiting Times Between Acceptance and Merging: An Empirical Analysis. 579-590 - Asma Razagallah, Raphaël Khoury, Jean-Baptiste Poulet:
TwinDroid: A Dataset of Android app System call traces and Trace Generation Pipeline. 591-595 - David Hin, Andrey Kan, Huaming Chen, Muhammad Ali Babar:
LineVD: Statement-level Vulnerability Detection using Graph Neural Networks. 596-607 - Michael Fu, Chakkrit Tantithamthavorn:
LineVul: A Transformer-based Line-Level Vulnerability Prediction. 608-620 - Triet Huynh Minh Le, Muhammad Ali Babar:
On the Use of Fine-grained Vulnerable Code Statements for Software Vulnerability Assessment Models. 621-633 - Jinyoung Kim, Misoo Kim, Eunseok Lee:
ECench: An Energy Bug Benchmark of Ethereum Client Software. 634-638 - Kim Herzig, Luke Ghostling, Maximilian Grothusmann, Sascha Just, Nora Huang, Alan Klimowski, Yashasvini Ramkumar, Myles McLeroy, Kivanç Muslu, Hitesh Sajnani, Varsha Vadaga:
Microsoft CloudMine: Data Mining for the Executive Order on Improving the Nation's Cybersecurity. 639 - Yuxiang Gao, Yi Zhu, Qiao Yu:
Evaluating the effectiveness of local explanation methods on source code-based defect prediction models. 640-645 - Fiorella Zampetti, Vittoria Nardone, Massimiliano Di Penta:
Problems and Solutions in Applying Continuous Integration and Delivery to 20 Open-Source Cyber-Physical Systems. 646-657 - Justus Bogner, Manuel Merkel:
To Type or Not to Type? A Systematic Comparison of the Software Quality of JavaScript and TypeScript Applications on GitHub. 658-669 - Masateru Tsunoda, Akito Monden, Koji Toda, Amjed Tahir, Kwabena Ebo Bennin, Keitaro Nakasai, Masataka Nagura, Kenichi Matsumoto:
Using Bandit Algorithms for Selecting Feature Reduction Techniques in Software Defect Prediction. 670-681 - Yoshiki Higo, Shinsuke Matsumoto, Shinji Kusumoto, Kazuya Yasuda:
Constructing Dataset of Functionally Equivalent Java Methods Using Automated Test Generation Techniques. 682-686 - Yegor Bugayenko, Kirill Daniakin, Mirko Farina, Firas Jolha, Artem V. Kruglov, Giancarlo Succi, Witold Pedrycz:
Extracting Corrective Actions from Code Repositories. 687-688 - Eman Abdullah AlOmar, Moataz Chouchen, Mohamed Wiem Mkaouer, Ali Ouni:
Code Review Practices for Refactoring Changes: An Empirical Study on OpenStack. 689-701 - Bruno Luan de Sousa, Mariza A. S. Bigonha, Kecia A. M. Ferreira, Glaura C. Franco:
A Time Series-Based Dataset of Open-Source Software Evolution. 702-706 - Vali Tawosi, Afnan A. Al-Subaihin, Rebecca Moussa, Federica Sarro:
A Versatile Dataset of Agile Open Source Software Projects. 707-711 - Viktor Csuvik, László Vidács:
FixJS: A Dataset of Bug-fixing JavaScript Commits. 712-716 - Sourya Dey, Walt Woods:
LAGOON: An Analysis Tool for Open Source Communities. 717-721 - Yegor Bugayenko, Ayomide Bakare, Arina Cheverda, Mirko Farina, Artem V. Kruglov, Yaroslav Plaksin, Giancarlo Succi, Witold Pedrycz:
Automatically Prioritizing and Assigning Tasks from Code Repositories in Puzzle Driven Development. 722-723 - Mairieli Santos Wessel, Marco Aurélio Gerosa, Emad Shihab:
Software Bots in Software Engineering: Benefits and Challenges. 724-725 - Natarajan Chidambaram, Pooya Rostami Mazrae:
Bot Detection in GitHub Repositories. 726-728 - Niranjan Hasabnis:
GitRank: A Framework to Rank GitHub Repositories. 729-731 - Willem Meijer, David Visscher, Erwin de Haan, Merijn Schröder, Leon Visscher, Andrea Capiluppi, Ioan Botez:
Maintenance and Evolution: GrimoireLab Graal. 732-734 - James Walden:
OpenSSL 3.0.0: An exploratory case study. 735-737 - Carlos Gavidia-Calderon, DongGyun Han, Amel Bennaceur:
Quid Pro Quo: An Exploration of Reciprocity in Code Review. 738-740 - Kalvin Eng, Hareem Sahar:
Replicating Data Pipelines with GrimoireLab. 741-743 - Zhengyi Qiu, Shudi Shao, Qi Zhao, Hassan Ali Khan, Xinning Hui, Guoliang Jin:
A Deep Study of the Effects and Fixes of Server-Side Request Races in Web Applications. 744-756 - Stefano Zacchiroli:
A Large-scale Dataset of (Open Source) License Text Variants. 757-761 - Gökalp Demirci, Vijayaraghavan Murali, Imad Ahmad, Rajeev Rao, Gareth Ari Aye:
Detecting Privacy-Sensitive Code Changes with Language Modeling. 761-762 - Sofia Reis, Rui Abreu, Hakan Erdogmus, Corina S. Pasareanu:
SECOM: Towards a convention for security commit messages. 764-765 - Saurabh Pujar, Yunhui Zheng, Luca Buratti, Burn L. Lewis, Alessandro Morari, Jim Laredo, Kevin Postlethwait, Christoph Görn:
Varangian: A Git Bot for Augmented Static Analysis. 766-767
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.