default search action
18th MSR 2021: Madrid, Spain
- 18th IEEE/ACM International Conference on Mining Software Repositories, MSR 2021, Madrid, Spain, May 17-19, 2021. IEEE 2021, ISBN 978-1-7281-8710-5
Technical Papers
- Huy Tu, George Papadimitriou, Mariam Kiran, Cong Wang, Anirban Mandal, Ewa Deelman, Tim Menzies:
Mining Workflows for Anomalous Data Transfers. 1-12 - Egor Spirin, Egor Bogomolov, Vladimir Kovalenko, Timofey Bryksin:
PSIMiner: A Tool for Mining Rich Abstract Syntax Trees from Code. 13-17 - Ruchika Malhotra, Ritvik Kapoor, Deepti Aggarwal, Priya Garg:
Comparative Study of Feature Reduction Techniques in Software Change Prediction. 18-28 - Sofonias Yitagesu, Xiaowang Zhang, Zhiyong Feng, Xiaohong Li, Zhenchang Xing:
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions. 29-40 - Rolf-Helge Pfeiffer:
Identifying Critical Projects via PageRank and Truck Factor. 41-45 - Md. Abdullah Al Alamin, Sanjay Malakar, Gias Uddin, Sadia Afroz, Tameem Bin Haider, Anindya Iqbal:
An Empirical Study of Developer Discussions on Low-Code Software Development Challenges. 46-57 - Hendrig Sellik, Onno van Paridon, Georgios Gousios, Maurício Aniche:
Learning Off-By-One Mistakes: An Empirical Study. 58-67 - Ahmed Imam, Tapajit Dey, Alexander Nolte, Audris Mockus, James D. Herbsleb:
The Secret Life of Hackathon Code Where does it come from and where does it go? 68-79 - Christoph Gote, Christian Zingg:
gambit - An Open Source Name Disambiguation Tool for Version Control Systems. 80-84 - Samuel W. Flint, Jigyasa Chauhan, Robert Dyer:
Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data. 85-96 - Jiayan Pei, Yimin Wu, Zishan Qin, Yao Cong, Jingtao Guan:
Attention-based model for predicting question relatedness on Stack Overflow. 97-107 - Matteo Ciniselli, Nathan Cooper, Luca Pascarella, Denys Poshyvanyk, Massimiliano Di Penta, Gabriele Bavota:
An Empirical Study on the Usage of BERT Models for Code Completion. 108-119 - Quentin Fournier, Daniel Aloise, Seyed Vahid Azhari, François Tetreault:
On Improving Deep Learning Trace Analysis with System Call Arguments. 120-130 - Zhen Yu Ding, Claire Le Goues:
An Empirical Study of OSS-Fuzz Bugs. 131-142 - Jeanderson Cândido, Jan Haesen, Maurício Aniche, Arie van Deursen:
An Exploratory Study of Log Placement Recommendation in an Enterprise System. 143-154 - Sina Gholamian, Paul A. S. Ward:
On the Naturalness and Localness of Software Logs. 155-166 - Mia Mohammad Imran, Agnieszka Ciborowska, Kostadin Damevski:
Automatically Selecting Follow-up Questions for Deficient Bug Reports. 167-178 - Alexandra-Maria Chaniotaki, Tushar Sharma:
Architecture Smells and Pareto Principle: A Preliminary Empirical Exploration. 190-194 - Zadia Codabux, Melina C. Vidoni, Fatemeh H. Fard:
Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study. 195-206 - Diego Marcilio, Carlo A. Furia:
How Java Programmers Test Exceptional Behavior. 207-218 - Guillaume Haben, Sarra Habchi, Mike Papadakis, Maxime Cordy, Yves Le Traon:
A Replication Study on the Usability of Code Vocabulary in Predicting Flaky Tests. 219-229 - Golnaz Gharachorlu, Nick Sumner:
Leveraging Models to Reduce Test Cases in Software Repositories. 230-241 - Jean-Gabriel Young, Amanda Casari, Katie McLaughlin, Milo Z. Trujillo, Laurent Hébert-Dufresne, James P. Bagrow:
Which contributions count? Analysis of attribution in open source. 242-253 - Mahmoud Alfadel, Diego Elias Costa, Emad Shihab, Mouafak Mkhallalati:
On the Use of Dependabot Security Pull Requests. 254-265 - Aleksandr Khvorov, Roman Vasiliev, George A. Chernishev, Irving Muller Rodrigues, Dmitrij V. Koznov, Nikita Povarov:
S3M: Siamese Stack (Trace) Similarity Measure. 266-270 - Gian Luca Scoccia, Patrizio Migliarini, Marco Autili:
Challenges in Developing Desktop Web Apps: a Study of Stack Overflow and GitHub. 271-282 - Saraj Singh Manes, Olga Baysal:
Studying the Change Histories of Stack Overflow and GitHub Snippets. 283-294 - Nikolai Sviridov, Mikhail Evtikhiev, Vladimir Kovalenko:
TNM: A Tool for Mining of Socio-Technical Data from Git Repositories. 295-299 - Ivano Malavolta, Katerina Chinnappan, Stan Swanborn, Grace A. Lewis, Patricia Lago:
Mining the ROS ecosystem for Green Architectural Tactics in Robotics and an Empirical Evaluation. 300-311 - Andreas Schuler, Gabriele Kotsis:
Mining API Interactions to Analyze Software Revisions for the Evolution of Energy Consumption. 312-316 - André C. Hora:
Googling for Software Development: What Developers Search For and What They Find. 317-328 - Alexey Svyatkovskiy, Sebastian Lee, Anna Hadjitofi, Maik Riechert, Juliana Vicente Franco, Miltiadis Allamanis:
Fast and Memory-Efficient Neural Code Completion. 329-340 - Ahmed Zerouali, Camilo Velázquez-Rodríguez, Coen De Roover:
Identifying Versions of Libraries used in Stack Overflow Code Snippets. 341-345 - Fabio Santos, Igor Wiese, Bianca Trinkenreich, Igor Steinmacher, Anita Sarma, Marco Aurélio Gerosa:
Can I Solve It? Identifying APIs Required to Complete OSS Tasks. 346-257 - Murali Sridharan, Mika Mäntylä, Leevi Rantala, Maëlick Claes:
Data Balancing Improves Self-Admitted Technical Debt Detection. 358-368 - Chanathip Pornprasit, Chakkrit Tantithamthavorn:
JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect Prediction. 369-379 - Saikat Mondal, Gias Uddin, Chanchal K. Roy:
Rollback Edit Inconsistencies in Developer Forum. 380-391 - André C. Hora:
What Code Is Deliberately Excluded from Test Coverage and Why? 392-402 - Gianmarco Fucci, Nathan Cassee, Fiorella Zampetti, Nicole Novielli, Alexander Serebrenik, Massimiliano Di Penta:
Waiting around or job half-done? Sentiment in self-admitted technical debt. 403-414 - Maria Papoutsoglou, Johannes Wachs, Georgia M. Kapitsaki:
Mining DEV for social and technical insights about software development. 415-419 - Timothy Kinsman, Mairieli Santos Wessel, Marco Aurélio Gerosa, Christoph Treude:
How Do Software Developers Use GitHub Actions to Automate Their Workflows? 420-431 - Jirayus Jiarpakdee, Chakkrit Tantithamthavorn, John C. Grundy:
Practitioners' Perceptions of the Goals and Visual Explanations of Defect Prediction Models. 432-443 - Panyawut Sri-Iesaranusorn, Raula Gaikovina Kula, Takashi Ishio:
Does Code Review Promote Conformance? A Study of OpenStack Patches. 444-448 - Kalvin Eng, Abram Hindle:
Revisiting Dockerfiles in Open Source Software Over Time. 449-459 - Mahfouth Alghamdi, Shinpei Hayashi, Takashi Kobayashi, Christoph Treude:
Characterising the Knowledge about Primitive Variables in Java Code Comments. 460-470 - Anderson G. Uchôa, Caio Barbosa, Daniel Coutinho, Willian Nalepa Oizumi, Wesley K. G. Assunção, Silvia Regina Vergilio, Juliana Alves Pereira, Anderson Oliveira, Alessandro F. Garcia:
Predicting Design Impactful Changes in Modern Code Review: A Large-Scale Empirical Study. 471-482 - Michel Albonico, Ivano Malavolta, Gustavo Pinto, Emitza Guzman, Katerina Chinnappan, Patricia Lago:
Mining Energy-Related Practices in Robotics Software. 483-494
MSR Challenge
- Balázs Mosolygó, Norbert Vándor, Gábor Antal, Péter Hegedüs:
On the Rise and Fall of Simple Stupid Bugs: a Life-Cycle Analysis of SStuBs. 495-499 - Jasmine Latendresse, Rabe Abdalkareem, Diego Elias Costa, Emad Shihab:
How Effective is Continuous Integration in Indicating Single-Statement Bugs? 500-504 - Ehsan Mashhadi, Hadi Hemmati:
Applying CodeBERT for Automated Program Repair of Java Simple Bugs. 505-509 - Fernanda Madeiral, Thomas Durieux:
A large-scale study on human-cloned changes for automated program repair. 510-514 - Wenhan Zhu, Michael W. Godfrey:
Mea culpa: How developers fix their own simple bugs differently from other developers. 515-519 - Arthur V. Kamienski, Luisa Palechor, Cor-Paul Bezemer, Abram Hindle:
PySStuBs: Characterizing Single-Statement Bugs in Popular Open-Source Python Projects. 520-524 - Anthony Peruma, Christian D. Newman:
On the Distribution of "Simple Stupid Bugs" in Unit Test Files: An Exploratory Study. 525-529 - Jiayi Hua, Haoyu Wang:
On the Effectiveness of Deep Vulnerability Detectors to Simple Stupid Bug Detection. 530-534
MSR Data
- Sebastian Nielebock, Paul Blockhaus, Jacob Krüger, Frank Ortmeier:
AndroidCompass: A Dataset of Android Compatibility Checks in Code Repositories. 535-539 - Misoo Kim, Youngkyoung Kim, Eunseok Lee:
Denchmark: A Bug Benchmark of Deep Learning-related Software. 540-544 - Thomas Durieux, César Soto-Valero, Benoit Baudry:
Duets: A Dataset of Reproducible Pairs of Java Library-Clients. 545-549 - Luigi Quaranta, Fabio Calefato, Filippo Lanubile:
KGTorrent: A Dataset of Python Jupyter Notebooks from Kaggle. 550-554 - Mouna Hammoudi, Christoph Mayr-Dorn, Atif Mashkoor, Alexander Egyed:
A Traceability Dataset for Open Source Systems. 555-559 - Ozren Dabic, Emad Aghajani, Gabriele Bavota:
Sampling Projects in GitHub for MSR Studies. 560-564 - Nafise Eskandani, Guido Salvaneschi:
The Wonderless Dataset for Serverless Computing. 565-569 - Wen Li, Xiaoqin Fu, Haipeng Cai:
AndroCT: Ten Years of App Call Traces in Android. 570-574 - Nikitha Rao, Chetan Bansal, Joe Guan:
Search4Code: Code Search Intent Classification Using Weak Supervision. 575-579 - Ruben Opdebeeck, Ahmed Zerouali, Coen De Roover:
Andromeda: A Dataset of Ansible Galaxy Roles and Their Evolution. 580-584 - Amir M. Mir, Evaldas Latoskinas, Georgios Gousios:
ManyTypes4Py: A Benchmark Python Dataset for Machine Learning-based Type Inference. 585-589 - Tushar Sharma, Marouane Kessentini:
QScored: A Large Dataset of Code Smells and Quality Metrics. 590-594 - Likang Yin, Zhiyuan Zhang, Qi Xuan, Vladimir Filkov:
Apache Software Foundation Incubator Project Sustainability Dataset. 595-599 - Tyler Wendland, Jingyang Sun, Junayed Mahmud, S. M. Hasan Mansur, Steven Huang, Kevin Moran, Julia Rubin, Mattia Fazzini:
Andror2: A Dataset of Manually-Reproduced Bug Reports for Android apps. 600-604 - Dheeraj Vagavolu, Vartika Agrahari, Sridhar Chimalakonda, Akhila Sri Manasa Venigalla:
GE526: A Dataset of Open-Source Game Engines. 605-609 - Sahar Badihi, Yi Li, Julia Rubin:
EqBench: A Dataset of Equivalent and Non-equivalent Program Pairs. 610-614
MSR Data Hackathon
- Ahmed Imam, Tapajit Dey:
Tracking Hackathon Code Creation and Reuse. 615-617 - Elena Lyulina, Mahmoud Jahanshahi:
Building the Collaboration Graph of Open-Source Software Ecosystem. 618-620 - David Reid, Kalvin Eng, Chris Bogart, Adam Tutko:
Tracing Vulnerable Code Lineage. 621-623 - James Walden, Noah Burgin, Kuljit Kaur:
An Exploratory Study of Project Activity Changepoints in Open Source Software Evolution. 624-626 - Mengchen Sam Yong, Lavinia Paganini, Huilian Sophie Qiu, José Bayoán Santiago Calderón:
The Diversity-Innovation Paradox in Open-Source Software. 627-629
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.