MSR 2019: Montreal, Canada
- Margaret-Anne D. Storey, Bram Adams, Sonia Haiduc:
Proceedings of the 16th International Conference on Mining Software Repositories, MSR 2019, 26-27 May 2019, Montreal, Canada. IEEE / ACM 2019, ISBN 978-1-7281-3412-3
Representations for mining (part 1)
- Shayan A. Akbar, Avinash C. Kak:
SCOR: source code retrieval with semantics and order. 1-12
- Vladimir Kovalenko, Egor Bogomolov, Timofey Bryksin, Alberto Bacchelli:
PathMiner: a library for mining of path-based representations of code. 13-17
- Bart Theeten, Frederik Vandeputte, Tom Van Cutsem:
Import2vec learning embeddings for software libraries. 18-28
- Vasiliki Efstathiou, Diomidis Spinellis:
Semantic source code models using identifier embeddings. 29-33
Defect prediction and testing (part 1)
- Thong Hoang, Hoa Khanh Dam, Yasutaka Kamei, David Lo, Naoyasu Ubayashi:
DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction. 34-45
- Hoa Khanh Dam, Trang Pham, Shien Wee Ng, Truyen Tran, John C. Grundy, Aditya Ghose, Taeksu Kim, Chul-Joo Kim:
Lessons learned from using a deep tree-based model for software defect prediction in practice. 46-57
- Max Kiehn, Xiangyi Pan, Fatih Camci:
Empirical study in using version histories for change risk classification. 58-62
- Aalok Ahluwalia, Davide Falessi, Massimiliano Di Penta:
Snoring: a noise in defect prediction datasets. 63-67
Representations for mining (part 2)
- Eeshita Biswas, K. Vijay-Shanker, Lori L. Pollock:
Exploring word embedding techniques to improve sentiment analysis of software engineering texts. 68-78
- Musfiqur Rahman, Peter C. Rigby, Dharani Palani, Tien N. Nguyen:
Cleaning StackOverflow for machine translation. 79-83
- Christoph Treude, Markus Wagner:
Predicting good configurations for GitHub and stack overflow topic models. 84-95
Defect prediction and testing (part 2)
- Anna-Katharina Wickert, Michael Reif, Michael Eichberg, Anam Dodhy, Mira Mezini:
A dataset of parametric cryptographic misuses. 96-100
- Adithya Raghuraman, Truong Ho-Quang, Michel R. V. Chaudron, Alexander Serebrenik, Bogdan Vasilescu:
Does UML modeling associate with lower defect proneness?: a preliminary empirical investigation. 101-104
- Stanislav Chren, Radoslav Micko, Barbora Buhnova, Bruno Rossi:
STRAIT: a tool for automated software reliability growth analysis. 105-110
- Dirk Beyer:
A data set of program invariants and error paths. 111-115
- Hongyu Zhai, Casey Casalnuovo, Premkumar T. Devanbu:
Test coverage in python programs. 116-120
- Domenico Serra, Giovanni Grano, Fabio Palomba, Filomena Ferrucci, Harald C. Gall, Alberto Bacchelli:
On the effectiveness of manual and automatic unit test generation: ten years later. 121-125
Large-scale mining
- Dimitris Mitropoulos, Panos Louridas, Vitalis Salis, Diomidis Spinellis:
Time present and time past: analyzing the evolution of JavaScript code in the wild. 126-137
- Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli:
The software heritage graph dataset: public software development under one roof. 138-142
- Yuxing Ma, Chris Bogart, Sadika Amreen, Russell Zaretzki, Audris Mockus:
World of code: an infrastructure for mining the universe of open source VCS data. 143-154
- Dimitris S. Kolovos, Patrick Neubauer, Konstantinos Barmpis, Nicholas Matragkas, Richard F. Paige:
Crossflow: a framework for distributed mining of software repositories. 155-159
Energy and economics
- Wellington Oliveira, Renato Oliveira, Fernando Castor, Benito Fernandes, Gustavo Pinto:
Recommending energy-efficient Java collections. 160-170
- Hugo Matalonga, Bruno Cabral, Fernando Castor, Marco Couto, Rui Pereira, Simão Melo de Sousa, João Paulo Fernandes:
GreenHub farmer: real-world data for Android energy mining. 171-175
- Rui Rua, Marco Couto, João Saraiva:
GreenSource: a large-scale collection of Android code, tests and energy metrics. 176-180
- Asher Trockman, Rijnard van Tonder, Bogdan Vasilescu:
Striking gold in software repositories?: an econometric study of cryptocurrencies on GitHub. 181-185
- Rijnard van Tonder, Asher Trockman, Claire Le Goues:
A panel data set of cryptocurrency development activity on GitHub. 186-190
Mining challenge
- Sebastian Baltes, Christoph Treude, Stephan Diehl:
SOTorrent: studying the origin, evolution, and usage of stack overflow code snippets. 191-194
- Uriel Campos, Guilherme Smethurst, João Pedro Moraes, Rodrigo Bonifácio, Gustavo Pinto:
Mining rule violations in JavaScript code snippets. 195-199
- Akond Rahman, Effat Farhana, Nasif Imtiaz:
Snakes in paradise?: insecure python-related coding practices in stack overflow. 200-204
- Jens Dietrich, Markus Luczak-Rösch, Elroy Dalefield:
Man vs machine: a study into language identification of stack overflow code snippets. 205-209
- Nikolaos Bafatakis, Niels Boecker, Wenjie Boon, Martin Cabello Salazar, Jens Krinke, Gazi Oznacar, Robert White:
Python coding style compliance on stack overflow. 210-214
- Themistoklis Diamantopoulos, Maria-Ioanna Sifaki, Andreas L. Symeonidis:
Towards mining answer edits to extract evolution patterns in stack overflow. 215-219
- Abhishek Soni, Sarah Nadi:
Analyzing comment-induced updates on stack overflow. 220-224
- Xianhao Jin, Francisco Servant:
What edits are done on the highly answered questions in stack overflow?: an empirical study. 225-229
- Durham Abric, Oliver E. Clark, Matthew Caminiti, Keheliya Gallaba, Shane McIntosh:
Can duplicate questions on stack overflow benefit the software development community? 230-234
- Saraj Singh Manes, Olga Baysal:
How often and what StackOverflow posts do developers reference in their GitHub projects? 235-239
- Manziba Akanda Nishi, Agnieszka Ciborowska, Kostadin Damevski:
Characterizing duplicate code snippets between stack overflow and tutorials. 240-244
- Nasif Imtiaz, Akond Rahman, Effat Farhana, Laurie A. Williams:
Challenges with responding to static analysis tool alerts. 245-249
- Mashal Ahmad, Mel Ó Cinnéide:
Impact of stack overflow code snippets on software cohesion: a preliminary study. 250-254
- Alan Bandeira, Carlos Alberto Medeiros, Matheus Paixão, Paulo Henrique M. Maia:
We need to talk about microservices: an analysis from the discussions on StackOverflow. 255-259
- Abdul Ali Bangash, Hareem Sahar, Shaiful Alam Chowdhury, Alexander William Wong, Abram Hindle, Karim Ali:
What do developers know about machine learning: a study of ML discussions on StackOverflow. 260-264
APIs & dependencies (part 1)
- Sven Amann, Hoan Anh Nguyen, Sarah Nadi, Tien N. Nguyen, Mira Mezini:
Investigating next steps in static API-misuse detection. 265-275
- João Eduardo Montandon, Luciana Lourdes Silva, Marco Túlio Valente:
Identifying experts in software libraries and frameworks among GitHub users. 276-287
- Simone Scalabrino, Gabriele Bavota, Mario Linares-Vásquez, Michele Lanza, Rocco Oliveto:
Data-driven solutions to detect API compatibility issues in Android: an empirical study. 288-298
Automatic summarization
- Qin Liu, Zihe Liu, Hongming Zhu, Hongfei Fan, Bowen Du, Yu Qian:
Generating commit messages from diffs using pointer-generator network. 299-309
- Anwar Alqaimi, Patanamon Thongtanunam, Christoph Treude:
Automatically generating documentation for lambda expressions in Java. 310-320
- Shaohua Wang, NhatHai Phan, Yan Wang, Yong Zhao:
Extracting API tips from developer question and answer websites. 321-332
APIs & dependencies (part 2)
- César Soto-Valero, Amine Benelallam, Nicolas Harrand, Olivier Barais, Benoit Baudry:
The emergence of software diversity in maven central. 333-343
- Amine Benelallam, Nicolas Harrand, César Soto-Valero, Benoit Baudry, Olivier Barais:
The maven dependency graph: a temporal graph-based representation of maven central. 344-348
- Jens Dietrich, David J. Pearce, Jacob Stringer, Amjed Tahir, Kelly Blincoe:
Dependency versioning in the wild. 349-359
- Anderson S. Matos, João Bosco Ferreira Filho, Lincoln S. Rocha:
Splitting APIs: an exploratory study of software unbundling. 360-370
Security
- Triet Huynh Minh Le, Bushra Sabir, Muhammad Ali Babar:
Automated software vulnerability assessment with concept drift. 371-382
- Serena Elisa Ponta, Henrik Plate, Antonino Sabetta, Michele Bezzi, Cédric Dangremont:
A manually-curated dataset of fixes to vulnerabilities of open-source software. 383-387
- Jun Gao, Pingfan Kong, Li Li, Tegawendé F. Bissyandé, Jacques Klein:
Negative results on mining crypto-API usage rules in Android apps. 388-398
- Aida Radu, Sarah Nadi:
A dataset of non-functional bugs. 399-403
- Haoyu Wang, Junjun Si, Hao Li, Yao Guo:
RmvDroid: towards a reliable Android malware dataset with app metadata. 404-408
Collaboration & communication (part 1)
- Jiaxin Zhu, Jun Wei:
An empirical study of multiple names and email addresses in OSS version control repositories. 409-420
- Reed Milewicz, Gustavo Pinto, Paige Rodeghero:
Characterizing the roles of contributors in open-source scientific software projects. 421-432
- Christoph Gote, Ingo Scholtes, Frank Schweitzer:
git2net: mining time-stamped co-editing networks from large git repositories. 433-444
Software quality (part 1)
- Sarra Habchi, Naouel Moha, Romain Rouvoy:
The rise of Android code smells: who is to blame? 445-456
- Jonas De Bleser, Dario Di Nucci, Coen De Roover:
Assessing diffusion and perception of test smells in scala projects. 457-467
- Vadim Markovtsev, Waren Long, Hugo Mougard, Konstantin Slavnov, Egor Bulychev:
STYLE-ANALYZER: fixing code style inconsistencies with interpretable unsupervised algorithms. 468-478
Collaboration & communication (part 2)
- Saikat Mondal, Mohammad Masudur Rahman, Chanchal K. Roy:
Can issues reported at stack overflow questions be reproduced?: an exploratory study. 479-489
- Preetha Chatterjee, Kostadin Damevski, Lori L. Pollock, Vinay Augustine, Nicholas A. Kraft:
Exploratory study of slack Q&A chats as a mining source for software engineering tools. 490-501
- Junichi Hayashi, Yoshiki Higo, Shinsuke Matsumoto, Shinji Kusumoto:
Impacts of daylight saving time on software development. 502-506
Software quality (part 2)
- João Felipe Pimentel, Leonardo Murta, Vanessa Braganholo, Juliana Freire:
A large-scale study about quality and reproducibility of jupyter notebooks. 507-517
- Daniel Perez, Shigeru Chiba:
Cross-language clone detection by learning over abstract syntax trees. 518-528
- Marius Kamp, Patrick Kreutzer, Michael Philippsen:
SeSaMe: a data set of semantically similar Java methods. 529-533
Traceability
- Aidan Z. H. Yang, Daniel Alencar da Costa, Ying Zou:
Predicting co-changes between functionality specifications and source code in behavior driven development. 534-544
- Daan Schipper, Maurício Finavaro Aniche, Arie van Deursen:
Tracing back log data to its log statement: from research to practice. 545-549
- Junnosuke Matsumoto, Yoshiki Higo, Shinji Kusumoto:
Beyond GumTree: a hybrid approach to generate edit scripts. 550-554
- Ryosuke Funaki, Shinpei Hayashi, Motoshi Saeki:
The impact of systematic edits in history slicing. 555-559
- Moein Owhadi-Kareshk, Sarah Nadi:
Scalable software merging studies with MergAnser. 560-564
Building on data
- Zoe Kotti, Diomidis Spinellis:
Standing on shoulders or feet?: the usage of the MSR data papers. 565-576
- Sumon Biswas, Md Johirul Islam, Yijia Huang, Hridesh Rajan:
Boa meets python: a boa dataset of data science software in python language. 577-581
- Oliviero Riganelli, Marco Mobilio, Daniela Micucci, Leonardo Mariani:
A benchmark of data loss bugs for Android apps. 582-586
- Saket Dattatray Joshi, Sridhar Chimalakonda:
RapidRelease: a dataset of projects and issues on github with rapid releases. 587-591
- Ahmed Zerouali, Valerio Cosentino, Gregorio Robles, Jesús M. González-Barahona, Tom Mens:
ConPan: a tool to analyze packages in software containers. 592-596
- Gian Luca Scoccia, Anthony Peruma, Virginia Pujols, Ben Christians, Daniel E. Krutz:
An empirical history of permission requests and mistakes in open source Android apps. 597-601