DOI: 10.1145/2889160.2892662

Candoia: a platform and ecosystem for mining software repositories tools

Published: 14 May 2016

Abstract

We introduce Candoia, a platform and ecosystem for building Mining Software Repositories (MSR) tools. The platform supports building MSR tools by providing the necessary tools and abstractions that hide the complex details of version control, bug databases, source code programming languages, and forges. The ecosystem lets researchers and practitioners easily share and access MSR apps. We have some initial evidence of Candoia's applicability in building robust MSR tools (over two dozen prebuilt apps in the first public release of Candoia), adoptability and interoperability (apps run on widely used projects such as Apache Tomcat and Apache Hadoop), and easy customizability (a user study). Candoia is available for download from http://candoia.org.

Published In

ICSE '16: Proceedings of the 38th International Conference on Software Engineering Companion
May 2016
946 pages
ISBN: 9781450342056
DOI: 10.1145/2889160
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Poster

Conference

ICSE '16
Acceptance Rates

Overall Acceptance Rate 276 of 1,856 submissions, 15%

Cited By

  • (2023) An Introduction to Software Ecosystems. Software Ecosystems, pp. 1-29. DOI: 10.1007/978-3-031-36060-2_1. Online publication date: 26-May-2023.
  • (2022) Story points changes in agile iterative development. Empirical Software Engineering, 27:6. DOI: 10.1007/s10664-022-10192-9. Online publication date: 10-Aug-2022.
  • (2022) Efficient GitHub Crawling Using the GraphQL API. Computational Science and Its Applications – ICCSA 2022 Workshops, pp. 662-677. DOI: 10.1007/978-3-031-10548-7_48. Online publication date: 26-Jul-2022.
  • (2021) World of code: enabling a research workflow for mining and analyzing the universe of open source VCS data. Empirical Software Engineering, 26:2. DOI: 10.1007/s10664-020-09905-9. Online publication date: 25-Feb-2021.
  • (2020) Software provenance tracking at the scale of public source code. Empirical Software Engineering. DOI: 10.1007/s10664-020-09828-5. Online publication date: 29-May-2020.
  • (2019) World of code. Proceedings of the 16th International Conference on Mining Software Repositories, pp. 143-154. DOI: 10.1109/MSR.2019.00031. Online publication date: 26-May-2019.
  • (2017) Candoia. Proceedings of the 14th International Conference on Mining Software Repositories, pp. 53-63. DOI: 10.1109/MSR.2017.56. Online publication date: 20-May-2017.
