Proceedings of the 1st International Workshop on Natural Language-based Software Engineering

NLBSE '22: Proceedings of the 1st International Workshop on Natural Language-based Software Engineering

May 2022

2022 Proceeding

Conference Chairs:
Andrea Di Sorbo
University of Sannio, Benevento, Italy
,
Sebastiano Panichella
Zurich University of Applied Sciences, Zurich, Switzerland

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

ICSE '22: 44th International Conference on Software Engineering Pittsburgh Pennsylvania 21 May 2022

ISBN:

978-1-4503-9343-0

Published:

01 February 2023

Sponsors:

SIGSOFT

In-Cooperation:

IEEE CS

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Next Conference

ICSE 2025

2025 IEEE/ACM 46th International Conference on Software Engineering

April 26 - May 3, 2025

Ottawa , ON , Canada

ICSE 2025 website

Reflects downloads up to 22 Feb 2025Bibliometrics

Citation Count

Downloads (6 weeks)

203

Downloads (12 months)

1,806

Downloads (cumulative)

3,132

Sections

NLBSE '22: Proceedings of the 1st International Workshop on Natural Language-based Software Engineering

2022

Previous Next

Skip Abstract Section

Abstract

Welcome to the 1st edition of the International Workshop on Natural Language-Based Software Engineering (NLBSE). The potential of Natural Language Processing (NLP) and Natural Language Generation (NLG) to support developers and engineers in a wide number of software engineering-related tasks (e.g., requirements engineering, extraction of knowledge and patterns from the software artifacts, summarization and prioritization of development and maintenance activities, etc.) is increasingly evident. Furthermore, the current availability of libraries (e.g., NLTK, CoreNLP, and fasttext) and models (e.g., BERT) that allow efficiently and easily dealing with low-level aspects of natural language processing and representation, pushed more and more researchers to closely work with industry to attempt to solve software engineers' real-world problems.

Proceeding Downloads

PDFFront matter (Title page, Contents, Message from the chairs, Program committee, Author index)

Skip Table Of Content Section

Select All

Export Citations Save to Binder

research-article

Unsupervised extreme multi label classification of stack overflow posts

Peter Devine,
Kelly Blincoe

Pages 1–8https://doi.org/10.1145/3528588.3528652

Knowing the topics of a software forum post, such as those on StackOverflow, allows for greater analysis and understanding of the large amounts of data that come from these communities. One approach to this problem is using extreme multi label ...

- 8
- 103
Metrics
Total Citations8
Total Downloads103
Last 12 Months44
Last 6 weeks9

Abstract
Get Access

research-article

Public Access

Understanding digits in identifier names: an exploratory study

Anthony Peruma,
Christian D. Newman

Pages 9–16https://doi.org/10.1145/3528588.3528657

Before any software maintenance can occur, developers must read the identifier names found in the code to be maintained. Thus, high-quality identifier names are essential for productive program comprehension and maintenance activities. With developers ...

- 1
- 213
Metrics
Total Citations1
Total Downloads213
Last 12 Months132
Last 6 weeks23

Abstract
View online with eReader
PDF

short-paper

From zero to hero: generating training data for question-to-cypher models

Dominik Opitz,
Nico Hochgeschwender

Pages 17–20https://doi.org/10.1145/3528588.3528655

Graph databases employ graph structures such as nodes, attributes and edges to model and store relationships among data. To access this data, graph query languages (GQL) such as Cypher are typically used, which might be difficult to master for end-...

- 1
- 93
Metrics
Total Citations1
Total Downloads93
Last 12 Months39
Last 6 weeks2

Abstract
Get Access

short-paper

Automatic identification of informative code in stack overflow posts

Preetha Chatterjee

Pages 21–24https://doi.org/10.1145/3528588.3528656

Despite Stack Overflow's popularity as a resource for solving coding problems, identifying relevant information from an individual post remains a challenge. The overload of information in a post can make it difficult for developers to identify specific ...

- 1
- 41
Metrics
Total Citations1
Total Downloads41
Last 12 Months6
Last 6 weeks0

Abstract
Get Access

short-paper

Public Access

NLBSE'22 tool competition

Rafael Kallis,
Oscar Chaparro,
Andrea Di Sorbo,
Sebastiano Panichella

Pages 25–28https://doi.org/10.1145/3528588.3528664

We report on the organization and results of the first edition of the Tool Competition from the International Workshop on Natural Language-based Software Engineering (NLBSE'22). This year, five teams submitted multiple classification models to ...

- 7
- 221
Metrics
Total Citations7
Total Downloads221
Last 12 Months145
Last 6 weeks22

Abstract
View online with eReader
PDF

short-paper

Issue report classification using pre-trained language models

Giuseppe Colavito,
Filippo Lanubile,
Nicole Novielli

Pages 29–32https://doi.org/10.1145/3528588.3528659

This paper describes our participation in the tool competition organized in the scope of the 1st International Workshop on Natural Language-based Software Engineering. We propose a supervised approach relying on fine-tuned BERT-based language models for ...

- 7
- 128
Metrics
Total Citations7
Total Downloads128
Last 12 Months55
Last 6 weeks4

Abstract
Get Access

short-paper

BERT-based GitHub issue report classification

Mohammed Latif Siddiq,
Joanna C. S. Santos

Pages 33–36https://doi.org/10.1145/3528588.3528660

Issue tracking is one of the integral parts of software development, especially for open source projects. GitHub, a commonly used software management tool, provides its own issue tracking system. Each issue can have various tags, which are manually ...

- 12
- 242
Metrics
Total Citations12
Total Downloads242
Last 12 Months135
Last 6 weeks14

Abstract
Get Access

short-paper

Predicting issue types with seBERT

Alexander Trautsch,
Steffen Herbold

Pages 37–39https://doi.org/10.1145/3528588.3528661

Pre-trained transformer models are the current state-of-the-art for natural language models processing. seBERT is such a model, that was developed based on the BERT architecture, but trained from scratch with software engineering data. We fine-tuned ...

- 9
- 62
Metrics
Total Citations9
Total Downloads62
Last 12 Months23
Last 6 weeks4

Abstract
Get Access

short-paper

GitHub issue classification using BERT-style models

Shikhar Bharadwaj,
Tushar Kadam

Pages 40–43https://doi.org/10.1145/3528588.3528663

Recent innovations in natural language processing techniques have led to the development of various tools for assisting software developers. This paper provides a report of our proposed solution to the issue report classification task from the NL-Based ...

- 7
- 166
Metrics
Total Citations7
Total Downloads166
Last 12 Months71
Last 6 weeks8

Abstract
Get Access

short-paper

Open Access

CatIss: an intelligent tool for categorizing issues reports using transformers

Maliheh Izadi

Pages 44–47https://doi.org/10.1145/3528588.3528662

Users use Issue Tracking Systems to keep track and manage issue reports in their repositories. An issue is a rich source of software information that contains different reports including a problem, a request for new features, or merely a question about ...

- 7
- 233
Metrics
Total Citations7
Total Downloads233
Last 12 Months139
Last 6 weeks24

Abstract
View online with eReader
PDF

short-paper

Open Access

On the evaluation of NLP-based models for software engineering

Maliheh Izadi,
Matin Nili Ahmadabadi

Pages 48–50https://doi.org/10.1145/3528588.3528665

NLP-based models have been increasingly incorporated to address SE problems. These models are either employed in the SE domain with little to no change, or they are greatly tailored to source code and its unique characteristics. Many of these approaches ...

- 6
- 340
Metrics
Total Citations6
Total Downloads340
Last 12 Months184
Last 6 weeks27

Abstract
View online with eReader
PDF

research-article

Identification of intra-domain ambiguity using transformer-based machine learning

Ambarish Moharil,
Arpit Sharma

Pages 51–58https://doi.org/10.1145/3528588.3528651

Recently, the application of neural word embeddings for detecting cross-domain ambiguities in software requirements has gained a significant attention from the requirements engineering (RE) community. Several approaches have been proposed in the ...

- 5
- 151
Metrics
Total Citations5
Total Downloads151
Last 12 Months80
Last 6 weeks5

Abstract
Get Access

research-article

Open Access

Can NMT understand me?: towards perturbation-based evaluation of NMT models for code generation

Pietro Liguori,
Cristina Improta,
Simona De Vivo,
Roberto Natella,
Bojan Cukic,
Domenico Cotroneo

Pages 59–66https://doi.org/10.1145/3528588.3528653

Neural Machine Translation (NMT) has reached a level of maturity to be recognized as the premier method for the translation between different languages and aroused interest in different research areas, including software engineering. A key step to ...

- 1
- 187
Metrics
Total Citations1
Total Downloads187
Last 12 Months80
Last 6 weeks8

Abstract
View online with eReader
PDF

research-article

Open Access

Supporting systematic literature reviews using deep-learning-based language models

Rand Alchokr,
Manoj Borkar,
Sharanya Thotadarya,
Gunter Saake,
Thomas Leich

Pages 67–74https://doi.org/10.1145/3528588.3528658

Background: Systematic Literature Reviews are an important research method for gathering and evaluating the available evidence regarding a specific research topic. However, the process of conducting a Systematic Literature Review manually can be ...

- 1
- 709
Metrics
Total Citations1
Total Downloads709
Last 12 Months528
Last 6 weeks33

Abstract
View online with eReader
PDF

short-paper

Open Access

Story point level classification by text level graph neural network

Hung Phan,
Ali Jannesari

Pages 75–78https://doi.org/10.1145/3528588.3528654

Estimating the software projects' efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research ...

- 5
- 188
Metrics
Total Citations5
Total Downloads188
Last 12 Months118
Last 6 weeks19

Abstract
View online with eReader
PDF

Save to Binder

Create a New Binder

Name

Contributors

Andrea Di Sorbo
University of Sannio
- Publication Years2019 - 2019
- Publication counts1
- Citation count21
- Available for Download0
- Downloads (cumulative)3,546
- Downloads (12 months)2,151
- Downloads (6 weeks)162
- Average Downloads per Article0
- Average Citation per Article21
View Full Profile
Sebastiano Panichella
University of Bern
- Publication Years2009 - 2025
- Publication counts70
- Citation count1,355
- Available for Download40
- Downloads (cumulative)22,229
- Downloads (12 months)8,282
- Downloads (6 weeks)918
- Average Downloads per Article556
- Average Citation per Article19
View Full Profile

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Summary of the 1st Natural Language-based Software Engineering Workshop (NLBSE 2022)

Natural language processing (NLP) refers to automatic computa- tional processing of human language, including both algorithms that take human-produced text as input and algorithms that pro- duce natural-looking text as outputs. There is a widespread and ...
RoSE '18: Proceedings of the 1st International Workshop on Robotics Software Engineering
BotSE '19: Proceedings of the 1st International Workshop on Bots in Software Engineering

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Proceeding Downloads

Save to Binder

Recommendations

Summary of the 1st Natural Language-based Software Engineering Workshop (NLBSE 2022)

RoSE '18: Proceedings of the 1st International Workshop on Robotics Software Engineering

BotSE '19: Proceedings of the 1st International Workshop on Bots in Software Engineering