Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
UniGM: Unifying Multiple Pre-trained Graph Models via Adaptive Knowledge Aggregation
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 8556–8565https://doi.org/10.1145/3664647.3681018Recent years have witnessed remarkable advances in graph representation learning using Graph Neural Networks (GNNs). To fully exploit the unlabeled graphs, researchers pre-train GNNs on large-scale graph databases and then fine-tune these pre-trained <u>...
- research-articleOctober 2024
Coding-PTMs: How to Find Optimal Code Pre-trained Models for Code Embedding in Vulnerability Detection?
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software EngineeringPages 1732–1744https://doi.org/10.1145/3691620.3695539Vulnerability detection is garnering increasing attention in software engineering, since code vulnerabilities possibly pose significant security. Recently, reusing various code pre-trained models (e.g., CodeBERT, CodeT5, and CodeGen) has become common ...
- research-articleOctober 2024
ChatBR: Automated assessment and improvement of bug report quality using ChatGPT
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software EngineeringPages 1472–1483https://doi.org/10.1145/3691620.3695518Bug reports, containing crucial information such as the Observed Behavior (OB), the Expected Behavior (EB), and the Steps to Reproduce (S2R), can help developers localize and fix bugs efficiently. However, due to the increasing complexity of some bugs ...
- ArticleSeptember 2024
Leveraging Transfer Learning for Article Segmentation in Historical Newspapers
Linking Theory and Practice of Digital LibrariesPages 222–238https://doi.org/10.1007/978-3-031-72437-4_13AbstractHistorical newspapers serve as invaluable resources for understanding past societies and preserving cultural heritage. However, digitizing these newspapers presents challenges due to their complex layouts and vast content. Article segmentation, ...
- research-articleAugust 2024
Understanding the Performance of AI Algorithms in Text-Based Emotion Detection for Conversational Agents
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 8Article No.: 121, Pages 1–26https://doi.org/10.1145/3643133Current industry trends demand automation in every aspect, where machines could replace humans. Recent advancements in conversational agents have grabbed a lot of attention from industries, markets, and businesses. Building conversational agents that ...
-
- research-articleJuly 2024
C-Pack: Packed Resources For General Chinese Embeddings
SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 641–649https://doi.org/10.1145/3626772.3657878We introduce C-Pack, a package of resources that significantly advances the field of general text embeddings for Chinese. C-Pack includes three critical resources. 1) C-MTP is a massive training dataset for text embedding, which is based on the curation ...
- research-articleJune 2024
RAPID: Zero-Shot Domain Adaptation for Code Search with Pre-Trained Models
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 5Article No.: 128, Pages 1–35https://doi.org/10.1145/3641542Code search, which refers to the process of identifying the most relevant code snippets for a given natural language query, plays a crucial role in software maintenance. However, current approaches heavily rely on labeled data for training, which results ...
- short-paperAugust 2024
Few-Shot Issue Report Classification with Adapters
NLBSE '24: Proceedings of the Third ACM/IEEE International Workshop on NL-based Software EngineeringPages 41–44https://doi.org/10.1145/3643787.3648039The automation of the classification of issue reports helps to improve the efficiency of the software tracking cycle. This task can be considered a multi-class classification problem. Therefore, this work is a participation in the NLBSE'24 Issue Report ...
- research-articleJune 2024
How do Hugging Face Models Document Datasets, Bias, and Licenses? An Empirical Study
- Federica Pepe,
- Vittoria Nardone,
- Antonio Mastropaolo,
- Gabriele Bavota,
- Gerardo Canfora,
- Massimiliano Di Penta
ICPC '24: Proceedings of the 32nd IEEE/ACM International Conference on Program ComprehensionPages 370–381https://doi.org/10.1145/3643916.3644412Pre-trained Machine Learning (ML) models help to create ML-intensive systems without having to spend conspicuous resources on training a new model from the ground up. However, the lack of transparency for such models could lead to undesired consequences ...
- short-paperMay 2024
Multi-step Automated Generation of Parameter Docstrings in Python: An Exploratory Study
ICSE-Companion '24: Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion ProceedingsPages 356–357https://doi.org/10.1145/3639478.3643110Documentation debt hinders the effective utilisation of open-source software. Although code summarisation tools have been helpful for developers, most would prefer a detailed account of each parameter in a function rather than a high-level summary. ...
- research-articleMarch 2024
Representation Learning for Stack Overflow Posts: How Far Are We?
- Junda He,
- Xin Zhou,
- Bowen Xu,
- Ting Zhang,
- Kisub Kim,
- Zhou Yang,
- Ferdian Thung,
- Ivana Clairine Irsan,
- David Lo
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 3Article No.: 69, Pages 1–24https://doi.org/10.1145/3635711The tremendous success of Stack Overflow has accumulated an extensive corpus of software engineering knowledge, thus motivating researchers to propose various solutions for analyzing its content. The performance of such solutions hinges significantly on ...
- research-articleMarch 2024
IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 18–27https://doi.org/10.1145/3616855.3635849Language models such as Bidirectional Encoder Representations from Transformers (BERT) have been very effective in various Natural Language Processing (NLP) and text mining tasks including text classification. However, some tasks still pose challenges ...
- research-articleMarch 2024
Pre-trained Recommender Systems: A Causal Debiasing Perspective
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 424–433https://doi.org/10.1145/3616855.3635779Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully ...
- research-articleFebruary 2024
Toward Automatically Completing GitHub Workflows
ICSE '24: Proceedings of the IEEE/ACM 46th International Conference on Software EngineeringArticle No.: 13, Pages 1–12https://doi.org/10.1145/3597503.3623351Continuous integration and delivery (CI/CD) are nowadays at the core of software development. Their benefits come at the cost of setting up and maintaining the CI/CD pipeline, which requires knowledge and skills often orthogonal to those entailed in ...
- research-articleMay 2024
I2R: Intra and inter-modal representation learning for code search
Code search, which locates code snippets in large code repositories based on natural language queries entered by developers, has become increasingly popular in the software development process. It has the potential to improve the efficiency of software ...
- research-articleSeptember 2024
Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
ASE '23: Proceedings of the 38th IEEE/ACM International Conference on Automated Software EngineeringPages 585–597https://doi.org/10.1109/ASE56229.2023.00103Upon evolving their software, organizations and individual developers have to spend a substantial effort to pay back technical debt, i.e., the fact that software is released in a shape not as good as it should be, e.g., in terms of functionality, ...
- research-articleOctober 2023
Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 9546–9550https://doi.org/10.1145/3581783.3612862Automatic emotion recognition has a wide range of applications in human-computer interaction. In this paper, we present our work in the Multimodel Emotion Recognition (MER) 2023, which contains three sub-challenges: MER-MULTI, MER-NOISE, and MER-SEMI. We ...
- research-articleJuly 2023
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 569–579https://doi.org/10.1145/3539618.3591670Numerous pre-training techniques for visual document understanding (VDU) have recently shown substantial improvements in performance across a wide range of document tasks. However, these pre-trained VDU models cannot guarantee continued success when the ...
- short-paperJune 2023
More Than Simply Masking: Exploring Pre-training Strategies for Symbolic Music Understanding
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 540–544https://doi.org/10.1145/3591106.3592237Pre-trained language models have become the prevailing approach for handling natural language processing tasks in recent years. Given the similarities in sequential features between symbolic music and natural language text, it is fairly logical to adopt ...
- research-articleMay 2023
What Is the Intended Usage Context of This Model? An Exploratory Study of Pre-Trained Models on Various Model Repositories
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 32, Issue 3Article No.: 69, Pages 1–57https://doi.org/10.1145/3569934There is a trend of researchers and practitioners to directly apply pre-trained models to solve their specific tasks. For example, researchers in software engineering (SE) have successfully exploited the pre-trained language models to automatically ...