Keyword: pre-trained models : Search

Applied Filters

People

Publications

Conferences

Publication Date

32 Results for: Keyword: pre-trained modelsEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,784,770 records)|Limit your search to The ACM Full-Text Collection (765,874 records)

Showing 1 - 20of32 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
Free
October 2024
UniGM: Unifying Multiple Pre-trained Graph Models via Adaptive Knowledge Aggregation
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 8556–8565https://doi.org/10.1145/3664647.3681018

Recent years have witnessed remarkable advances in graph representation learning using Graph Neural Networks (GNNs). To fully exploit the unlabeled graphs, researchers pre-train GNNs on large-scale graph databases and then fine-tune these pre-trained <u>...
0
7
Metrics
Total Citations0
Total Downloads7
Last 12 Months7
Last 6 weeks7
View online with eReader
PDF
research-article
Free
October 2024
Coding-PTMs: How to Find Optimal Code Pre-trained Models for Code Embedding in Vulnerability Detection?
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software EngineeringPages 1732–1744https://doi.org/10.1145/3691620.3695539

Vulnerability detection is garnering increasing attention in software engineering, since code vulnerabilities possibly pose significant security. Recently, reusing various code pre-trained models (e.g., CodeBERT, CodeT5, and CodeGen) has become common ...
0
24
Metrics
Total Citations0
Total Downloads24
Last 12 Months24
Last 6 weeks24
View online with eReader
PDF
research-article
Free
October 2024
ChatBR: Automated assessment and improvement of bug report quality using ChatGPT
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software EngineeringPages 1472–1483https://doi.org/10.1145/3691620.3695518

Bug reports, containing crucial information such as the Observed Behavior (OB), the Expected Behavior (EB), and the Steps to Reproduce (S2R), can help developers localize and fix bugs efficiently. However, due to the increasing complexity of some bugs ...
0
35
Metrics
Total Citations0
Total Downloads35
Last 12 Months35
Last 6 weeks35
View online with eReader
PDF
Article
September 2024
Leveraging Transfer Learning for Article Segmentation in Historical Newspapers
Linking Theory and Practice of Digital LibrariesPages 222–238https://doi.org/10.1007/978-3-031-72437-4_13
Abstract
Historical newspapers serve as invaluable resources for understanding past societies and preserving cultural heritage. However, digitizing these newspapers presents challenges due to their complex layouts and vast content. Article segmentation, ...
0
Metrics
Total Citations0
research-article
August 2024
Understanding the Performance of AI Algorithms in Text-Based Emotion Detection for Conversational Agents
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 8Article No.: 121, Pages 1–26https://doi.org/10.1145/3643133
Current industry trends demand automation in every aspect, where machines could replace humans. Recent advancements in conversational agents have grabbed a lot of attention from industries, markets, and businesses. Building conversational agents that ...
1
541
Metrics
Total Citations1
Total Downloads541
Last 12 Months541
Last 6 weeks79
Get Access
Upcoming Conferences
Skip slideshow

WSDM '25

March 10 - 14, 2025

Hannover Congress Centrum, Hannover, and Germany, Hannover, Germany

WSDM '25 Website

CHI '25

April 26 - May 1, 2025

Pacifico Yokohama, Yokohama, Japan

CHI '25 Website

ICSE 2025

April 26 - May 3, 2025

Shaw Centre, Ottawa, ON, Canada

ICSE 2025 Website

CCS '25

October 13 - 17, 2025

Taipei International Convention Center, Taipei, Taiwan

CIKM '25

November 10 - 14, 2025

COEX, Seoul, Republic of Korea
research-article
July 2024
C-Pack: Packed Resources For General Chinese Embeddings
SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 641–649https://doi.org/10.1145/3626772.3657878

We introduce C-Pack, a package of resources that significantly advances the field of general text embeddings for Chinese. C-Pack includes three critical resources. 1) C-MTP is a massive training dataset for text embedding, which is based on the curation ...
7
164
Metrics
Total Citations7
Total Downloads164
Last 12 Months164
Last 6 weeks31
Get Access
research-article
June 2024
RAPID: Zero-Shot Domain Adaptation for Code Search with Pre-Trained Models
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 5Article No.: 128, Pages 1–35https://doi.org/10.1145/3641542
Code search, which refers to the process of identifying the most relevant code snippets for a given natural language query, plays a crucial role in software maintenance. However, current approaches heavily rely on labeled data for training, which results ...
0
736
Metrics
Total Citations0
Total Downloads736
Last 12 Months736
Last 6 weeks101
Get Access
short-paper
Open Access
August 2024
Few-Shot Issue Report Classification with Adapters
- Fahad Ebrahim,
- Mike Joy
NLBSE '24: Proceedings of the Third ACM/IEEE International Workshop on NL-based Software EngineeringPages 41–44https://doi.org/10.1145/3643787.3648039

The automation of the classification of issue reports helps to improve the efficiency of the software tracking cycle. This task can be considered a multi-class classification problem. Therefore, this work is a participation in the NLBSE'24 Issue Report ...
0
46
Metrics
Total Citations0
Total Downloads46
Last 12 Months46
Last 6 weeks28
View online with eReader
PDF
research-article
Open Access
June 2024
How do Hugging Face Models Document Datasets, Bias, and Licenses? An Empirical Study
ICPC '24: Proceedings of the 32nd IEEE/ACM International Conference on Program ComprehensionPages 370–381https://doi.org/10.1145/3643916.3644412

Pre-trained Machine Learning (ML) models help to create ML-intensive systems without having to spend conspicuous resources on training a new model from the ground up. However, the lack of transparency for such models could lead to undesired consequences ...
2
374
Metrics
Total Citations2
Total Downloads374
Last 12 Months374
Last 6 weeks93
View online with eReader
PDF
short-paper
May 2024
Multi-step Automated Generation of Parameter Docstrings in Python: An Exploratory Study
ICSE-Companion '24: Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion ProceedingsPages 356–357https://doi.org/10.1145/3639478.3643110

Documentation debt hinders the effective utilisation of open-source software. Although code summarisation tools have been helpful for developers, most would prefer a detailed account of each parameter in a function rather than a high-level summary. ...
0
42
Metrics
Total Citations0
Total Downloads42
Last 12 Months42
Last 6 weeks5
Get Access
research-article
Open Access
March 2024
Representation Learning for Stack Overflow Posts: How Far Are We?
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 3Article No.: 69, Pages 1–24https://doi.org/10.1145/3635711
The tremendous success of Stack Overflow has accumulated an extensive corpus of software engineering knowledge, thus motivating researchers to propose various solutions for analyzing its content. The performance of such solutions hinges significantly on ...
6
1,205
Metrics
Total Citations6
Total Downloads1,205
Last 12 Months1,205
Last 6 weeks202
View online with eReader
PDF
research-article
Open Access
March 2024
IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 18–27https://doi.org/10.1145/3616855.3635849

Language models such as Bidirectional Encoder Representations from Transformers (BERT) have been very effective in various Natural Language Processing (NLP) and text mining tasks including text classification. However, some tasks still pose challenges ...
0
371
Metrics
Total Citations0
Total Downloads371
Last 12 Months371
Last 6 weeks50
1
Supplementary Material
3616855.3635849-video.mp4
View online with eReader
PDF
research-article
Open Access
March 2024
Pre-trained Recommender Systems: A Causal Debiasing Perspective
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 424–433https://doi.org/10.1145/3616855.3635779

Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully ...
2
425
Metrics
Total Citations2
Total Downloads425
Last 12 Months425
Last 6 weeks61
View online with eReader
PDF
research-article
February 2024
Toward Automatically Completing GitHub Workflows
ICSE '24: Proceedings of the IEEE/ACM 46th International Conference on Software EngineeringArticle No.: 13, Pages 1–12https://doi.org/10.1145/3597503.3623351

Continuous integration and delivery (CI/CD) are nowadays at the core of software development. Their benefits come at the cost of setting up and maintaining the CI/CD pipeline, which requires knowledge and skills often orthogonal to those entailed in ...
1
259
Metrics
Total Citations1
Total Downloads259
Last 12 Months259
Last 6 weeks37
Get Access
research-article
May 2024
I²R: Intra and inter-modal representation learning for code search
Intelligent Data Analysis (INDA), Volume 28, Issue 3Pages 807–823https://doi.org/10.3233/IDA-230082

Code search, which locates code snippets in large code repositories based on natural language queries entered by developers, has become increasingly popular in the software development process. It has the potential to improve the efficiency of software ...
0
Metrics
Total Citations0
research-article
September 2024
Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
ASE '23: Proceedings of the 38th IEEE/ACM International Conference on Automated Software EngineeringPages 585–597https://doi.org/10.1109/ASE56229.2023.00103

Upon evolving their software, organizations and individual developers have to spend a substantial effort to pay back technical debt, i.e., the fact that software is released in a shape not as good as it should be, e.g., in terms of functionality, ...
0
2
Metrics
Total Citations0
Total Downloads2
Last 12 Months2
Last 6 weeks2
Get Access
research-article
October 2023
Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 9546–9550https://doi.org/10.1145/3581783.3612862

Automatic emotion recognition has a wide range of applications in human-computer interaction. In this paper, we present our work in the Multimodel Emotion Recognition (MER) 2023, which contains three sub-challenges: MER-MULTI, MER-NOISE, and MER-SEMI. We ...
1
185
Metrics
Total Citations1
Total Downloads185
Last 12 Months170
Last 6 weeks12
1
Supplementary Material
mmgc030-video.mp4
Get Access
research-article
Open Access
July 2023
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models
- Jiabang He,
- Yi Hu,
- Lei Wang,
- Xing Xu,
- Ning Liu,
- Hui Liu,
- Heng Tao Shen
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 569–579https://doi.org/10.1145/3539618.3591670

Numerous pre-training techniques for visual document understanding (VDU) have recently shown substantial improvements in performance across a wide range of document tasks. However, these pre-trained VDU models cannot guarantee continued success when the ...
0
211
Metrics
Total Citations0
Total Downloads211
Last 12 Months143
Last 6 weeks17
View online with eReader
PDF
short-paper
June 2023
More Than Simply Masking: Exploring Pre-training Strategies for Symbolic Music Understanding
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia RetrievalPages 540–544https://doi.org/10.1145/3591106.3592237

Pre-trained language models have become the prevailing approach for handling natural language processing tasks in recent years. Given the similarities in sequential features between symbolic music and natural language text, it is fairly logical to adopt ...
2
180
Metrics
Total Citations2
Total Downloads180
Last 12 Months95
Last 6 weeks9
Get Access
research-article
May 2023
What Is the Intended Usage Context of This Model? An Exploratory Study of Pre-Trained Models on Various Model Repositories
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 32, Issue 3Article No.: 69, Pages 1–57https://doi.org/10.1145/3569934
There is a trend of researchers and practitioners to directly apply pre-trained models to solve their specific tasks. For example, researchers in software engineering (SE) have successfully exploited the pre-trained language models to automatically ...
4
794
Metrics
Total Citations4
Total Downloads794
Last 12 Months388
Last 6 weeks40
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

UniGM: Unifying Multiple Pre-trained Graph Models via Adaptive Knowledge Aggregation

Coding-PTMs: How to Find Optimal Code Pre-trained Models for Code Embedding in Vulnerability Detection?

ChatBR: Automated assessment and improvement of bug report quality using ChatGPT

Leveraging Transfer Learning for Article Segmentation in Historical Newspapers

Understanding the Performance of AI Algorithms in Text-Based Emotion Detection for Conversational Agents

Upcoming Conferences

C-Pack: Packed Resources For General Chinese Embeddings

RAPID: Zero-Shot Domain Adaptation for Code Search with Pre-Trained Models

Few-Shot Issue Report Classification with Adapters

How do Hugging Face Models Document Datasets, Bias, and Licenses? An Empirical Study

Multi-step Automated Generation of Parameter Docstrings in Python: An Exploratory Study

Representation Learning for Stack Overflow Posts: How Far Are We?

IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification

Pre-trained Recommender Systems: A Causal Debiasing Perspective

Toward Automatically Completing GitHub Workflows

I²R: Intra and inter-modal representation learning for code search

Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?

Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition

Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models

More Than Simply Masking: Exploring Pre-training Strategies for Symbolic Music Understanding

What Is the Intended Usage Context of This Model? An Exploratory Study of Pre-Trained Models on Various Model Repositories