default search action
EMNLP 2023: Singapore
- Mingxuan Wang, Imed Zitouni:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023 - Industry Track, Singapore, December 6-10, 2023. Association for Computational Linguistics 2023 - Frontmatter.
- Tingfeng Cao, Chengyu Wang, Bingyan Liu, Ziheng Wu, Jinhui Zhu, Jun Huang:
BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis. 1-11 - Chenhui Mao, Xiexiong Lin, Xin Jin, Xin Zhang:
Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation. 12-19 - Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee:
A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models. 20-31 - Tong Zhang, Junhong Liu, Chen Huang, Jia Liu, Hongru Liang, Zujie Wen, Wenqiang Lei:
Towards Effective Automatic Debt Collection with Persona Awareness. 32-45 - Nidhi Tiwari, Sneha Kola, Milos Milunovic, Si-qing Chen, Marjan Slavkovski:
Gatekeeper to save COGS and improve efficiency of Text Prediction. 46-53 - Nathan Brown, Ashton Williamson, Tahj Anderson, Logan Lawrence:
Efficient Transformer Knowledge Distillation: A Performance Review. 54-65 - Changzhen Ji, Yating Zhang, Adam Jatowt, Haipang Wu:
CDD: A Large Scale Dataset for Legal Intelligence Research. 66-73 - Noé Tits:
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning. 74-82 - Masha Belyi, Charlotte Dzialo, Chaitanya Dwivedi, Prajit Muppidi, Kanna Shimizu:
Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems. 83-92 - Fengjun Wang, Moran Beladev, Ofri Kleinfeld, Elina Frayerman, Tal Shachar, Eran Fainman, Karen Lastmann Assaraf, Sarai Mizrachi, Benjamin Wang:
Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities. 93-103 - Kee Kiat Koo, Ashutosh Joshi, Nishaanth Reddy, Karim Bouyarmane, Ismail B. Tutar, Vaclav Petricek, Changhe Yuan:
Deep Metric Learning to Hierarchically Rank - An Application in Product Retrieval. 104-112 - Youngja Park, Weiqiu You:
A Pretrained Language Model for Cyber Threat Intelligence. 113-122 - Rong Tian, Zijing Zhao, Weijie Liu, Haoyan Liu, Weiquan Mao, Zhe Zhao, Kan Zhou:
SAMP: A Model Inference Toolkit of Post-Training Quantization for Text Processing via Self-Adaptive Mixed-Precision. 123-130 - Sanjay Agrawal, Vivek Sembium, Ankith M. S:
KD-Boost: Boosting Real-Time Semantic Matching in E-commerce with Knowledge Distillation. 131-141 - Jingfen Zhang, Xuan Guo, Sravan Bodapati, Christopher Potts:
Multi-teacher Distillation for Multilingual Spelling Correction. 142-151 - Wei-Te Chen, Keiji Shinzato, Naoki Yoshinaga, Yandi Xia:
Does Named Entity Recognition Truly Not Scale Up to Real-world Product Attribute Extraction? 152-159 - Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan:
Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios. 160-175 - Tongxin Hu, Zhuang Li, Xin Jin, Lizhen Qu, Xin Zhang:
TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce. 176-184 - Zhengyuan Liu, Siti Umairah Md. Salleh, Hong Choon Oh, Pavitra Krishnaswamy, Nancy F. Chen:
Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations. 185-193 - Junjie Wang, Yicheng Chen, Wangshu Zhang, Sen Hu, Teng Xu, Jing Zheng:
AdapterDistillation: Non-Destructive Task Composition with Knowledge Distillation. 194-201 - Yuqing Wang, Prashanth Vijayaraghavan, Ehsan Degan:
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction. 202-215 - Justin Chiu:
Retrieval-Enhanced Dual Encoder Training for Product Matching. 216-222 - Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Yusen Hu, Bin Luo, Yifeng Geng, Xuansong Xie:
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models. 223-232 - Nobuhiro Kaji:
Lattice Path Edit Distance: A Romanization-aware Edit Distance for Extracting Misspelling-Correction Pairs from Japanese Search Query Logs. 233-242 - Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang:
Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization. 243-262 - Josiane Van Dorpe, Zachary Yang, Nicolas Grenon-Godbout, Grégoire Winterstein:
Unveiling Identity Biases in Toxicity Detection : A Game-Focused Dataset and Reactivity Analysis Approach. 263-274 - Yucheng Lin, Tim Chang, Yaning Chang, Jianqiang Ma, Donghui Li, Ting Peng, Zang Li, Zhiyi Zhou, Feng Wang:
ORANGE: Text-video Retrieval via Watch-time-aware Heterogeneous Graph Contrastive Learning. 275-283 - Christopher Hidey, Sarthak Sarthak:
Compute-Efficient Churn Reduction for Conversational Agents. 284-293 - Fangkai Yang, Pu Zhao, Zezhong Wang, Lu Wang, Bo Qiao, Jue Zhang, Mohit Garg, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering. 294-312 - Dan Li, Zi Long Zhu, Janneke van de Loo, Agnes Masip Gomez, Vikrant Yadav, Georgios Tsatsaronis, Zubair Afzal:
Enhancing Extreme Multi-Label Text Classification: Addressing Challenges in Model, Data, and Evaluation. 313-321 - Chengcan Ye, Ting Peng, Tim Chang, Zhiyi Zhou, Feng Wang:
Query-aware Multi-modal based Ranking Relevance in Video Search. 322-330 - Jack Good, Jimit Majmudar, Christophe Dupuy, Jixuan Wang, Charith Peris, Clement Chung, Richard S. Zemel, Rahul Gupta:
Coordinated Replay Sample Selection for Continual Federated Learning. 331-342 - Md. Tahmid Rahman Laskar, Xue-Yong Fu, Cheng Chen, Shashi Bhushan TN:
Building Real-World Meeting Summarization Systems using Large Language Models: A Practical Perspective. 343-352 - Spurthi Amba Hombaiah, Tao Chen, Mingyang Zhang, Michael Bendersky, Marc Najork, Matt Colen, Sergey Levi, Vladimir Ofitserov, Tanvir Amin:
Creator Context for Tweet Recommendation. 353-363 - Tyler Vuong, Karel Mundnich, Dhanush Bekal, Veera Raghavendra Elluru, Srikanth Ronanki, Sravan Bodapati:
AdaBERT-CTC: Leveraging BERT-CTC for Text-Only Domain Adaptation in ASR. 364-371 - Denis Kochedykov, Fenglin Yin, Sreevidya Khatravath:
Conversing with databases: Practical Natural Language Querying. 372-379 - Bhaktipriya Radharapu, Kevin Robinson, Lora Aroyo, Preethi Lahoti:
AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications. 380-395 - Dhruv Kumar, Vipul Raheja, Alice Kaiser-Schatzlein, Robyn Perry, Apurva Joshi, Justin Hugues-Nuger, Samuel Lou, Navid Chowdhury:
Speakerly: A Voice-based Writing Assistant for Text Composition. 396-407 - Xianzhi Li, Samuel Chan, Xiaodan Zhu, Yulong Pei, Zhiqiang Ma, Xiaomo Liu, Sameena Shah:
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks. 408-422 - Zhongkai Sun, Zhengyang Zhao, Sixing Lu, Chengyuan Ma, Xiaohu Liu, Xing Fan, Wei Shen, Chenlei Guo:
CL-QR: Cross-Lingual Enhanced Query Reformulation for Multi-lingual Conversational AI Agents. 423-431 - Zhongkai Sun, Yingxue Zhou, Jie Hao, Xing Fan, Yanbin Lu, Chengyuan Ma, Wei Shen, Chenlei Guo:
Improving Contextual Query Rewrite for Conversational AI Agents through User-preference Feedback Learning. 432-439 - Bhavuk Singhal, Sindhuja Gopalan, Amrith Krishna, Malolan Chetlur:
Scaling Neural ITN for Numbers and Temporal Expressions in Tamil: Findings for an Agglutinative Low-resource Language. 440-450 - Gabrielle Cohn, Rishika Agarwal, Deepanshu Gupta, Siddharth Patwardhan:
EELBERT: Tiny Models through Dynamic Embeddings. 451-459 - Hasmot Ali, AKM Shahariar Azad Rabby, Md. Majedul Islam, A. k. m Mahamud, Nazmul Hasan, Fuad Rahman:
Gold Standard Bangla OCR Dataset: An In-Depth Look at Data Preprocessing and Annotation Processes. 460-470 - Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu, Yuan Qi:
PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching. 471-482 - Lilach Eden, Yoav Kantor, Matan Orbach, Yoav Katz, Noam Slonim, Roy Bar-Haim:
Welcome to the Real World: Efficient, Incremental and Scalable Key Point Analysis. 483-491 - Hadeel Saadany, Constantin Orasan:
Automatic Linking of Judgements to UK Supreme Court Hearings. 492-500 - Zhiping Wang, Peng Lin, Hainan Zhang, Hongshen Chen, Tianhao Li, Zhuoye Ding, Sulong Xu, Jinghe Hu:
Automatic Marketing Theme and Commodity Construction System for E-commerce. 501-508 - Shumpei Inoue, Minh-Tien Nguyen, Hiroki Mizokuchi, Tuan-Anh D. Nguyen, Huu-Hiep Nguyen, Dung Le:
Towards Safer Operations: An Expert-involved Dataset of High-Pressure Gas Incidents for Preventing Future Failures. 509-521 - Yuanzhou Yao, Zhao Zhang, Kaijia Yang, Huasheng Liang, Qiang Yan, Yongjun Xu:
An Auxiliary Task Boosted Multi-task Learning Method for Service Account Retrieval with Limited Human Annotation. 522-531 - Siyu An, Ye Liu, Haoyuan Peng, Di Yin:
VKIE: The Application of Key Information Extraction on Video Text. 532-540 - Varun Nathan, Ayush Kumar, Jithendra Vepa:
Investigating the Role and Impact of Disfluency on Summarization. 541-551 - Sandeep Sricharan Mukku, Manan Soni, Chetan Aggarwal, Jitenkumar Rana, Promod Yenigalla, Rashmi Patange, Shyam Mohan:
InsightNet : Structured Insight Mining from Customer Feedback. 552-566 - Karan Singla, Yeon-Jun Kim, Srinivas Bangalore:
E2E Spoken Entity Extraction for Virtual Agents. 567-574 - Ansel Blume, Nasser Zalmout, Heng Ji, Xian Li:
Generative Models for Product Attribute Extraction. 575-585 - Md. Rashad Al Hasan Rony, Christian Suess, Sinchana Ramakanth Bhat, Viju Sudhi, Julia Schneider, Maximilian Vogel, Roman Teucher, Ken E. Friedl, Soumya R. Sahoo:
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering. 586-604 - Andrea Zugarini, Andrew Zamai, Marco Ernandes, Leonardo Rigutini:
BUSTER: a "BUSiness Transaction Entity Recognition" dataset. 605-611 - Leonidas Gee, Leonardo Rigutini, Marco Ernandes, Andrea Zugarini:
Multi-word Tokenization for Sequence Compression. 612-621 - Shangching Liu, Shengkun Wang, Tsungyao Chang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo, Jianwei Zhang:
JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization. 622-630 - Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki, Sravan Bodapati:
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs. 631-639 - Leon Liyang Zhang, Jiarui Lu, Joel Ruben Antony Moniz, Aditya Kulkarni, Dhivya Piraviperumal, Tien Dung Tran, Nick Tzou, Hong Yu:
STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants. 640-649 - Xiaoyu Tan, Shaojie Shi, Xihe Qiu, Chao Qu, Zhenting Qi, Yinghui Xu, Yuan Qi:
Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness. 650-662 - Besnik Fetahu, Zhiyu Chen, Oleg Rokhlenko, Shervin Malmasi:
InstructPTS: Instruction-Tuning LLMs for Product Title Summarization. 663-674 - Lei Wang, Songheng Zhang, Yun Wang, Ee-Peng Lim, Yong Wang:
LLM4Vis: Explainable Visualization Recommendation using ChatGPT. 675-692 - Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN: Visual Document Understanding By Language-Image Network. 693-706 - Lijun Yu, Jin Miao, Xiaoyu Sun, Jiayi Chen, Alexander G. Hauptmann, Hanjun Dai, Wei Wei:
DocumentNet: Bridging the Data Gap in Document Pre-training. 707-722 - Jihyuk Kim, Minsoo Kim, Joonsuk Park, Seung-won Hwang:
Relevance-assisted Generation for Robust Zero-shot Retrieval. 723-731 - Aryan Jain, Jitenkumar Rana, Chetan Aggarwal:
Too much of product information : Don't worry, let's look for evidence! 732-738 - Xinli Yu, Zheng Chen, Yanbin Lu:
Harnessing LLMs for Temporal Data - A Study on Explainable Financial Time Series Forecasting. 739-753 - Minh Thuan Nguyen, Khanh-Tung Tran, Nhu-Van Nguyen, Xuan-Son Vu:
ViGPTQA - State-of-the-Art LLMs for Vietnamese Question Answering: System Overview, Core Models Training, and Evaluations. 754-764 - Jinkyung Jo, Dayeon Ki, Soyoung Yoon, Minjoon Seo:
An Integrated Search System for Korea Weather Data. 765-774 - Mingming Li, Chunyuan Yuan, Huimu Wang, Peng Wang, Jingwei Zhuo, Binbin Wang, Lin Liu, Sulong Xu:
Adaptive Hyper-parameter Learning for Deep Semantic Retrieval. 775-782 - Hojae Han, Yu Jin Kim, Byoungjip Kim, Youngwon Lee, Kyungjae Lee, Kyungmin Lee, Moontae Lee, Kyunghoon Bae, Seung-won Hwang:
On Sample-Efficient Code Generation. 783-791 - Zhoujun Cheng, Jungo Kasai, Tao Yu:
Batch Prompting: Efficient Inference with Large Language Model APIs. 792-810 - Zheng Chen, Ziyan Jiang, Fan Yang, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Aram Galstyan:
Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding. 811-819 - David Q. Sun, Artem Abzaliev, Hadas Kotek, Christopher Klein, Zidi Xiu, Jason D. Williams:
DELPHI: Data for Evaluating LLMs' Performance in Handling Controversial Issues. 820-827 - Saiful Haq, Ashutosh Sharma, Pushpak Bhattacharyya:
Angel: Enterprise Search System for the Non-Profit Industry. 828-835
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.