


ACL 2025: Vienna, Austria
- Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar:

Findings of the Association for Computational Linguistics, ACL 2025, Vienna, Austria, July 27 - August 1, 2025. Findings of ACL, ACL 2025, Association for Computational Linguistics 2025, ISBN 979-8-89176-256-5 - Frontmatter.

- Yachao Zhao, Bo Wang, Yan Wang, Dongming Zhao, Ruifang He, Yuexian Hou:

Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection. 1-12 - Yanbei Jiang, Yihao Ding, Chao Lei, Jiayang Ao, Jey Han Lau, Krista A. Ehinger:

Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task. 13-45 - Guhao Feng, Kai Yang, Yuntian Gu, Xinyue Ai, Shengjie Luo, Jiacheng Sun, Di He, Zhenguo Li, Liwei Wang:

How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs. 46-85 - Zeliang Zhang, Xiaodong Liu, Hao Cheng, Chenliang Xu, Jianfeng Gao:

Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts. 86-102 - Dongshuo Liu, Zhijing Wu, Dandan Song, Heyan Huang:

A Persona-Aware LLM-Enhanced Framework for Multi-Session Personalized Dialogue Generation. 103-123 - Yanzhi Tian, Zeming Liu, Zhengyang Liu, Yuhang Guo:

Exploring In-Image Machine Translation with Real-World Background. 124-137 - Wei Li, Lujun Li, Mark G. Lee, Shengjie Sun, Lei Zhang, Wei Xue, Yike Guo:

BayesKD: Bayesian Knowledge Distillation for Compact LLMs in Constrained Fine-tuning Scenarios. 138-152 - Lingyuan Liu, Mengxiang Zhang:

GOLFer: Smaller LMs-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval. 153-162 - Lingyuan Liu, Mengxiang Zhang:

Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion. 163-173 - Alexander Shvets:

Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification. 174-191 - Zifeng Cheng, Zhaoling Chen, Zhiwei Jiang, Yafeng Yin, Cong Wang, Shiping Ge, Qing Gu:

Multi-Prompting Decoder Helps Better Language Understanding. 192-208 - Sam O'Connor Russell, Naomi Harte:

Visual Cues Enhance Predictive Turn-Taking for Two-Party Human Interaction. 209-221 - Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Haiwen Hong, Huan-ang Gao, Longtao Huang, Hui Xue, Huimin Chen, Zhiyuan Liu, Maosong Sun:

The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning. 222-243 - Jie Zhu, Junhui Li, Yalong Wen, Xiandong Li, Lifan Guo, Feng Chen:

MFinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset. 244-266 - Yijie Zhong, Yunfan Gao, Xiaolian Zhang, Haofen Wang:

ODDA: An OODA-Driven Diverse Data Augmentation Framework for Low-Resource Relation Extraction. 267-285 - Luca Cagliero, Lorenzo Vaiani, Eliana Pastor, Alkis Koudounas, Elena Baralis, Vittorio Mazzia, Sandro Pollastrini, Thomas Gueudré, Manuel Giollo, Daniele Amberti, Yue Wu:

Detecting and Mitigating Challenges in Zero-Shot Video Summarization with Video LLMs. 286-301 - Tarek Mahmoud, Zhuohan Xie, Dimitar Iliyanov Dimitrov, Nikolaos Nikolaidis, Purificação Silvano, Roman Yangarber, Shivam Sharma, Elisa Sartori, Nicolas Stefanovitch, Giovanni Da San Martino, Jakub Piskorski, Preslav Nakov:

Entity Framing and Role Portrayal in the News. 302-326 - Guangya Wan, Yuqi Wu, Hao Wang, Shengming Zhao, Jie Chen, Sheng Li:

Derailer-Rerailer: Adaptive Verification for Efficient and Reliable Language Model Reasoning. 327-348 - Yiming Li, Zhao Zhang:

Leveraging Large Language Models for Conversational Multi-Doc Question Answering: The First Place of WSDM Cup 2024. 349-355 - Wenyu Tao, Xiaofen Xing, Yirong Chen, Linyi Huang, Xiangmin Xu:

TreeRAG: Unleashing the Power of Hierarchical Storage for Enhanced Knowledge Retrieval in Long Documents. 356-371 - Qiang Ding, Lvzhou Luo, Yixuan Cao, Ping Luo:

Attention with Dependency Parsing Augmentation for Fine-Grained Attribution. 372-387 - Yikuan Hu, Chen Huang, Wenqiang Lei:

ASTRO: Automatic Strategy Optimization For Non-Cooperative Dialogues. 388-408 - Chen Xiong, Xiangyu Qi, Pin-Yu Chen, Tsung-Yi Ho:

Defensive Prompt Patch: A Robust and Generalizable Defense of Large Language Models against Jailbreak Attacks. 409-437 - Jessica Lin, Amir Zeldes:

GUM-SAGE: A Novel Dataset and Approach for Graded Entity Salience Prediction. 438-455 - Zacchary Sadeddine, Fabian M. Suchanek:

Verifying the Steps of Deductive Reasoning Chains. 456-475 - Pardis Sadat Zahraei, Ali Emami:

Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations. 476-501 - Benjamin C. Warner, Ziqi Xu, Simon Haroutounian, Thomas George Kannampallil, Chenyang Lu:

Utilizing Semantic Textual Similarity for Clinical Survey Data Feature Selection. 502-520 - Runchu Tian, Yanghao Li, Yuepeng Fu, Siyang Deng, Qinyu Luo, Cheng Qian, Shuo Wang, Xin Cong, Zhong Zhang, Yesai Wu, Yankai Lin, Huadong Wang, Xiaojiang Liu:

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs. 521-533 - Razvan-Gabriel Dumitru, Vikas Yadav, Rishabh Maheshwary, Paul-Ioan Clotan, Sathwik Tejaswi Madhusudhan, Mihai Surdeanu:

Variable Layerwise Quantization: A Simple and Effective Approach to Quantize LLMs. 534-550 - Kazuki Irie:

Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? A Petroglyph Revisited. 551-559 - Guofeng Cui, Pichao Wang, Yang Liu, Zemian Ke, Zhu Liu, Vimal Bhat:

CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation. 560-574 - Nishanth Sridhar Nakshatri, Nikhil Mehta, Siyi Liu, Sihao Chen, Daniel Hopkins, Dan Roth, Dan Goldwasser:

Talking Point based Ideological Discourse Analysis in News Events. 575-594 - Runheng Liu, Xingchen Xiao, Heyan Huang, Zewen Chi, Zhijing Wu:

FlashBack: Efficient Retrieval-Augmented Language Modeling for Fast Inference. 595-608 - Guangya Yu, Yanhao Li, Zongying Jiang, Yuxiong Jin, Li Dai, Yupian Lin, Ruihui Hou, Weiyan Zhang, Yongqi Fan, Qi Ye, Jingping Liu, Tong Ruan:

CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation. 609-626 - Liyu Zhang, Weiqi Wang, Tianqing Fang, Yangqiu Song:

ConKE: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning. 627-635 - ChengAo Shen, Zhengzhang Chen, Dongsheng Luo, Dongkuan Xu, Haifeng Chen, Jingchao Ni:

Exploring Multi-Modal Data with Tool-Augmented LLM Agents for Precise Causal Discovery. 636-660 - Yaxun Dai, Haiqin Yang, Hao Mou, Pingfu Chao:

PARSQL: Enhancing Text-to-SQL through SQL Parsing and Reasoning. 661-681 - Yuntai Bao, Xuhong Zhang, Tianyu Du, Xinkui Zhao, Zhengwen Feng, Hao Peng, Jianwei Yin:

Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks. 682-700 - Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover:

Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization. 701-723 - Junhao Yu, Yan Zhuang, Yuxuan Sun, Weibo Gao, Qi Liu, Mingyue Cheng, Zhenya Huang, Enhong Chen:

TestAgent: An Adaptive and Intelligent Expert for Human Assessment. 724-747 - Quan Ze Chen, Kevin Feng, Chan Young Park, Amy X. Zhang:

SPICA: Retrieving Scenarios for Pluralistic In-Context Alignment. 748-765 - Kushal Jain, Moritz Miller, Niket Tandon, Kumar Shridhar:

First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning. 766-778 - Wei Xiang, Chuanhong Zhan, Qing Zhang, Bang Wang:

Evaluating Instructively Generated Statement by Large Language Models for Directional Event Causality Identification. 779-785 - Chengwei Wei, Bin Wang, Jung-Jae Kim, Guimei Liu, Nancy F. Chen:

CoinMath: Harnessing the Power of Coding Instruction for Math LLM. 786-797 - Zain Muhammad Mujahid, Dilshod Azizov, Maha Tufail Agro, Preslav Nakov:

Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts. 798-819 - Kun Zhang, Oana Balalau, Ioana Manolescu:

Structured Discourse Representation for Factual Consistency Verification. 820-838 - Chuyi Kong, Ziyang Luo, Hongzhan Lin, Zhiyuan Fan, Yaxin Fan, Yuxi Sun, Jing Ma:

SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing LLMs. 839-866 - Luke Gessler, Alexis Palmer, Katharina von der Wense:

Understanding the Gap: an Analysis of Research Collaborations in NLP and Language Documentation. 867-877 - Juntao Tan, Liangwei Yang, Zuxin Liu, Zhiwei Liu, Rithesh R. N., Tulika Manoj Awalgaonkar, Jianguo Zhang, Weiran Yao, Ming Zhu, Shirley Kokane, Silvio Savarese, Huan Wang, Caiming Xiong, Shelby Heinecke:

PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data. 878-893 - Simret Araya Gebreegziabher, Kuangshi Ai, Zheng Zhang, Elena L. Glassman, Toby Jia-Jun Li:

Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning. 894-906 - Eric Modesitt, Ke Yang, Spencer Hulsey, Xin Liu, ChengXiang Zhai, Volodymyr V. Kindratenko:

ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study. 907-926 - Xiaobo Guo, Soroush Vosoughi:

Serial Position Effects of Large Language Models. 927-953 - Zhiyin Yu, Chao Zheng, Chong Chen, Xian-Sheng Hua, Xiao Luo:

scRAG: Hybrid Retrieval-Augmented Generation for LLM-based Cross-Tissue Single-Cell Annotation. 954-970 - Abu Ubaida Akash, Ahmed Fahmy, Amine Trabelsi:

Can Large Language Models Address Open-Target Stance Detection? 971-985 - Congchi Yin, Yongpeng Zhang, Xuyun Wen, Piji Li:

Improve Language Model and Brain Alignment via Associative Memory. 986-999 - Ziyang Ma, Xiquan Li, Yakun Song, Wenxi Chen, Chenpeng Du, Jian Wu, Yuanzhe Chen, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen:

Towards Reliable Large Audio Language Model. 1000-1014 - Sho Takase, Ryokan Ri, Shun Kiyono, Takuya Kato:

Large Vocabulary Size Improves Large Language Models. 1015-1026 - Zihan Wang, Xiaocui Yang, Yongkang Liu, Shi Feng, Daling Wang, Yifei Zhang:

MUSE: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles. 1027-1053 - Michelle Wastl, Jannis Vamvas, Rico Sennrich:

Machine Translation Models are Zero-Shot Detectors of Translation Direction. 1054-1074 - Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Boxing Chen, Sarath Chandar:

Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination. 1075-1096 - Jie He, Jennifer Neville, Mengting Wan, Longqi Yang, Hui Liu, Xiaofeng Xu, Xia Song, Jeff Z. Pan, Pei Zhou:

GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation. 1097-1122 - Chengxing Xie, Bowen Li, Chang Gao, He Du, Wai Lam, Difan Zou, Kai Chen:

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution. 1123-1139 - Zixuan Wu, Yoolim Kim, Carolyn Jane Anderson:

GlyphPattern: An Abstract Pattern Recognition for Vision-Language Models. 1140-1175 - Qianli Wang, Nils Feldhus, Simon Ostermann, Luis Felipe Villa-Arenas, Sebastian Möller, Vera Schmitt:

FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation. 1176-1191 - Guocong Li, Weize Liu, Yihang Wu, Ping Wang, Shuaihan Huang, Hongxia Xu, Jian Wu:

From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs. 1192-1209 - Di Wu, Xin Lu, Yanyan Zhao, Bing Qin:

Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models. 1210-1225 - Rongwu Xu, Xiaojian Li, Shuo Chen, Wei Xu:

Nuclear Deployed!: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents. 1226-1310 - Dacao Zhang, Kun Zhang, Shimao Chu, Le Wu, Xin Li, Si Wei:

MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning. 1311-1324 - Xin-Yu Xiao, Yalei Liu, Xiangyu Liu, Zengrui Li, Erwei Yin, Qianchen Xia:

Lunar Twins: We Choose to Go to the Moon with Large Language Models. 1325-1339 - Dora Zhao, Qianou Ma, Xinran Zhao, Chenglei Si, Chenyang Yang, Ryan Louie, Ehud Reiter, Diyi Yang, Tongshuang Wu:

SPHERE: An Evaluation Card for Human-AI Systems. 1340-1365 - Maximillian Chen, Ruoxi Sun, Sercan Ö. Arik:

Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling. 1366-1387 - Haochen Liu, Song Wang, Chen Chen, Jundong Li:

Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models. 1388-1400 - Huaizhi Qu, Xinyu Zhao, Jie Peng, Kwonjoon Lee, Behzad Dariush, Tianlong Chen:

UQ-Merge: Uncertainty Guided Multimodal Large Language Model Merging. 1401-1417 - Korbinian Q. Weidinger, T. Y. S. S. Santosh, Oana Ichim, Matthias Grabmair:

AQuAECHR: Attributed Question Answering for European Court of Human Rights. 1418-1447 - Yuhao Zhang, Xiangnan Ma, Kaiqi Kou, Peizhuo Liu, Weiqiao Shan, Benyou Wang, Tong Xiao, Yuxin Huang, Zhengtao Yu, JingBo Zhu:

Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation. 1448-1460 - Yiqin Wang, Haoji Zhang, Jingqi Tian, Yansong Tang:

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control. 1461-1473 - Jiayi Gui, Yiming Liu, Jiale Cheng, Xiaotao Gu, Xiao Liu, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang:

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models. 1474-1491 - Jiarui Ji, Runlin Lei, Jialing Bi, Zhewei Wei, Xu Chen, Yankai Lin, Xuchen Pan, Yaliang Li, Bolin Ding:

LLM-Based Multi-Agent Systems are Scalable Graph Generative Models. 1492-1523 - Tiankai Yang, Yi Nian, Li Li, Ruiyao Xu, Yuangang Li, Jiaqi Li, Zhuo Xiao, Xiyang Hu, Ryan A. Rossi, Kaize Ding, Xia Hu, Yue Zhao:

AD-LLM: Benchmarking Large Language Models for Anomaly Detection. 1524-1547 - Jie Liu, Guohua Wang, Ronghui Yang, Jiajie Zeng, Mengchen Zhao, Yi Cai:

RTADev: Intention Aligned Multi-Agent Framework for Software Development. 1548-1581 - Shivam Shandilya, Menglin Xia, Supriyo Ghosh, Huiqiang Jiang, Jue Zhang, Qianhui Wu, Victor Rühle, Saravan Rajmohan:

TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning. 1582-1597 - Kyeongman Park, Minbeom Kim, Kyomin Jung:

A Character-Centric Creative Story Generation via Imagination. 1598-1645 - Minghan Wang, Viet-Thanh Pham, Farhad Moghimifar, Thuy-Trang Vu:

Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model. 1646-1662 - Yang Zhang, Shixin Yang, Chenjia Bai, Fei Wu, Xiu Li, Zhen Wang, Xuelong Li:

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration. 1663-1699 - Chuanyuan Tan, Wenbiao Shao, Hao Xiong, Tong Zhu, Zhenhua Liu, Kai Shi, Wenliang Chen:

UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions. 1700-1715 - Minjie Qiang, Zhongqing Wang, Xiaoyi Bao, Haoyuan Ma, Shoushan Li, Guodong Zhou:

Exploring Knowledge Filtering for Retrieval-Augmented Discriminative Tasks. 1716-1729 - Chong Li, Yingzhuo Deng, Jiajun Zhang, Chengqing Zong:

Group then Scale: Dynamic Mixture-of-Experts Multilingual Language Model. 1730-1754 - Fangxu Yu, Junjie Guo, Zhen Wu, Xinyu Dai:

Beyond Verbal Cues: Emotional Contagion Graph Network for Causal Emotion Entailment. 1755-1767 - Xin Zheng, Jie Lou, Boxi Cao, Xueru Wen, Yuqiu Ji, Hongyu Lin, Yaojie Lu, Xianpei Han, Debing Zhang, Le Sun:

Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic. 1768-1806 - Sondre Wold, Lucas Georges Gabriel Charpentier, Étienne Simon:

Systematic Generalization in Language Models Scales with Information Entropy. 1807-1819 - Byung-Doh Oh, Hongao Zhu, William Schuler:

The Inverse Scaling Effect of Pre-Trained Language Model Surprisal Is Not Due to Data Leakage. 1820-1827 - Ganlin Xu, Zhoujia Zhang, Wangyi Mei, Jiaqing Liang, Weijia Lu, Xiaodong Zhang, Zhifei Yang, Xiaofeng Ma, Yanghua Xiao, Deqing Yang:

Logical Consistency is Vital: Neural-Symbolic Information Retrieval for Negative-Constraint Queries. 1828-1847 - Rena Wei Gao, Xuetong Wu, Siwen Luo, Caren Han, Feng Liu:

'No' Matters: Out-of-Distribution Detection in Multimodality Multi-Turn Interactive Dialogue. 1848-1864 - Qizhi Wan, Liu Tao, Changxuan Wan, Rong Hu, Keli Xiao, Yuxin Shuai:

Event Pattern-Instance Graph: A Multi-Round Role Representation Learning Strategy for Document-Level Event Argument Extraction. 1865-1877 - Lukas Edman, Helmut Schmid, Alexander Fraser:

EXECUTE: A Multilingual Benchmark for LLM Token Understanding. 1878-1887 - Wei-Fan Chen, Zhixue Zhao, Akbar Karimi, Lucie Flek:

Explainable Hallucination through Natural Language Inference Mapping. 1888-1896 - Hao Liu, Zhengren Wang, Xi Chen, Zhiyu Li, Feiyu Xiong, Qinhan Yu, Wentao Zhang:

HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation. 1897-1913 - Markus Frohmann, Gabriel Meseguer-Brocal, Markus Schedl, Elena V. Epure:

Double Entendre: Robust Audio-Based AI-Generated Lyrics Detection via Multi-View Fusion. 1914-1926 - Sangmin Woo, Donguk Kim, Jaehyuk Jang, Yubin Choi, Changick Kim:

Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vision Language Models. 1927-1951 - Xiaoning Dong, Wenbo Hu, Wei Xu, Tianxing He:

SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task Linkage. 1952-1987 - Yifan Hu, Rui Liu, Yi Ren, Xiang Yin, Haizhou Li:

Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis. 1988-2003 - Aochuan Chen, Jiashun Cheng, Zijing Liu, Ziqi Gao, Fugee Tsung, Yu Li, Jia Li:

Parameter-Efficient Fine-Tuning via Circular Convolution. 2004-2019 - Jiahao Li, Zhendong Mao, Quan Wang:

Alleviating Hallucinations in Large Language Models via Truthfulness-driven Rank-adaptive LoRA. 2020-2031 - Xinye Li, Zunwen Zheng, Qian Zhang, Dekai Zhuang, Jiabao Kang, Liyan Xu, Qingbin Liu, Xi Chen, Zhiying Tu, Dianhui Chu, Dianbo Sui:

ScEdit: Script-based Assessment of Knowledge Editing. 2032-2052 - Seanie Lee, Dong Bok Lee, Dominik Wagner, Minki Kang, Haebin Seong, Tobias Bocklet, Juho Lee, Sung Ju Hwang:

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models. 2053-2069 - Rena Wei Gao, Ming-Bin Chen, Lea Frermann, Jey Han Lau:

Moderation Matters: Measuring Conversational Moderation Impact in English as a Second Language Group Discussion. 2070-2095 - Katherine Atwell, Mandy Simons, Malihe Alikhani:

Measuring Bias and Agreement in Large Language Model Presupposition Judgments. 2096-2107 - Jeonghun Baek, Akiko Aizawa, Kiyoharu Aizawa:

Harnessing PDF Data for Improving Japanese Large Multimodal Models. 2108-2123 - Pranaydeep Singh, Eneko Agirre, Gorka Azkune, Orphée De Clercq, Els Lefever:

EnerGIZAr: Leveraging GIZA++ for Effective Tokenizer Initialization. 2124-2137 - Yuxiang Chai, Siyuan Huang, Yazhe Niu, Han Xiao, Liang Liu, Guozhi Wang, Dingyu Zhang, Shuai Ren, Hongsheng Li:

AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents. 2138-2156 - Houjun Liu, John Bauer, Christopher D. Manning:

Drop Dropout on Single Epoch Language Model Pretraining. 2157-2166 - Zongqi Wang, Baoyuan Wu, Jingyuan Deng, Yujiu Yang:

Robust and Minimally Invasive Watermarking for EaaS. 2167-2191 - Andrei Jarca, Florinel-Alin Croitoru, Radu Tudor Ionescu:

Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text. 2192-2201 - Taneesh Gupta, Shivam Shandilya, Xuchao Zhang, Rahul Madhavan, Supriyo Ghosh, Chetan Bansal, Huaxiu Yao, Saravan Rajmohan:

CARMO: Dynamic Criteria Generation for Context Aware Reward Modelling. 2202-2261 - Wenxi Chen, Ziyang Ma, Ruiqi Yan, Yuzhe Liang, Xiquan Li, Ruiyang Xu, Zhikang Niu, Yanqiao Zhu, Yifan Yang, Zhanxun Liu, Kai Yu, Yuxuan Hu, Jinyu Li, Yan Lu, Shujie Liu, Xie Chen:

SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training. 2262-2282 - Yanyang Li, Tin Long Wong, Cheung To Hung, Jianqiao Zhao, Duo Zheng, Ka Wai Liu, Michael R. Lyu, Liwei Wang:

C²LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation. 2283-2306 - Wei Zhou, Mohsen Mesgar, Heike Adel, Annemarie Friedrich:

Texts or Images? A Fine-grained Analysis on the Effectiveness of Input Representations and Models for Table Question Answering. 2307-2318 - Keyeun Lee, Seolhee Lee, Esther Hehsun Kim, Yena Ko, Jinsu Eun, Dahee Kim, Hyewon Cho, Haiyi Zhu, Robert E. Kraut, Eunyoung Suh, Eun-mee Kim, Hajin Lim:

Adaptive-VP: A Framework for LLM-Based Virtual Patients that Adapts to Trainees' Dialogue to Facilitate Nurse Communication Training. 2319-2352 - Hai Huang, Yan Xia, Shengpeng Ji, Shulei Wang, Hanting Wang, Minghui Fang, Jieming Zhu, Zhenhua Dong, Sashuai Zhou, Zhou Zhao:

Enhancing Multimodal Unified Representations for Cross Modal Generalization. 2353-2366 - Da Ju, Hagen Blix, Adina Williams:

Domain Regeneration: How well do LLMs match syntactic properties of text domains? 2367-2388 - Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier:

Structural Deep Encoding for Table Question Answering. 2389-2402 - Bo Li, Gexiang Fang, Wei Ye, Zhenghua Xu, Jinglei Zhang, Hao Cheng, Shikun Zhang:

MPL: Multiple Programming Languages with Large Language Models for Information Extraction. 2403-2414 - Zheng Chu, Huiming Fan, Jingchang Chen, Qianyu Wang, Mingda Yang, Jiafeng Liang, Zhongjie Wang, Hao Li, Guo Tang, Ming Liu, Bing Qin:

Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering. 2415-2438 - Ruizhe Li, Yanjun Gao:

Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions. 2439-2465 - Sreyan Ghosh, Mohammad Sadegh Rasooli, Michael Levit, Peidong Wang, Jian Xue, Dinesh Manocha, Jinyu Li:

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation. 2466-2482 - Ruikang Hu, Shaoyu Lin, Yeliang Xiu, Yongmei Liu:

LTRAG: Enhancing Autoformalization and Self-refinement for Logical Reasoning with Thought-Guided RAG. 2483-2493 - Giuseppe Ruggiero, Matteo Testa, Jurgen Van de Walle, Luigi Di Caro:

Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation. 2494-2504 - Ke Wang, Junting Pan, Linda Wei, Aojun Zhou, Weikang Shi, Zimu Lu, Han Xiao, Yunqiao Yang, Houxing Ren, Mingjie Zhan, Hongsheng Li:

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning. 2505-2534 - Boyang Xue, Hongru Wang, Rui Wang, Sheng Wang, Zezhong Wang, Yiming Du, Bin Liang, Wenxuan Zhang, Kam-Fai Wong:

MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models. 2535-2556 - Keyuan Cheng, Zijian Kan, Zhuoran Zhang, Muhammad Asif Ali, Lijie Hu, Di Wang:

COMPKE: Complex Question Answering under Knowledge Editing. 2557-2576 - Junhao Hu, Wenrui Huang, Weidong Wang, Zhenwen Li, Tiancheng Hu, Zhixia Liu, Xusheng Chen, Tao Xie, Yizhou Shan:

RaaS: Reasoning-Aware Attention Sparsity for Efficient LLM Reasoning. 2577-2590 - Rongguang Ye, Ming Tang:

One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models. 2591-2604 - Shangda Wu, Zhancheng Guo, Ruibin Yuan, Junyan Jiang, Seungheon Doh, Gus Xia, Juhan Nam, Xiaobing Li, Feng Yu, Maosong Sun:

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages. 2605-2625 - Ming Zhang, Yuhui Wang, Yujiong Shen, Tingyi Yang, Changhao Jiang, Yilong Wu, Shihan Dou, Qinhao Chen, Zhiheng Xi, Zhihao Zhang, Yi Dong, Zhen Wang, Zhihui Fei, Mingyang Wan, Tao Liang, Guojun Ma, Qi Zhang, Tao Gui, Xuanjing Huang:

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts. 2626-2649 - Lang Qin, Yao Zhang, Hongru Liang, Adam Jatowt, Zhenglu Yang:

Listening to Patients: Detecting and Mitigating Patient Misreport in Medical Dialogue System. 2650-2664 - Xiaoyang Hu, Richard L. Lewis:

Do Language Models Understand the Cognitive Tasks Given to Them? Investigations with the N-Back Paradigm. 2665-2677 - Yuxia Geng, Runkai Zhu, Jiaoyan Chen, Jintai Chen, Xiang Chen, Zhuo Chen, Shuofei Qiao, Yuxiang Wang, Xiaoliang Xu, Sheng-Jun Huang:

Graph-guided Cross-composition Feature Disentanglement for Compositional Zero-shot Learning. 2678-2690 - Wenhao Li, Yuxin Zhang, Gen Luo, Daohai Yu, Rongrong Ji:

Training Long-Context LLMs Efficiently via Chunk-wise Optimization. 2691-2700 - Jiashun Cheng, Aochuan Chen, Nuo Chen, Ziqi Gao, Yuhan Li, Jia Li, Fugee Tsung:

Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps. 2701-2718 - Keyuan Cheng, Xudong Shen, Yihao Yang, Tengyue Wang, Yang Cao, Muhammad Asif Ali, Hanbin Wang, Lijie Hu, Di Wang:

CODEMENV: Benchmarking Large Language Models on Code Migration. 2719-2744 - V. S. D. S. Mahesh Akavarapu, Hrishikesh Terdalkar, Pramit Bhattacharyya, Shubhangi Agarwal, Vishakha Deulgaonkar, Chaitali Dangarikar, Pralay Manna, Arnab Bhattacharya:

A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs. 2745-2761 - Jilong Li, Zhenxi Song, Jiaqi Wang, Meishan Zhang, Honghai Liu, Min Zhang, Zhiguo Zhang:

BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation. 2762-2778 - Yahan Yu, Duzhen Zhang, Yong Ren, Xuanle Zhao, Xiuyi Chen, Chenhui Chu:

Progressive LoRA for Multimodal Continual Instruction Tuning. 2779-2796 - Lukasz Borchmann:

ARC 'Challenge' Is Not That Challenging. 2797-2804 - Vera Neplenbroek, Arianna Bisazza, Raquel Fernández:

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation. 2805-2830 - Tomás Vergara Browne, Alvaro Soto:

Tracr-Injection: Distilling Algorithms into Pre-trained Language Models. 2831-2843 - Ximing Dong, Shaowei Wang, Dayi Lin, Ahmed E. Hassan:

Model Performance-Guided Evaluation Data Selection for Effective Prompt Optimization. 2844-2859 - Wei Yao, Wenkai Yang, Ziqiao Wang, Yankai Lin, Yong Liu:

Revisiting Weak-to-Strong Generalization in Theory and Practice: Reverse KL vs. Forward KL. 2860-2888 - Felix Drinkall, Stefan Zohren, Michael McMahon, Janet B. Pierrehumbert:

Stories that (are) Move(d by) Markets: A Causal Exploration of Market Shocks and Semantic Shifts across Different Partisan Groups. 2889-2904 - Miao Yu, Shilong Wang, Guibin Zhang, Junyuan Mao, Chenlong Yin, Qijiong Liu, Kun Wang, Qingsong Wen, Yang Wang:

NetSafe: Exploring the Topological Safety of Multi-agent System. 2905-2938 - Qiji Zhou, Yifan Gong, Guangsheng Bao, Hongjie Qiu, Jinqiang Li, Xiangrong Zhu, Huajian Zhang, Yue Zhang:

Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation. 2939-2957 - Hanlun Zhu, Yunshi Lan, Xiang Li, Weining Qian:

Initializing and Retrofitting Key-Value Adaptors for Traceable Model Editing. 2958-2971 - Jiaqi Li, Yixuan Tang, Yi Yang:

Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning. 2972-2989 - Siqi Fan, Xuezhi Fang, Xingrun Xing, Peng Han, Shuo Shang, Yequan Wang:

Position-Aware Depth Decay Decoding (D³): Boosting Large Language Model Inference Efficiency. 2990-3001 - Anirudh Maiya, Razan Alghamdi, Maria Leonor Pacheco, Ashutosh Trivedi, Fabio Somenzi:

Explaining Puzzle Solutions in Natural Language: An Exploratory Study on 6x6 Sudoku. 3002-3009 - Andrea Pedrotti, Michele Papucci, Cristiano Ciaccio, Alessio Miaschi, Giovanni Puccetti, Felice Dell'Orletta, Andrea Esuli:

Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors. 3010-3031 - Siqi Ouyang,


