


default search action
Kai Sun 0006
Person information
- affiliation: Cornell University, Ithaca, NY, USA
Other persons with the same name
- Kai Sun — disambiguation page
- Kai Sun 0001
— University of Tennessee, Department of EECS, Knoxville, TN, USA (and 2 more) - Kai Sun 0002
— University of Southampton, Nanoelectronics & Nanotechnology Research Group, UK - Kai Sun 0003
— Inner Mongolia University, College of Electronic Information Engineering, Hohhot, China (and 1 more) - Kai Sun 0004
— Tsinghua University, Department of Electrical Engineering, Beijing, China (and 3 more) - Kai Sun 0005
— Imperial College London, UK - Kai Sun 0007
— Xi'an Jiaotong University, School of Mathematics and Statistics, China - Kai Sun 0008 — East China Normal University, School of Computer Science and Software Engineering, Shanghai, China
- Kai Sun 0009
— Chinese Academy of Sciences, Institute of Geographic Sciences and Natural Resources Research, Beijing, China (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[i27]Yin Huang, Yifan Ethan Xu, Kai Sun, Vera Yan, Alicia Sun, Haidar Khan, Jimmy Nguyen, Mohammad Kachuee, Zhaojiang Lin, Yue Liu, Aaron Colak, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
ConfQA: Answer Only If You Are Confident. CoRR abs/2506.07309 (2025)
[i26]Mohammad Kachuee, Teja Gollapudi, Minseok Kim, Yin Huang, Kai Sun, Xiao Yang, Jiaqi Wang, Nirav Shah, Yue Liu, Aaron Colak, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning. CoRR abs/2507.18857 (2025)
[i25]Yushi Sun, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
KERAG: Knowledge-Enhanced Retrieval-Augmented Generation for Advanced Question Answering. CoRR abs/2509.04716 (2025)
[i24]Kai Sun, Yin Huang, Srishti Mehra, Mohammad Kachuee, Xilun Chen, Renjie Tao, Zhaojiang Lin, Andrea Jessee, Nirav Shah, Alex Betty, Yue Liu, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
Knowledge Extraction on Semi-Structured Content: Does It Remain Relevant for Question Answering in the Era of LLMs? CoRR abs/2509.25107 (2025)
[i23]Zhepei Wei, Xiao Yang, Kai Sun, Jiaqi Wang, Rulin Shao, Sean Chen, Mohammad Kachuee, Teja Gollapudi, Tony Liao, Nicolas Scheffer, Rakesh Wanga, Anuj Kumar, Yu Meng, Wen-tau Yih, Xin Luna Dong:
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning. CoRR abs/2509.25760 (2025)
[i22]Shicheng Liu, Kai Sun, Lisheng Fu, Xilun Chen, Xinyuan Zhang, Zhaojiang Lin, Rulin Shao, Yue Liu, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning. CoRR abs/2510.01832 (2025)
[i21]Siddhant Arora, Haidar Khan, Kai Sun, Xin Luna Dong, Sajal Choudhary, Seungwhan Moon, Xinyuan Zhang, Adithya Sagar, Surya Teja Appini, Kaushik Patnaik, Sanat Sharma, Shinji Watanabe, Anuj Kumar, Ahmed Aly, Yue Liu, Florian Metze, Zhaojiang Lin:
Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage. CoRR abs/2510.02044 (2025)
[i20]Kai Zhang, Xinyuan Zhang, Ejaz Ahmed, Hongda Jiang, Caleb Kumar, Kai Sun, Zhaojiang Lin, Sanat Sharma, Shereen Oraby, Aaron Colak, Ahmed Aly, Anuj Kumar, Xiaozhong Liu, Xin Luna Dong:
AssoMem: Scalable Memory QA with Multi-Signal Associative Retrieval. CoRR abs/2510.10397 (2025)
[i19]Jiaqi Wang, Xiao Yang, Kai Sun, Parth Suresh, Sanat Sharma, Adam Czyzewski, Derek Andersen, Surya Teja Appini, Arkav Banerjee, Sajal Choudhary, Shervin Ghasemlou, Ziqiang Guan, Akil Iyer, Haidar Khan, Lingkun Kong, Roy Luo, Tiffany Ma, Zhen Qiao, David Tran, Wenfang Xu, Skyler Yeatman, Chen Zhou, Gunveer Gujral, Yinglong Xia, Shane Moon, Nicolas Scheffer, Nirav Shah, Eun Chang, Yue Liu, Florian Metze, Tammy Stark, Zhaleh Feizollahi, Andrea Jessee, Mangesh Pujari, Ahmed Aly, Babak Damavandi, Rakesh Wanga, Anuj Kumar, Rohit Patel, Wen-tau Yih, Xin Luna Dong:
CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark. CoRR abs/2510.26160 (2025)- 2024
[j5]Xin Luna Dong, Meng Jiang, Kai Sun, Xiao Yang:
Letter from the Special Issue Editor. IEEE Data Eng. Bull. 48(4): 2 (2024)
[j4]Xiao Yang, Yifan Ethan Xu, Kai Sun, Jiaqi Wang, Lingkun Kong, Wen-tau Yih, Xin Luna Dong:
KDD Cup CRAG competition: Systems, Findings and Learnings. IEEE Data Eng. Bull. 48(4): 163-182 (2024)
[j3]Yushi Sun
, Xin Hao, Kai Sun, Yifan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
Are Large Language Models a Good Replacement of Taxonomies? Proc. VLDB Endow. 17(11): 2919-2932 (2024)
[c11]Kai Sun, Yifan Ethan Xu, Hanwen Zha, Yue Liu, Xin Luna Dong:
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? NAACL-HLT 2024: 311-325
[c10]Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Scott Yih, Xin Dong:
CRAG - Comprehensive RAG Benchmark. NeurIPS 2024
[i18]Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
CRAG - Comprehensive RAG Benchmark. CoRR abs/2406.04744 (2024)
[i17]Yushi Sun, Hao Xin, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
Are Large Language Models a Good Replacement of Taxonomies? CoRR abs/2406.11131 (2024)
[i16]Wang Bill Zhu, Deqing Fu, Kai Sun, Yi Lu, Zhaojiang Lin, Seungwhan Moon, Kanika Narang, Mustafa Canim, Yue Liu, Anuj Kumar, Xin Luna Dong:
VisualLens: Personalization through Visual History. CoRR abs/2411.16034 (2024)- 2023
[i15]Kai Sun, Yifan Ethan Xu, Hanwen Zha, Yue Liu, Xin Luna Dong:
Head-to-Tail: How Knowledgeable are Large Language Models (LLM)? A.K.A. Will LLMs Replace Knowledge Graphs? CoRR abs/2308.10168 (2023)- 2022
[c9]Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Claire Cardie:
Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge. ACL (1) 2022: 8736-8747- 2021
[b1]Kai Sun:
Machine Reading Comprehension: Challenges and Approaches. Cornell University, USA, 2021
[c8]Dian Yu
, Kai Sun, Dong Yu, Claire Cardie:
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data. EMNLP (Findings) 2021: 56-68
[c7]Kai Sun, Seungwhan Moon, Paul A. Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie:
Adding Chit-Chat to Enhance Task-Oriented Dialogues. NAACL-HLT 2021: 1570-1583
[i14]Dian Yu, Kai Sun, Dong Yu, Claire Cardie:
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question Answering Data. CoRR abs/2102.01226 (2021)- 2020
[j2]Kai Sun, Dian Yu, Dong Yu, Claire Cardie:
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension. Trans. Assoc. Comput. Linguistics 8: 141-155 (2020)
[c6]Dian Yu, Kai Sun, Claire Cardie, Dong Yu:
Dialogue-Based Relation Extraction. ACL 2020: 4927-4940
[c5]Liang Xu, Hai Hu
, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li
, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, Zhenzhong Lan:
CLUE: A Chinese Language Understanding Evaluation Benchmark. COLING 2020: 4762-4772
[i13]Liang Xu, Xuanwei Zhang, Lu Li, Hai Hu, Chenjie Cao, Weitang Liu, Junyi Li, Yudong Li, Kai Sun, Yechen Xu, Yiming Cui, Cong Yu, Qianqian Dong, Yin Tian, Dian Yu, Bo Shi, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Zhenzhong Lan:
CLUE: A Chinese Language Understanding Evaluation Benchmark. CoRR abs/2004.05986 (2020)
[i12]Dian Yu, Kai Sun, Claire Cardie, Dong Yu:
Dialogue-Based Relation Extraction. CoRR abs/2004.08056 (2020)
[i11]Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Claire Cardie:
Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge. CoRR abs/2009.05831 (2020)
[i10]Kai Sun, Seungwhan Moon, Paul A. Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie:
Adding Chit-Chats to Enhance Task-Oriented Dialogues. CoRR abs/2010.12757 (2020)
2010 – 2019
- 2019
[j1]Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Yejin Choi, Claire Cardie:
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension. Trans. Assoc. Comput. Linguistics 7: 217-231 (2019)
[c4]Xiaoman Pan, Kai Sun, Dian Yu, Jianshu Chen, Heng Ji, Claire Cardie, Dong Yu:
Improving Question Answering with External Knowledge. MRQA@EMNLP 2019: 27-37
[c3]Hai Wang, Dian Yu, Kai Sun, Jianshu Chen, Dong Yu:
Improving Pre-Trained Multilingual Model with Vocabulary Expansion. CoNLL 2019: 316-327
[c2]Hai Wang, Dian Yu, Kai Sun, Jianshu Chen, Dong Yu, David A. McAllester, Dan Roth:
Evidence Sentence Extraction for Machine Reading Comprehension. CoNLL 2019: 696-707
[c1]Kai Sun, Dian Yu, Dong Yu, Claire Cardie:
Improving Machine Reading Comprehension with General Reading Strategies. NAACL-HLT (1) 2019: 2633-2643
[i9]Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Yejin Choi, Claire Cardie:
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension. CoRR abs/1902.00164 (2019)
[i8]Xiaoman Pan, Kai Sun, Dian Yu, Heng Ji, Dong Yu:
Improving Question Answering with External Knowledge. CoRR abs/1902.00993 (2019)
[i7]Hai Wang, Dian Yu, Kai Sun, Jianshu Chen, Dong Yu, Dan Roth, David A. McAllester:
Evidence Sentence Extraction for Machine Reading Comprehension. CoRR abs/1902.08852 (2019)
[i6]Kai Sun, Dian Yu, Dong Yu, Claire Cardie:
Probing Prior Knowledge Needed in Challenging Chinese Machine Reading Comprehension. CoRR abs/1904.09679 (2019)
[i5]Hai Wang, Dian Yu, Kai Sun, Jianshu Chen, Dong Yu:
Improving Pre-Trained Multilingual Models with Vocabulary Expansion. CoRR abs/1909.12440 (2019)- 2018
[i4]Kai Sun, Dian Yu, Dong Yu, Claire Cardie:
Improving Machine Reading Comprehension with General Reading Strategies. CoRR abs/1810.13441 (2018)- 2017
[i3]Mohamed Al-Badrashiny, Jason Bolton, Arun Tejasvi Chaganty, Kevin Clark, Craig Harman, Lifu Huang, Matthew Lamm, Jinhao Lei, Di Lu, Xiaoman Pan, Ashwin Paranjape, Ellie Pavlick, Haoruo Peng, Peng Qi, Pushpendre Rastogi, Abigail See, Kai Sun, Max Thomas, Chen-Tse Tsai, Hao Wu, Boliang Zhang, Chris Callison-Burch, Claire Cardie, Heng Ji, Christopher D. Manning, Smaranda Muresan, Owen Rambow, Dan Roth, Mark Sammons, Benjamin Van Durme:
TinkerBell: Cross-lingual Cold-Start Knowledge Base Construction. TAC 2017
[i2]Kai Sun, Claire Cardie:
Cornell Belief and Sentiment System at TAC 2017. TAC 2017- 2016
[i1]Vlad Niculae, Kai Sun, Xilun Chen, Yao Cheng, Xinya Du, Esin Durmus, Arzoo Katiyar, Claire Cardie:
Cornell Belief and Sentiment System at TAC 2016. TAC 2016
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-01-28 02:18 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







