Skip to content
View Peng-fei-Wang's full-sized avatar
πŸ’­
study
πŸ’­
study
  • Ant Group
  • Hangzhou
  • 16:31 (UTC +08:00)

Block or report Peng-fei-Wang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Peng-fei-Wang/README.md

πŸ‘‹ Hi there, I’m Pengfei Wang

πŸš€ About Me

NLP Engineer Β· AI Researcher

  • 🌱 Currently, I focus on Text-to-SQL, Agentic RL, and AI Agents.
  • πŸŽ“ I received my master's degree from Zhejiang University under the supervision of Prof. Yunjun Gao.
  • πŸ“§ [email protected]
  • πŸ”— Google Scholar

πŸ“° News

  • 2025.09: πŸ† We have achieved #1 Rank on the challenging Text-to-SQL leaderboard BIRD leaderboard with 81.67% execution accuracy!

πŸ’Ό Work Experience

Period Position & Company
2024.04–now Senior Algorithm Engineer @ Ant Digital Technologies, Ant Group
2023.05–2023.08 Summer Algorithm Intern @ Seed (formerly AI-Lab-NLP), ByteDance

πŸ“š Selected Research

  • Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling
    Pengfei Wang, Baolin Sun, Xuemei Dong, Yaxun Dai, Hongwei Yuan, Mengdie Chu, Yingqi Gao, Xiang Qi, Peng Zhang, Ying Yan
    Technical Report | Paper | Code GitHub stars

  • PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching
    Pengfei Wang, Xiaocan Zeng, Lu Chen, Fan Ye, Yuren Mao, Junhao Zhu, Yunjun Gao
    VLDB 2023 | Paper | Code

  • Towards Explainable Table Interpretation Using Multi-view Explanations
    Yunjun Gao, Pengfei Wang, Xiaocan Zeng, Lu Chen, Yuren Mao, Ziheng Wei, Miao Li
    ICDE 2023 | Paper | Code

  • CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration
    Congcong Ge, Pengfei Wang, Lu Chen, Xiaoze Liu, Baihua Zheng, Yunjun Gao
    TKDE | Paper | Code

  • MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching
    Xiaocan Zeng, Pengfei Wang, Yuren Mao, Lu Chen, Xiaoze Liu, Yunjun Gao
    ICDE 2024 | Paper | Code

  • ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities
    Yunjun Gao, Xiaoze Liu, Junyang Wu, Tianyi Li, Pengfei Wang, Lu Chen
    KDD 2022 | Paper

  • UniView: A Unified Autonomous Materialized View Management System for Various Databases
    Zhenrong Xu, Pengfei Wang, Guoze Xue, Qitong Yan, Shenghao Gong, Yelan Jiang, Yuren Mao, Yunjun Gao, Shu Shen, Wei Zhang, Dan Luo, Lu Chen
    VLDB demo 2024 | Paper

  • DESIRE: An Efficient Dynamic Cluster-based Forest Indexing for Similarity Search in Multi-Metric Spaces
    Yifan Zhu, Lu Chen, Yunjun Gao, Baihua Zheng, Pengfei Wang
    VLDB 2022 | Paper

  • Question Calibration and Multi-Hop Modeling for Temporal Question Answering
    Chao Xue, Di Liang, Pengfei Wang, Jing Zhang
    AAAI 2024 | Paper

Pinned Loading

  1. antgroup/Agentar-Scale-SQL antgroup/Agentar-Scale-SQL Public

    Agentar-Scale-SQL is a novel framework that leverages scalable computation to significantly improve Text-to-SQL performance.

    Python 319 31

  2. ZJU-DAILY/PromptEM ZJU-DAILY/PromptEM Public

    Code for the paper "PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching". VLDB 2023.

    Python 28 4

  3. ZJU-DAILY/ExplainTI ZJU-DAILY/ExplainTI Public

    Code for the paper "Towards Explainable Table Interpretation Using Multi-view Explanations". ICDE 2023.

    Python 10 1

  4. ZJU-DAILY/CollaborEM ZJU-DAILY/CollaborEM Public

    Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.

    Python 41 3

  5. ZJU-DAILY/MultiEM ZJU-DAILY/MultiEM Public

    Code for the paper "MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching". ICDE 2024.

    Python 17

  6. HUAWEI-Code-Craft-2020 HUAWEI-Code-Craft-2020 Public

    Datasets and code for the HUAWEI-Code-Craft-2020

    C++ 14 6