default search action

combined dblp search
author search
venue search
publication search

ask others

Hang Chen 0001

> Home > Persons

Person information

affiliation: University of Science and Technology of China, National Engineering Research Center of Speech and Language Information Processing, Hefei, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/inffus/ChenWWDSWPD25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/inffus/ChenWWDSWPD25
Hang Chen, Chenxi Wang, Qing Wang, Jun Du, Sabato Marco Siniscalchi, Genshun Wan, Jia Pan, Huijun Ding:
Cross-attention among spectrum, waveform and SSL representations with bidirectional knowledge distillation for speech enhancement. Inf. Fusion 122: 103218 (2025)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ChenZWDSXW25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ChenZWDSXW25
Hang Chen, Chen-Yue Zhang, Qing Wang, Jun Du, Sabato Marco Siniscalchi, Shifu Xiong, Genshun Wan:
HPCNet: Hybrid Pixel and Contour Network for Audio-Visual Speech Enhancement With Low-Quality Video. IEEE J. Sel. Top. Signal Process. 19(4): 671-684 (2025)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/HanCLD25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/HanCLD25
Yi Han, Hang Chen, Lijuan Liu, Jun Du:
Dual-Branch Codec With Orthogonality Constraint and Knowledge Distillation for Noisy Environment. IEEE Signal Process. Lett. 32: 3017-3021 (2025)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/LiCDZSNX25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/LiCDZSNX25
Ke-Wei Li, Hang Chen, Jun Du, Hengshun Zhou, Sabato Marco Siniscalchi, Shutong Niu, Shifu Xiong:
Lightweight Audio-Visual Wake Word Spotting With Diverse Acoustic Knowledge Distillation. IEEE Trans. Circuits Syst. Video Technol. 35(7): 7308-7320 (2025)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/WangWCWDL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/WangWCWDL25
Qing Wang, Yajian Wang, Hang Chen, Shuxian Wang, Jun Du, Chin-Hui Lee:
Video Segmentation and Tokenization for Model-Based Video Scene Classification. IEEE Trans. Multim. 27: 6489-6502 (2025)
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0001W0Y025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0001W0Y025
Hang Chen, Chang Wang, Jun Du, Chao-Han Huck Yang, Jun Qi:
Projection Valued-based Quantum Machine Learning Adapting to Differential Privacy Algorithm for Word-level Lipreading. ICASSP 2025: 1-5
[c30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoW0000CSS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoW0000CSS25
Ming Gao, Shilong Wu, Hang Chen, Jun Du, Chin-Hui Lee, Shinji Watanabe, Jingdong Chen, Sabato Marco Siniscalchi, Odette Scharenborg:
The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition. INTERSPEECH 2025
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xiong00SZWZL0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xiong00SZWZL0025
Shifu Xiong, Hang Chen, Shi Cheng, Kai Shen, Hengshun Zhou, Genshun Wan, Chenyue Zhang, Kewei Li, Jun Du, Lirong Dai:
MISP-QEKS: A Large-Scale Dataset with Multimodal Cues for Query-by-Example Keyword Spotting. ACM Multimedia 2025: 13148-13155
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-13971
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-13971
Ming Gao, Shilong Wu, Hang Chen, Jun Du, Chin-Hui Lee, Shinji Watanabe, Jingdong Chen, Sabato Marco Siniscalchi, Odette Scharenborg:
The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition. CoRR abs/2505.13971 (2025)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-14750
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-14750
Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Hang Chen, Jun Du:
Exploring Speaker Diarization with Mixture of Experts. CoRR abs/2506.14750 (2025)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-12334
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-12334
Qing Wang, Ya Jiang, Hang Chen, Sabato Marco Siniscalchi, Jun Du, Jianqing Gao:
Cross-Modal Knowledge Distillation with Multi-Level Data Augmentation for Low-Resource Audio-Visual Sound Event Localization and Detection. CoRR abs/2508.12334 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-05205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-05205
Jiajian Chen, Jiakang Chen, Hang Chen, Qing Wang, Yu Gao, Jun Du:
MEAN-RIR: Multi-Modal Environment-Aware Network for Robust Room Impulse Response Estimation. CoRR abs/2509.05205 (2025)
2024
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenWDYPL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenWDYPL24
Hang Chen, Qing Wang, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee:
Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2508-2521 (2024)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ChenWDWXYPL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ChenWDWXYPL24
Hang Chen, Qing Wang, Jun Du, Genshun Wan, Shifu Xiong, Baocai Yin, Jia Pan, Chin-Hui Lee:
Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading. IEEE Trans. Multim. 26: 9358-9371 (2024)
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/DaiCDWCW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/DaiCDWCW024
Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Haotian Wang, Chin-Hui Lee:
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition. CVPR 2024: 27435-27445
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LanCHCD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LanCHCD24
Hongbo Lan, Tianyou Cheng, Maokui He, Hang Chen, Jun Du:
The USTC System for Cadenza 2024 Challenge. ICASSP Workshops 2024: 57-58
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWWDLSWCSWYP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWWDLSWCSWYP24
Hang Chen, Shilong Wu, Chenxi Wang, Jun Du, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Jingdong Chen, Odette Scharenborg, Zhong-Qiu Wang, Bao-Cai Yin, Jia Pan:
Summary on the Multimodal Information-Based Speech Processing (MISP) 2023 Challenge. ICASSP Workshops 2024: 123-124
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuWCDZ0LD0CSSWP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuWCDZ0LD0CSSWP24
Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao:
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction. ICASSP 2024: 8351-8355
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuTFWCZDZSFGWPG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuTFWCZDZSFGWPG24
Minghui Wu, Haitao Tang, Jiahuan Fan, Ruoyu Wang, Hang Chen, Yanyong Zhang, Jun Du, Hengshun Zhou, Lei Sun, Xin Fang, Tian Gao, Genshun Wan, Jia Pan, Jianqing Gao:
Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization. ICASSP 2024: 10051-10055
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/ZhangCDSJL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/ZhangCDSJL24
Chen-Yue Zhang, Hang Chen, Jun Du, Sabato Marco Siniscalchi, Ya Jiang, Chin-Hui Lee:
Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge. ICME Workshops 2024: 1-6
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoCDXGBYL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoCDXGBYL024
Ming Gao, Hang Chen, Jun Du, Xin Xu, Hongxiao Guo, Hui Bu, Jianxing Yang, Ming Li, Chin-Hui Lee:
Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design. INTERSPEECH 2024
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/HanCDKXP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HanCDKXP24
Yi Han, Hang Chen, Jun Du, Chang-Qing Kong, Shifu Xiong, Jia Pan:
Layer-Adaptive Low-Rank Adaptation of Large ASR Model for Low-Resource Multilingual Scenarios. ISCSLP 2024: 696-700
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/GaoCDXGBLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/GaoCDXGBLL24
Ming Gao, Hang Chen, Jun Du, Xin Xu, Hongxiao Guo, Hui Bu, Ming Li, Chin-Hui Lee:
Summary of Low-Resource Dysarthria Wake-Up Word Spotting Challenge. SLT 2024: 592-599
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-04245
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-04245
Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee:
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition. CoRR abs/2403.04245 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10304
Ming Gao, Hang Chen, Jun Du, Xin Xu, Hongxiao Guo, Hui Bu, Jianxing Yang, Ming Li, Chin-Hui Lee:
Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design. CoRR abs/2406.10304 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-17603
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-17603
Mengzhi Wang, Shifu Xiong, Genshun Wan, Hang Chen, Jianqing Gao, Li-Rong Dai:
Deep CLAS: Deep Contextual Listen, Attend and Spell. CoRR abs/2409.17603 (2024)
2023
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ChaiCDLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ChaiCDLL23
Li Chai, Hang Chen, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech. Speech Commun. 153: 102958 (2023)
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ChenDWWRLLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenDWWRLLL23
Hang Chen, Jun Du, Zhe Wang, Chenxi Wang, Yuling Ren, Qinglong Li, Ruibo Liu, Chin-Hui Lee:
Correlated Multi-Level Speech Enhancement for Robust Real-World ASR Applications Using Mask-Waveform-Feature Optimization. APSIPA ASC 2023: 96-101
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WangDCWYZRLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WangDCWYZRLL23
Chang Wang, Jun Du, Hang Chen, Ruoyu Wang, Chao-Han Huck Yang, Jiangjiang Zhao, Yuling Ren, Qinglong Li, Chin-Hui Lee:
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition. APSIPA ASC 2023: 635-642
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WanCLWPY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WanCLWPY23
Genshun Wan, Hang Chen, Tan Liu, Chenxi Wang, Jia Pan, Zhongfu Ye:
Progressive Multi-scale Self-supervised Learning for Speech Recognition. APSIPA ASC 2023: 978-982
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WanCLPY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WanCLPY23
Genshun Wan, Hang Chen, Pengcheng Li, Jia Pan, Zhongfu Ye:
Improved Data2vec with Soft Supervised Hidden Unit for Mandarin Speech Recognition. APSIPA ASC 2023: 983-987
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WuDHNCTL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WuDHNCTL23
Shilong Wu, Jun Du, Mao-Kui He, Shutong Niu, Hang Chen, Haitao Tang, Chin-Hui Lee:
Semi-Supervised Multi-Channel Speaker Diarization With Cross-Channel Attention. ASRU 2023: 1-8
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWDWD0C0SSLY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWDWD0C0SSLY23
Hang Chen, Shilong Wu, Yusheng Dai, Zhe Wang, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge. ICASSP 2023: 1-2
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiangCDWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiangCDWL23
Ya Jiang, Hang Chen, Jun Du, Qing Wang, Chin-Hui Lee:
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion. ICASSP 2023: 1-5
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangWCHDLCWSSLYPGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangWCHDLCWSSLYPGL23
Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. ICASSP 2023: 1-5
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCDYPL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCDYPL23
Chenyue Zhang, Hang Chen, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee:
Incorporating Visual Information Reconstruction into Progressive Learning for Optimizing audio-visual Speech Enhancement. ICASSP 2023: 1-5
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/DaiCDDDJL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/DaiCDDDJL23
Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. ICME 2023: 2627-2632
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXCDSWZWMHJC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXCDSWZWMHJC23
Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song, Qing Wang, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu, Ya Jiang, Shi Cheng, Jie Zhang, Yuzhe Weng:
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023. ACM Multimedia 2023: 9531-9535
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-06326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-06326
Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition. CoRR abs/2303.06326 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-08488
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-08488
Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. CoRR abs/2308.08488 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-14638
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14638
Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee:
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge. CoRR abs/2308.14638 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07925
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07925
Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song, Qing Wang, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu, Ya Jiang, Shi Cheng, Jie Zhang, Yuzhe Weng:
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023. CoRR abs/2309.07925 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08348
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08348
Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao:
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction. CoRR abs/2309.08348 (2023)
2022
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenZDLCWSSLYPG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenZDLCWSSLYPG22
Hang Chen, Hengshun Zhou, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results. ICASSP 2022: 9266-9270
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenDDLS0SCYP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenDDLS0SCYP22
Hang Chen, Jun Du, Yusheng Dai, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Baocai Yin, Jia Pan:
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1766-1770
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangDCWL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangDCWL22
Yajian Wang, Jun Du, Hang Chen, Qing Wang, Chin-Hui Lee:
Deep Segment Model for Acoustic Scene Classification. INTERSPEECH 2022: 4177-4181
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangCJWWDL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangCJWWDL22
Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. ISCSLP 2022: 250-254
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangCDYP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangCDYP22
Chenxi Wang, Hang Chen, Jun Du, Baocai Yin, Jia Pan:
Multi-Task Joint Learning for Embedding Aware Audio-Visual Speech Enhancement. ISCSLP 2022: 255-259
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-14581
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14581
Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. CoRR abs/2210.14581 (2022)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-03480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-03480
Genshun Wan, Tan Liu, Hang Chen, Jia Pan, Cong Liu, Zhongfu Ye:
Progressive Multi-Scale Self-Supervised Learning for Speech Recognition. CoRR abs/2212.03480 (2022)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-03482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-03482
Pengcheng Li, Genshun Wan, Fenglin Ding, Hang Chen, Jianqing Gao, Jia Pan, Cong Liu:
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit. CoRR abs/2212.03482 (2022)
2021
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/ChenDHDYL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/ChenDHDYL21
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021)
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouDCJXL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouDCJXL21
Hengshun Zhou, Jun Du, Hang Chen, Zijun Jing, Shifu Xiong, Chin-Hui Lee:
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments. Interspeech 2021: 341-345
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenD00YL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenD00YL21
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005
2020
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-09561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-09561
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-14360
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-14360
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icmssp/LeiCHC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmssp/LeiCHC019
Qinhui Lei, Hang Chen, Junfeng Hou, Liang Chen, Lirong Dai:
Deep Neural Network Based Regression Approach for Acoustic Echo Cancellation. ICMSSP 2019: 94-98

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.