


default search action
APSIPA 2019: Lanzhou, China
- 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, Lanzhou, China, November 18-21, 2019. IEEE 2019, ISBN 978-1-7281-3248-8

- Guo-Chih Hong, Chung-Nan Lee, Ming-Feng Lee:

Dynamic Threshold for DDoS Mitigation in SDN Environment. 1-7 - Po-Chiang Lin:

Large-Scale and High-Dimensional Cell Outage Detection in 5G Self-Organizing Networks. 8-12 - Kouji Hirata, Takuji Tachibana:

Implementation of multiple routing configurations on software-defined networks with P4. 13-16 - Koki Shimizu, Yuya Kumai, Kimiko Motonaka, Tomotaka Kimura

, Kouji Hirata:
Evaluation of countermeasure against future malware evolution with deterministic modeling. 17-21 - Wen-Ping Lai, Kuan-Chun Chiu:

NUMAP: NUMA-aware Multi-core Pinning and Pairing for Network Slicing at the 5G Mobile Edge. 22-27 - Shaojie Yang

, Wenbo Chen, Shanxi Li, Qingxiang Xu:
Approach using Transforming Structural Data into Image for Detection of Malicious MS-DOC Files based on Deep Learning Models. 28-32 - Kavin Kamaraj, Behnam Dezfouli, Yuhong Liu:

Edge Mining on IoT Devices Using Anomaly Detection. 33-40 - Fang Feng

, Qingquan Lv, Mingsong Wang, Xuhui Yang, Qingguo Zhou, Rui Zhou:
A Hybrid Feature Selection Algorithm Applied to High-dimensional Imbalanced Small-sample Data Classification. 41-46 - Yikang Lin, Peng Zhang:

Blockchain-based Complete Self-tallying E-voting Protocol. 47-52 - Licheng Xiao, Hairong Wang, Nam Ling:

Image Compression with Deeper Learned Transformer. 53-57 - Lin Zhang, Yuhong Liu:

Modeling the Views of WeChat Articles by Branching Processes. 58-63 - Congcong Wang, Pengyu Liu, Kebin Jia, Siwei Chen:

Lightweight models for weather identification. 64-68 - Yu-Min Huang, Huan-Hsin Tseng, Jen-Tzung Chien

:
Stochastic Fusion for Multi-stream Neural Network in Video Classification. 69-74 - Jie Cao, Yinping Qiu, Dongliang Chang, Xiaoxu Li, Zhanyu Ma:

Dynamic Attention Loss for Small-Sample Image Classification. 75-79 - Xiaoxu Li, Jijie Wu, Dongliang Chang, Weifeng Huang, Zhanyu Ma, Jie Cao:

Mixed Attention Mechanism for Small-Sample Fine-grained Image Classification. 80-85 - Jie Cao, Yaofeng Zhou, Hong Yu, Xiaoxu Li, Dan Wang, Zhanyu Ma:

A Loss With Mixed Penalty for Speech Enhancement Generative Adversarial Network. 86-90 - Xiaoxu Li, Liyun Yu, Jie Cao, Dongliang Chang, Zhanyu Ma, Nian Liu:

Small-Sample Image Classification Method of Combining Prototype and Margin Learning. 91-95 - Muhammad Hasnain, Muhammad Fermi Pasha

, Chern Hong Lim
, Imran Ghani:
Recurrent Neural Network for Web Services Performance Forecasting, Ranking and Regression Testing. 96-105 - Tuan Vu Ho, Masato Akagi:

Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder. 106-111 - Berrak Sisman, Karthika Vijayan, Minghui Dong, Haizhou Li:

SINGAN: Singing Voice Conversion with Generative Adversarial Networks. 112-118 - Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:

Experimental investigation on the efficacy of Affine-DTW in the quality of voice conversion. 119-124 - Jinsen Hu, Chunyan Yu, Faqian Guan:

Non-parallel Many-to-many Singing Voice Conversion by Adversarial Learning. 125-132 - Thuan Van Ngo, Rieko Kubo, Masato Akagi:

Evaluation of the Lombard effect model on synthesizing Lombard speech in varying noise level environments with limited data. 133-137 - Hiroki Murakami, Sunao Hara, Masanobu Abe:

DNN-based Voice Conversion with Auxiliary Phonemic Information to Improve Intelligibility of Glossectomy Patients' Speech. 138-142 - Kento Matsumoto, Sunao Hara, Masanobu Abe:

Speech-like Emotional Sound Generator by WaveNet. 143-147 - Shunsuke Goto, Daisuke Saito, Nobuaki Minematsu:

DNN-based Statistical Parametric Speech Synthesis Incorporating Non-negative Matrix Factorization. 148-153 - Masanori Morise, Genta Miyashita:

Efficient quantization of vocoded speech parameters without degradation. 154-158 - Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das

, Yi Zhou, Haizhou Li:
Speaker-independent Spectral Mapping for Speech-to-Singing Conversion. 159-164 - Chuxiong Zhang, Sheng Zhang, Haibing Zhong:

A Prosodic Mandarin Text-to-Speech System Based on Tacotron. 165-169 - Jiangyan Yi, Jianhua Tao:

Distilling Knowledge for Distant Speech Recognition via Parallel Data. 170-175 - Jiangyan Yi, Jianhua Tao:

Batch Normalization based Unsupervised Speaker Adaptation for Acoustic Models. 176-180 - Ming Liu, Yujun Wang, Zhaoyu Yan, Jing Wang, Xiang Xie:

Robust Speech Recognition based on Multi-Objective Learning with GRU Network. 181-185 - Hiroshi Sato, Takafumi Moriya, Yusuke Shinohara, Ryo Masumura, Takaaki Fukutomi, Kiyoaki Matsui, Takanori Ashihara, Yoshikazu Yamaguchi, Yushi Aono:

Revisiting Dynamic Adjustment of Language Model Scaling Factor for Automatic Speech Recognition. 186-191 - Shuji Komeiji, Toshihisa Tanaka:

A Language Model-Based Design of Reduced Phoneme Set for Acoustic Model. 192-197 - Thi-Ly Vu, Zhiping Zeng, Haihua Xu, Eng Siong Chng:

Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition. 198-203 - Wataru Nakamura, Yosuke Kaga, Masakazu Fujio, Kenta Takahashi:

Security and Efficiency of Biometric Template Protection for Identification. 210-217 - Daiki Izumoto, Yasushi Yamazaki:

Security enhancement for touch panel based user authentication on smartphones. 218-223 - Tetsushi Ohki, Vishu Gupta, Masakatsu Nishigaki

:
Efficient Spoofing Attack Detection against Unknown Sample using End-to-End Anomaly Detection. 224-230 - Keisuke Takano, Hironobu Takano:

Eye-blink based Personal Authentication Using Time-series Directional Features and Waveform Features. 231-235 - Shion Tagawa, Hironobu Takano:

Personal Authentication with Eye Movement Features During PIN Input. 236-240 - Yi-Chun Lin, Yusei Suzuki, Hiroya Kawai, Koichi Ito, Hwann-Tzong Chen, Takafumi Aoki:

Attribute Estimation Using Multi-CNNs from Hand Images. 241-244 - Hyewon Song, Beom Kwon

, Seongmin Lee, Sanghoon Lee:
Dictionary based Compression Type Classification using a CNN Architecture. 245-248 - Zhihao Du, Xueliang Zhang, Jiqing Han:

Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training. 249-254 - Akira Tamamori, Tomoko Matsui

:
A sequential prediction method of quasi-periodicity based on Gaussian process state space model. 255-261 - Ming-Hsiang Su, Chung-Hsien Wu

, Po-Chen Shih:
Automatic Ontology Population Using Deep Learning for Triple Extraction. 262-267 - Karan Makhija, Thi-Nga Ho, Eng Siong Chng:

Transfer Learning for Punctuation Prediction. 268-273 - Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:

A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising. 274-278 - Ryo Tanabe, Takashi Endo, Yuki Nikaido, Kenji Ichige, Nguyen Phong, Yohei Kawaguchi

, Koichi Hamada:
Location-Independent Multi-Channel Acoustic Scene Classification Using Blind Dereverberation, Blind Source Separation, and Model Ensemble. 279-283 - Lantian Li

, Zhiyuan Tang, Ying Shi, Dong Wang:
Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification. 284-288 - Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Shijin Wang

, Lingjing Jin, Yunxia Li:
Dementia Detection by Analyzing Spontaneous Mandarin Speech. 289-296 - Hao Li, Xueliang Zhang, Guanglai Gao:

Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech. 297-301 - Jennifer Santoso, Takeshi Yamada, Shoji Makino:

Classification of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum. 302-306 - Jiahong Zhao

, Christian Ritz:
Semi-Coprime Microphone Arrays for Estimating Direction of Arrival of Speech Sources. 308-313 - Junyi Peng, Rongzhi Gu, Yuexian Zou, Wenwu Wang:

Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification. 314-319 - Yao Du, Zhiyong Wu, Shiyin Kang, Dan Su, Dong Yu, Helen Meng:

Prosodic Structure Prediction using Deep Self-attention Neural Network. 320-324 - Rongzhi Gu, Junyi Peng, Yuexian Zou, Dong Yu:

Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation. 325-331 - Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari:

Acceleration of rank-constrained spatial covariance matrix estimation for blind speech extraction. 332-338 - Hsiao-Tzu Hung, Chung-Yang Wang, Yi-Hsuan Yang, Hsin-Min Wang

:
Improving Automatic Jazz Melody Generation by Transfer Learning Techniques. 339-346 - Yupeng Shi, Nengheng Zheng, Yuyong Kang, Weicong Rong:

Speech Loss Compensation by Generative Adversarial Networks. 347-351 - Keisuke Nishijima, Ken'ichi Furuya

:
Snoring sound classification using multiclass classifier under actual environments. 352-356 - Meng Liang, Zhong-Hua Fu, Xiang Zhao, Jinglei Zhou, Haikun Wang:

Nonlinear Echo Cancellation Based on Polyphase Filter Bank. 357-362 - Yunqi Cai, Dong Wang:

Question Mark Prediction By Bert. 363-367 - Li Li, Jianwu Dang, Yangping Wang, Song Wang, Zhenhai Zhang:

Part-Based Bilinear CNN For Person Re-Identification. 368-374 - Christoph M. Wilk, Shigeki Sagayama:

Polyphonic Voicing Optimization for Automatic Music Completion. 375-382 - Bo-Cheng Jiang, Chung-Nan Lee:

Online Layered Multiple Object Tracking Using Residual-Residual Networks. 383-390 - Biao Yue, Yangping Wang, Yongzhi Min, Zhenhai Zhang, Wenrun Wang, Jiu Yong:

Rail Surface Defect Recognition Method Based on AdaBoost Multi-classifier Combination. 391-396 - Qiuxian Zhang, Jiangyan Yi, Jianhua Tao, Mingliang Gu, Yong Ma:

Focal Loss for End-to-end Short Utterances Chinese Dialect Identification. 397-401 - Daisuke Saito, So Suzuki, Nobuaki Minematsu:

Speech representation based on tensor factor analysis and its application to speaker recognition and language identification. 402-406 - Jacob Lambert, Eijiro Takeuchi, Kazuya Takeda:

Optimizing Learned Object Detection on Point Clouds from 3D Lidars Through Range and Sparsity Information. 407-413 - Huan-Yu Chen, Yun-Shao Lin, Chi-Chun Lee

:
Through the Eyes of Viewers: A Comment-Enhanced Media Content Representation for TED Talks Impression Recognition. 414-418 - Yuanfang Zhao, Yunli Chen:

End-to-end autonomous driving based on the convolution neural network model. 419-423 - Ping-Rong Chen, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin:

DSNet: An Efficient CNN for Road Scene Segmentation. 424-432 - Miao Zhao, Rongjin Li, Shijiang Yan, Zheng Li, Hao Lu, Shipeng Xia, Qingyang Hong, Lin Li:

Phone-Aware Multi-task Learning and Length Expanding for Short-Duration Language Recognition. 433-437 - Zhibo Rao

, Mingyi He
, Zhidong Zhu, Yuchao Dai, Renjie He:
SDBF-Net: Semantic and Disparity Bidirectional Fusion Network for 3D Semantic Detection on Incidental Satellite Images. 438-444 - Zhiyong Chen, Zongze Ren, Shugong Xu

:
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification. 445-449 - Zheng Li, Hao Lu, Jianfeng Zhou, Lin Li, Qingyang Hong:

Speaker Embedding Extraction with Multi-feature Integration Structure. 450-454 - Zhuozheng Wang

, Meng Zhang, Wei Liu:
An Effective Road Extraction Method from Remote Sensing Images Based on Self-Adaptive Threshold Function. 455-460 - Jiayao Wu, Zhiyuan Tang, Dong Wang:

Structure Growth for Small-Footprint Speech Recognition. 461-465 - Na Li, Yongfei Zhang, Yun Zhang

, C.-C. Jay Kuo
:
On Energy Compaction of 2D Saab Image Transforms. 466-475 - Hangjing Zhang

, Yuejiang Li, Yang Hu, Yan Chen
, H. Vicky Zhao:
Measuring the Hazard of Malicious Nodes in Information Diffusion over Social Networks. 476-481 - Benliu Qiu

, Yuejiang Li, Yan Chen
, H. Vicky Zhao:
Controlling Information Diffusion with Irrational Users. 482-485 - Hong Hu

, Yuejiang Li, H. Vicky Zhao, Yan Chen
:
Modeling Multi-source Information Diffusion: A Graphical Evolutionary Game Approach. 486-492 - Zheming Yang, Wen Ji:

A Universal Intelligence Measurement Method Based on Meta-analysis. 493-498 - Qinyuan Ye, Yuejiang Li, Yan Chen

, H. Vicky Zhao:
Modeling Content Interaction in Information Diffusion with Pre-trained Sentence Embedding. 499-507 - Fu-Sheng Tsai, Yi-Ming Weng, Chip-Jin Ng, Chi-Chun Lee

:
Pain versus Affect? An Investigation in the Relationship between Observed Emotional States and Self-Reported Pain. 508-512 - Yuxuan Xi, Pengcheng Li, Yan Song, Yiheng Jiang, Lirong Dai:

Speaker to Emotion: Domain Adaptation for Speech Emotion Recognition with Residual Adapters. 513-518 - Bagus Tris Atmaja

, Kiyoaki Shirai, Masato Akagi:
Speech Emotion Recognition Using Speech Feature and Word Embedding. 519-523 - Zhichao Peng, Zhi Zhu, Masashi Unoki

, Jianwu Dang, Masato Akagi:
Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Networks. 524-528 - Lu Yi, Man-Wai Mak:

Adversarial Data Augmentation Network for Speech Emotion Recognition. 529-534 - Xueyi Wang, Lantian Li

, Dong Wang:
VAE-based Domain Adaptation for Speaker Verification. 535-539 - Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng:

Replay detection using CQT-based modified group delay feature and ResNeWt network in ASVspoof 2019. 540-545 - Zhimin Feng, Qiqi Tong, Yanhua Long, Shuang Wei, Chunxia Yang, Qiaozheng Zhang:

SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge. 548-552 - Bin Gu, Wu Guo, Yao Liu, Jian Sun:

Clustering-Based Score Normalization for Speaker Verification. 553-557 - Zongze Ren, Zhiyong Chen, Shugong Xu

:
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification. 558-562 - Wei-Cheng Liao, Jian-Jiun Ding:

Automatic Handwriting Verification and Suspect Identification for Chinese Characters Using Space and Frequency Domain Features. 563-571 - Dongkwon Jin, Kyungsun Lim, Chang-Su Kim

:
Robust Change Detection in High Resolution Satellite Images with Geometric Distortions. 572-577 - Zhibo Rao

, Mingyi He
, Yuchao Dai, Zhidong Zhu, Bo Li
, Renjie He:
MSDC-Net: Multi-Scale Dense and Contextual Networks for Stereo Matching. 578-583 - Zheng Cheng

, Ping Han, Binbin Han, Jiahui Sun:
Classification of Polarimetric SAR Image based on Improved Fuzzy Clustering. 584-589 - Nien-Hsin Chou, Li-Chung Chuang, Ming-Sui Lee:

Intensity-aware GAN for Single Image Reflection Removal. 590-594 - Aamir Naveed Abbasi, Mingyi He

:
CNN with ICA-PCA-DCT Joint Preprocessing for Hyperspectral Image Classification. 595-600 - Man Wang, Fangkun Qi, Hongwu Yang

, Jingwen Sun:
Dongxiang speech synthesis based on statistical parameter method. 601-607 - Daichi Kondo, Masanori Morise:

Human-in-the-loop speech-design system and its evaluation. 608-612 - Masanori Morise, Takuro Shono:

High-quality waveform generator from fundamental frequency, spectral envelope, and band aperiodicity. 613-617 - Hyeonjoo Kang, Young-Sun Joo

, Inseon Jang, Chunghyun Ahn, Hong-Goo Kang:
A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis. 618-622 - Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Lirong Dai:

End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. 623-627 - Jingwen Sun, Gang Zhou, Hongwu Yang

, Man Wang:
End-to-end Tibetan Ando dialect speech recognition based on hybrid CTC/attention architecture. 628-632 - Sining Sun, Shuran Zhou, Mei-Yuh Hwang, Lei Xie, Qin Li, Xin Lei:

Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition. 633-637 - Zhaoyi Liu, Yuexian Zou:

Teacher-Student BLSTM Mask Model for Robust Acoustic Beamforming. 638-643 - Fanchang Meng, Shouye Peng, Guohui Zhang:

Using Convolution and Sequence-discriminative Training to Improving Children Speech Recognition. 644-649 - Yahui Shan, Min Liu, Qingran Zhan, Shixuan Du, Jing Wang, Xiang Xie:

Speech Recognition Based on Deep Tensor Neural Network and Multifactor Feature. 650-654 - Ryo Masumura, Yusuke Ijima, Satoshi Kobashikawa, Takanobu Oba, Yushi Aono:

Can We Simulate Generative Process of Acoustic Modeling Data? Towards Data Restoration for Acoustic Modeling. 655-661 - Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Ye Bai:

Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network. 662-666 - Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:

Domain Adversarial Training for Speech Enhancement. 667-672 - Fuqiang Ye, Yu Tsao

, Fei Chen:
Subjective Feedback-based Neural Network Pruning for Speech Enhancement. 673-677 - Tassadaq Hussain

, Yu Tsao
, Hsin-Min Wang
, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao
:
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement. 678-683 - Xupeng Jia, Dongmei Li:

Speech Enhancement Based on Deep Mixture of Distinguishing Experts. 684-688 - Jingjun Liang, Shizhe Chen, Qin Jin:

Semi-supervised Multimodal Emotion Recognition with Improved Wasserstein GANs. 695-703 - Yijun Yuan, Jinwei Wan, Bo Chen:

Robust Attack on Deep Learning based Radar HRRP Target Recognition. 704-707 - Xiaoyong Lu, Yanqin Li, Haizhen An, Tao Pan, Renjun Li, Yanbin Hu, Aibao Zhou, Hongwu Yang

:
Development of a Chinese Depressed Speech Corpus Based on The Disturbed Effect of Self-Processing. 718-722 - Junichiro Yoshimoto, Jumpei Ozaki, Kohta Mizutani, Takashi Nakano

, Kazushi Ikeda, Takayuki Yamashita:
Statistical analysis on characteristic whisker movements observed in reward processing. 723-726 - Mikiko Konda, Takatomi Kubo, Naruki Morimura, Kazushi Ikeda:

Interaction Analysis in Hunting Behavior of Finless Porpoises. 727-730 - Yuichi Sakumura, Katsuyuki Kunida:

Extraction of Biomolecular Signals Controlling Complex Behavior of Biological Cells. 731-735 - Yaming Hu, Shun Nakamura, Tsuyoshi Yamanaka, Toshihisa Tanaka:

Physiological signals responses to normal and abnormal brake events in simulated autonomous car. 736-740 - Boning Li

, Xuyang Zhao, Qibin Zhao, Toshihisa Tanaka, Jianting Cao:
A One-Dimensional Convolutional Neural Network Model for Automated Localization of Epileptic Foci. 741-744 - Zhichao Zhang, Maokang Luo, Ke Deng, Tao Yu:

Cohen's class time-frequency representation in linear canonical domains: definition and properties. 745-752 - Xiaolong Chen

, Qiaowen Jiang, Ningyuan Su, Baoxin Chen, Jian Guan:
LFM Signal Detection and Estimation Based on Deep Convolutional Neural Network. 753-758 - Yannan Sun, Bingzhao Li:

Nonuniform fast linear canonical transform. 759-764 - Bing Deng, Qingshun Huang, Lin Zhang:

Digital implementation of Hilbert Transform in the LCT domain associated with FIR filter. 770-773 - Juan Zhao, Xia Bai:

Adaptive Matching Pursuit Method Based on Auxiliary Residual for Sparse Signal Recovery. 774-778 - Navid Tafaghodi Khajavi, Anthony Kuh:

Decomposition of Covariance Matrix Using Cascade of Trees. 779-783 - Junho Jo, Jae Woong Soh, Nam Ik Cho:

Handwritten Text Segmentation in Scribbled Document via Unsupervised Domain Adaptation. 784-790 - Chunyao Fang, Kebin Jia, Pengyu Liu, Liang Zhang:

Research on Cloud Recognition Technology Based on Transfer Learning. 791-796 - Muwei Jian

, Ruihong Wang, Hui Yu
, Junyu Dong, Yujuan Wang, Yilong Yin, Kin-Man Lam:
Saliency Detection via Robust Seed Selection of Foreground and Background Priors. 797-801 - Yuma Kinoshita, Kouki Seo, Hitoshi Kiya:

A Hue Correction Scheme Based on Constant-Hue Plane for Color Image Enhancement. 802-806 - S. K. Felix Yu, Zi-Xin Xu, Yuk-Hee Chan, Daniel Pak-Kong Lun:

A spatial domain secret image embedding technique with image authentication feature. 807-813 - Jing-Ming Guo, Sankarasrinivasan Seshathiri

:
Reconstruction of Multitone BTC Images using Conditional Generative Adversarial Nets. 814-817 - Yuanjun Zhao, Roberto Togneri, Victor Sreeram:

Data augmentation and post selection for improved replay attack detection. 818-821 - Bin Liu, Shuai Nie, Wenju Liu, Hui Zhang, Xiangang Li, Changliang Li:

Deep Segment Attentive Embedding for Duration Robust Speaker Verification. 822-826 - Qian-Bei Hong, Chung-Hsien Wu

, Ming-Hsiang Su, Hsin-Min Wang
:
Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification. 827-832 - Ryoya Yaguchi, Sayaka Shiota, Nobutaka Ono, Hitoshi Kiya:

Improving replay attack detection by combination of spatial and spectral features. 833-837 - Yitong Liu, Rohan Kumar Das

, Haizhou Li:
Multi-band Spectral Entropy Information for Detection of Replay Attacks. 838-843 - Jingyi Xu, Junfeng Hou, Yan Song, Wu Guo, Lirong Dai:

Knowledge Distillation from Multilingual and Monolingual Teachers for End-to-End Multilingual Speech Recognition. 844-849 - Rui Na, Junfeng Hou, Wu Guo, Yan Song, Lirong Dai:

Learning Adaptive Downsampling Encoding for Online End-to-End Speech Recognition. 850-854 - Yueh-Ting Lee, Xuan-Bo Chen, Hung-Shin Lee, Jyh-Shing Roger Jang, Hsin-Min Wang

:
Multi-task Learning for Acoustic Modeling Using Articulatory Attributes. 855-861 - Yuuki Tachioka:

Hypothesis Correction Based on Semi-character Recurrent Neural Network for End-to-end Speech Recognition. 862-867 - Haoxin Ma, Ye Bai, Jiangyan Yi, Jianhua Tao:

Hypersphere Embedding and Additive Margin for Query-by-example Keyword Spotting. 868-872 - Nan Zhou, Jun Du, Yanhui Tu, Tian Gao, Chin-Hui Lee:

A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition. 873-877 - Jing Yuan, Changchun Bao:

CycleGAN-based speech enhancement for the unpaired training data. 878-883 - Rui Cheng

, Changchun Bao:
Phase Unwrapping Based Speech Enhancement. 884-889 - Dujuan Wang, Changchun Bao:

End-to-End Speech Enhancement Using Fully Convolutional Networks with Skip Connections. 890-895 - Jingdong Li, Hui Zhang, Xueliang Zhang, Changliang Li:

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks. 896-900 - Yao Zhou, Changchun Bao, Rui Cheng

:
GSC Based Speech Enhancement with Generative Adversarial Network. 901-906 - Hideki Kawahara, Ken-Ichi Sakakibara, Eri Haneishi, Kaori Hagiwara:

Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope. 907-910 - Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:

Likability Estimation of Call-center Agents by Suppressing Annotator Variability. 911-916 - Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:

Urgent Voicemail Detection Focused on Long-term Temporal Variation. 917-921 - Liangqi Liu, Zhiyong Wu, Runnan Li, Jia Jia, Helen Meng:

Learning Contextual Representation with Convolution Bank and Multi-head Self-attention for Speech Emphasis Detection. 922-926 - Xiaoqun Dong, Xueqin Zhao:

Effect of Relative Frequency of Lexical Meanings on Accessing Lexical Ambiguities: Evidence from the Coordinator 'and'. 927-932 - Naoki Umeno, Masaru Yamashita, Hiroyuki Takada

, Shoichi Matsunaga:
Training Data Expansion for Classification between Normal and Abnormal Lung Sounds. 935-938 - Xinjie Shi, Tianqi Wang, Lan Wang, Hanjun Liu, Nan Yan:

Hybrid Convolutional Recurrent Neural Networks Outperform CNN and RNN in Task-state EEG Detection for Parkinson's Disease. 939-944 - Jinfeng Huang

, Bin Zhao, Jianwu Dang, Minbo Chen:
Investigation of speech-planning mechanism based on eye movement and EEG. 945-950 - Takeshi D. Itoh, Takatomi Kubo, Kiyoka Ikeda, Yuki Maruno

, Yoshiharu Ikutani, Hideaki Hata
, Kenichi Matsumoto, Kazushi Ikeda:
Towards Generation of Visual Attention Map for Source Code. 951-954 - Xiaokong Miao, Meng Sun

, Xiongwei Zhang:
Voice Conversion by Dual-Domain Bidirectional Long Short-Term Memory Networks with Temporal Attention. 955-959 - Siwei Chen, Kebin Jia, Pengyu Liu, Xunping Huang:

Taxi Drivers' Smoking Behavior Detection in Traffic Monitoring Video. 968-973 - Jiaqi Feng, Shuai Li, Yunfeng Sui, Lingtong Meng, Ce Zhu:

Integrating Action-aware Features for Saliency Prediction via Weakly Supervised Learning. 974-979 - Hochang Rhee, Nam Ik Cho:

Efficient and Robust Pseudo-Labeling for Unsupervised Domain Adaptation. 980-985 - Wei Gao

:
A Multi-Objective Optimization Perspective for Joint Consideration of Video Coding Quality. 986-991 - Yuyang Liu, Hongwei Guo, Ce Zhu, Yipeng Liu:

Spherical Position Dependent Rate-Distortion Optimization for 360-degree Video Coding. 992-996 - Qiang Fang:

Is average RMSE appropriate for evaluating acoustic-to-articulatory inversion? 997-1003 - Wenwei Dong, Yanlu Xie:

Normalization of GOP for Chinese Mispronunciation Detection. 1004-1008 - Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Takanobu Oba, Yushi Aono:

Disfluency Detection Based on Speech-Aware Token-by-Token Sequence Labeling with BLSTM-CRFs and Attention Mechanisms. 1009-1013 - Rui Yang, Zhen-Hua Ling:

Linguistic Steganography by Sampling-based Language Generation. 1014-1019 - Wenwei Dong, Yanlu Xie, Binghuai Lin:

Unsupervised Pronunciation Fluency Scoring by infoGan. 1020-1023 - Zhenye Gan, Yi Jiao, Hongwu Yang

, Gaungying Zhao, Zhimeng Song:
Study on the Tones Biases of Mandarin Speaker in Amdo Tibetan Areas Based on Statistics. 1024-1028 - Leilan Zhang, Qiang Zhou:

Automatically Annotate TV Series Subtitles for Dialogue Corpus Construction. 1029-1035 - Leilan Zhang, Qiang Zhou:

Topic Segmentation for Dialogue Stream. 1036-1043 - Huang-Cheng Chou

, Yi-Wen Liu, Chi-Chun Lee
:
Joint Learning of Conversational Temporal Dynamics and Acoustic Features for Speech Deception Detection in Dialog Games. 1044-1050 - Kengo Ohta, Ryota Nishimura, Norihide Kitaoka:

Type of Response Selection utilizing User Utterance Word Sequence, LSTM and Multi-task Learning for Chat-like Spoken Dialog Systems. 1051-1055 - Aijun Li, Gan Huang, Zhiqiang Li:

Prosodic Cues in the Interpretation of Echo Questions in Chinese Spoken Dialogues. 1056-1061 - Julan Xie, Fanghao Cheng, Zishu He, Huiyong Li:

A DOA Estimation Method of coherent and uncorrelated sources based on Nested Arrays. 1062-1065 - Julan Xie, Fanghao Cheng, Zishu He, Huiyong Li:

A DOA Estimation Method in the presence of unknown mutual coupling based on Nested Arrays. 1066-1071 - Feiran Yang, Jun Yang, Felix Albu

:
An Alternative Solution to the Dynamically Regularized RLS Algorithm. 1072-1075 - Xinqi Huang, Yingsong Li, Felix Albu

:
A Norm Constraint Lorentzian Algorithm Under Alpha-stable Measurement Noise. 1076-1079 - Liyun Xu:

Random Signal Estimation by Ergodicity associated with Linear Canonical Transform. 1080-1083 - Shanpeng Zhao, Shaoxiang Zhao, Youpeng Zhang, Zhengjie Xu:

Study on Pre-warning Model of Railway Signal System with Fuzzy Analytic Hierarchy Process. 1084-1090 - Yongzhi Min, Jie Hu:

Calibration of Position and Orientation between Cameras without Common Field of View Using Cooperative Target. 1100-1104 - Shu-Feng Duan, Ligu Zhu, Yujing Shi, Lei Zhang, Bo Hui:

Frequency Decomposition Model of Popularity Evolution in Online Social Media. 1105-1111 - Yanyan Wang, Yingsong Li, Lu Shen, Yuriy V. Zakharov:

Acoustic-Domain Self-Interference Cancellation for Full-Duplex Underwater Acoustic Communication Systems. 1112-1116 - Zhicheng Guo, Jianwu Dang, Yangping Wang, Jing Jin:

Background Modeling Algorithm for Multi-feature Fusion. 1117-1121 - Yi-Fan Chen, Amey Kiran Patel, Chia-Ping Chen:

Image Haze Removal By Adaptive CycleGAN. 1122-1127 - Jinxiang Liang, Jianwu Dang, Yangping Wang, Jingyu Yang, Zhenhai Zhang:

Remote Sensing Image Scene Classification Based on SURF Feature and Deep Learning. 1128-1133 - ShaoQuan Wang, DeYong Gao, Yangping Wang, Song Wang:

An Improved Retinex low-illumination image enhancement algorithm. 1134-1139 - Peiya Li, Zhenhui Situ:

Encrypted JPEG image retrieval using histograms of transformed coefficients. 1140-1144 - Changmeng Peng

, Luting Cai, Zhizhong Fu, Xiaofeng Li:
CNN-based bit-depth enhancement by the suppression of false contour and color distortion. 1145-1151 - Lixin Pan, Sheng Li

, Longbiao Wang, Jianwu Dang:
Effective Training End-to-End ASR systems for Low-resource Lhasa Dialect of Tibetan Language. 1152-1156 - Tianjiao Xu, Hui Zhang, Xueliang Zhang:

Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement. 1157-1162 - Haruka Tanji, Kazunori Kojima, Hiroaki Nanjo, Shi-wook Lee

, Yoshiaki Itoh:
A Rescoring Method Using Web Search and Word Vectors for Spoken Term Detection. 1163-1167 - Nguyen Binh Thien, Yukoh Wakabayashi, Takahiro Fukumori, Takanobu Nishiura:

Derivative of instantaneous frequency for voice activity detection using phase-based approach. 1168-1172 - Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Bin Liu:

Voice Activity Detection Based on Time-Delay Neural Networks. 1173-1178 - Wei-Cheng Lin, Yu Tsao

, Fei Chen, Hsin-Min Wang
:
Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement. 1179-1184 - Tianjiao Xu, Hao Li, Hui Zhang, Xueliang Zhang:

Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection. 1185-1189 - Karthika Vijayan, K. Sri Rama Murty

, Haizhou Li:
Allpass Modeling of Phase Spectrum of Speech Signals for Formant Tracking. 1190-1196 - Minghao Guo, Cai Rui, Wei Wang, Binghuai Lin, Jinsong Zhang, Yanlu Xie:

A Study on Mispronunciation Detection Based on Fine-grained Speech Attribute. 1197-1201 - Ze-Yu Zou, Yun-Xia Liu, Wen-Na Zhang, Yuehui Chen, Yun-Li Zang, Yang Yang, Bonnie Ngai-Fong Law:

Robust Camera Model Identification Based on Richer Convolutional Feature Network. 1202-1207 - Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki

:
A Robust Method for Blindly Estimating Speech Transmission Index using Convolutional Neural Network with Temporal Amplitude Envelope. 1208-1214 - Dan He, Yubin Zhong:

Compressing Speech Recognition Networks with MLP via Tensor-Train Decomposition. 1215-1219 - Wenxia Lu, Lijun Zhang, Jie Chen, Jingdong Chen:

Generalized Combined Nonlinear Adaptive Filters for Nonlinear Acoustic Echo Cancellation. 1220-1225 - Yao Du, Zhiyong Wu, Shiyin Kang, Dan Su, Dong Yu, Helen Meng:

Automatic Prosodic Structure Labeling using DNN-BGRU-CRF Hybrid Neural Network. 1234-1238 - Feng Li, Kaizhi Qian, Mark Hasegawa-Johnson, Masato Akagi:

Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking. 1239-1243 - Qing Zhou, Yong Ma, Benyan Luo, Mingliang Gu, Zude Zhu:

Identification of Alzheimer's Disease Patients Based on Oral Speech Features. 1244-1249 - Yang Yi, Kuan-Yu Chen, Hung-Yan Gu:

Mixture of CNN Experts from Multiple Acoustic Feature Domain for Music Genre Classification. 1250-1255 - Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun:

Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation. 1256-1261 - Jian Sun, Wu Guo, Bin Gu, Yao Liu:

Bidirectional Temporal Convolution with Self-Attention Network for CTC-Based Acoustic Modeling. 1262-1266 - Kun Zhang, Zhiyong Wu, Jia Jia, Helen M. Meng, Binheng Song:

Query-by-Example Spoken Term Detection using Attentive Pooling Networks. 1267-1272 - Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Hemant A. Patil:

Novel Adaptive Generative Adversarial Network for Voice Conversion. 1273-1281 - Yi Zhou, Xiaohai Tian, Rohan Kumar Das

, Haizhou Li:
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network. 1282-1287 - Ping Gao, Cheng-You You, Tai-Shih Chi:

A Multi-Scale Fully Convolutional Network for Singing Melody Extraction. 1288-1293 - Guanyu Li, Lisai Luo, Chunwei Gong, Shiliang Lv

:
End-to-end Tibetan Speech Synthesis Based on Phones and Semi-syllables. 1294-1297 - Neelesh Nursiah, KokSheik Wong

, Minoru Kuribayashi
:
Reversible Data Hiding in PDF Document Exploiting Prefix Zeros in Glyph Coordinates. 1298-1302 - Jianyuan Wu, Zheng Wang, Hui Zeng

, Xiangui Kang:
Multiple-Operation Image Anti-Forensics with WGAN-GP Framework. 1303-1307 - Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng:

Improving code-switching speech recognition with data augmentation and system combination. 1308-1312 - Jisheng Bai, Chen Chen, Jianfeng Chen:

A Multi-feature Fusion Based Method For Urban Sound Tagging. 1313-1317 - Taiki Izumi, Shingo Uenohara, Ken'ichi Furuya

, Yuuki Tachioka:
Activation Driven Synchronized Joint Diagonalization for Underdetermined Sound Source Separation. 1318-1322 - Weiqing Wang, Haiwei Wu, Ming Li:

Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection. 1323-1327 - Liang He

, Xianhong Chen, Can Xu, Jia Liu:
Subtraction-Positive Similarity Learning. 1328-1332 - Zhuozheng Wang

, Yingjie Dong, Wei Liu:
A Novel Effective Dimensionality Reduction Algorithm for Water Chiller Fault Data. 1333-1341 - Ying Chen, Wentao Xiao, Jie Cui, Hanyu Xu:

Speech Prosody and Eye Movements in Processing Discourse Information: A Preliminary Study in Mandarin Chinese. 1342-1346 - Guan-Bo Wang, Wei-Qiang Zhang:

An RNN and CRNN Based Approach to Robust Voice Activity Detection. 1347-1350 - Linna Zhou, Derui Liao:

Study of Chinese Text Steganography using Typos. 1351-1357 - Kosuke Fukumori, Toshihisa Tanaka:

A Simple Gaussian Kernel Classifier with Automated Hyperparameter Tuning. 1358-1363 - Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, Lei Xie:

Exploring RNN-Transducer for Chinese speech recognition. 1364-1369 - Jiu Yong, Yangping Wang, Xiaomei Lei, Fang Yong, Zhenhai Zhang:

Long-term 3D Registration Method Based on LCT Tracking and Improved ORB Detection. 1370-1379 - Masahiro Tsumori, Shinichiro Nagai, Ryosuke Harakawa, Toru Sasaki

, Masahiro Iwahashi:
Restoration of Minute Light Emissions Observed by Streak Camera Based on N-CUP Method. 1380-1384 - Kheng Hui Ng, Yiqi Tew

, Mum Wai Yip:
A Prefatory Study on Data Channelling Mechanism towards Industry 4.0. 1385-1390 - Weiwei Shan, Shogo Muramatsu, Akira Oshima, Hiroyoshi Yamada:

Successive Stripe Artifact Removal Based on Robust PCA for Millimeter Wave Automotive Radar Image. 1391-1394 - Thittaporn Ganokratanaa

, Supavadee Aramvith
, Nicu Sebe
:
Anomaly Event Detection Using Generative Adversarial Network for Surveillance Videos. 1395-1399 - Tien-Hong Lo, Berlin Chen:

Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data. 1400-1404 - Toranosuke Tanio, Kouya Takeda, Jaehoon Yu, Masanori Hashimoto

:
Training Data Reduction using Support Vectors for Neural Networks. 1405-1410 - Shota Fukui, Jaehoon Yu, Masanori Hashimoto

:
Distilling Knowledge for Non-Neural Networks. 1411-1416 - Meng Meng, Go Tanaka:

Proposal of Minimization Problem Based Lightness Modification Method Considering Visual Characteristics of Protanopia and Deuteranopia. 1417-1422 - Hiroshi Tsutsui

, Kentaro Yamada, Akihiro Sudou, Yoshikazu Miyanaga
:
An Evaluation of Stack Light Indicator Color Detection System Using Web Cameras for Automatic Production Lines. 1423-1426 - Dingli Luo, Songlin Du, Takeshi Ikenaga:

Multi-Task and Multi-Level Detection Neural Network Based Real-Time 3D Pose Estimation. 1427-1434 - Yu Wang, Xueting Li, Yun Zhu, Feilong He:

A Fast Inter-view Mode Selection Algorithm Based on Video Array Processor. 1435-1442 - Junyong Deng, Haoyue Wu, Rui Shan, Yiwen Fu, Xinchuang Liu, Ping Wang:

NPFONoC: A Low-loss, Non-blocking, Scalable Passive Optical Interconnect Network-on-Chip Architecture. 1443-1448 - Xiaoyan Xie, Xiang Lei, Jinna Zhou, Yun Zhu, Lin Jiang:

A Reconfigurable Implementation of Motion Compensation in HEVC. 1449-1454 - Bowen Zhang, Huaxi Gu, Ruiqi Guo:

SCRA: A Hybrid Deterministic Routing Algorithm for Aging-Resilient Network-an-Chip. 1455-1458 - Ryota Sugimoto, Osamu Takyu:

Access Decision based on Secure Capacity for prevention to CSI Impersonation of Untrusted Relay. 1459-1462 - Akinori Kamio, Fumihito Sasamori, Shiro Handa, Osamu Takyu, Mai Ohta, Takeo Fujii:

Recognition and Countermeasure to Hidden Terminal Problem by Packet Analysis in Wireless LAN. 1463-1467 - Shunsuke Tsuchida, Takumi Takahashi, Shinsuke Ibi, Seiichi Sampei:

Machine Learning-Aided Indoor Positioning Based on Unified Fingerprints of Wi-Fi and BLE. 1468-1472 - Kazunori Hayashi

, Ayano Nakai-Kasai, Ryo Hayakawa:
An Overloaded SC-CP IoT Signal Detection Method via Sparse Complex Discrete-Valued Vector Reconstruction. 1473-1478 - Jumpei Kawakami, Hendrik Lumbantoruan, Koichi Adachi:

NOMA Based UAV Relay Communication Protocol in Cellular Network. 1479-1484 - Changyan Zheng, Jibin Yang, Xiongwei Zhang, Meng Sun

, Kun Yao:
Improving the Spectra Recovering of Bone-Conducted Speech via Structural SIMilarity Loss Function. 1485-1490 - Ke-Xin He, Wei-Qiang Zhang, Jia Liu, Yao Liu:

Dilated-Gated Convolutional Neural Network with A New Loss Function on Sound Event Detection. 1491-1495 - Bolun Wang, Zhong-Hua Fu, Hao Wu:

Augmented Strategy For Polyphonic Sound Event Detection. 1496-1500 - Rui Wang, Mou Wang

, Xiao-Lei Zhang, Susanto Rahardja
:
Domain Adaptation Neural Network for Acoustic Scene Classification in Mismatched Conditions. 1501-1505 - Ziye Yang, Xiao-Lei Zhang:

Boosting Spatial Information for Deep Learning Based Multichannel Speaker-Independent Speech Separation In Reverberant Environments. 1506-1510 - Mou Wang

, Rui Wang, Xiao-Lei Zhang, Susanto Rahardja
:
Hybrid Constant-Q Transform Based CNN Ensemble for Acoustic Scene Classification. 1511-1516 - Jiakang Li, Meng Sun

, Xiongwei Zhang:
Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection. 1517-1522 - Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Hideki Banno, Masanori Morise, Toshio Irino:

Frequency domain variant of Velvet noise and its application to acoustic measurements. 1523-1532 - Beth Jelfs

, Christopher Gilliam
:
Fast & Efficient Delay Estimation Using Local All-Pass & Kalman Filters. 1533-1539 - Qian Ren, Zhenhai Zhang:

Dynamic Adjustment of Railway Emergency Plan Based on Utility Risk Entropy. 1540-1544 - Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, Ankur T. Patil, Rajul Acharya, Hemant A. Patil:

Speech Demodulation-based Techniques for Replay and Presentation Attack Detection. 1545-1550 - Huachao Lu, Zhijin Zhao:

Spectrum Sensing Algorithm Based on LSTM and Its Implementation of Multiple USRP. 1551-1555 - Woojae Kim, Jaekyung Kim, Sanghoon Lee:

Quality of Experience using Deep Convolutional Neural Networks and future trends. 1556-1559 - Yibo Du, Kebin Jia, Chang Liu:

Stereo Matching and Image Inpainting Based on Binocular Camera. 1560-1564 - Daichi Kitahara, Swathi Ananda, Akira Hirabayashi:

Optimization-Based Fundus Image Decomposition for Diagnosis Support of Diabetic Retinopathy. 1565-1572 - Ming-Ze Wang, Shuai Wan, Hao Gong, Yuanfang Yu, Yang Liu:

An Integrated CNN-based Post Processing Filter For Intra Frame in Versatile Video Coding. 1573-1577 - Hongwei Zhang, Liuai Wu, Yanchun Yang:

Parameter-free Image Segmentation Based on Extreme Learning Machine. 1578-1581 - Swathi Ananda, Daichi Kitahara, Akira Hirabayashi, K. R. Udaya Kumar Reddy:

Automatic Fundus Image Segmentation for Diabetic Retinopathy Diagnosis by Multiple Modified U-Nets and SegNets. 1582-1588 - Haesoo Chung, Yoonsik Kim, Junho Jo, Sang-Hoon Lee, Nam Ik Cho:

Kernel Prediction Network for Detail-Preserving High Dynamic Range Imaging. 1589-1594 - Minoru Kuribayashi

, Nobuo Funabiki:
Efficient Decentralized Tracing Protocol for Fingerprinting System with Index Table. 1595-1601 - Xiang Feng, Qun Song, Qingfang Guo, Duo Liu, Zhanfeng Zhao, Yi-an Zhao:

Hand Gesture Recognition with Ensemble Time-Frequency Signatures Using Enhanced Deep Convolutional Neural Network. 1602-1605 - Amna Qureshi

, David Megías
:
Blockchain-based P2P multimedia content distribution using collusion-resistant fingerprinting. 1606-1615 - Ponlawat Chophuk, Kanjana Pattanaworapan, Kosin Chamnongthai:

Consideration of a Selecting Frame of Finger-Spelled Words from Backhand View. 1621-1624 - Yiheng Jiang, Yan Song, Jie Yan, Lirong Dai, Ian McLoughlin

:
Triplet-Center Loss Based Deep Embedding Learning Method for Speaker Verification. 1625-1629 - Rohan Kumar Das

, Jichen Yang, Haizhou Li:
Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech. 1630-1635 - Can Xu, Xianhong Chen, Liang He

, Jia Liu:
Geometric Discriminant Analysis for I-vector Based Speaker Verification. 1636-1640 - Jianfeng Zhou, Tao Jiang, Qingyang Hong, Lin Li:

Extraction of Noise-Robust Speaker Embedding Based on Generative Adversarial Networks. 1641-1645 - Haiwei Wu, Weicheng Cai, Ming Li, Ji Gao, Shanshan Zhang, Zhiqiang Lyu, Shen Huang:

DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge. 1646-1651 - Xu Xiang

, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. 1652-1656 - Tianxiang Ma, Bo Peng, Wei Wang

, Jing Dong:
Any-to-one Face Reenactment Based on Conditional Generative Adversarial Network. 1657-1664 - Naoki Hamasaki, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:

Discrimination between Handwritten and Computer-Generated Texts using a Distribution of Patch-Wise Font Features. 1665-1671 - Jeongwoo Lim, Naoko Nitta, Kazuaki Nakamura, Noboru Babaguchi:

Generating Spoofing Tweets considering Points of Interest of Target User. 1672-1678 - Yuki Hirose, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:

Anonymization of Gait Silhouette Video by Perturbing Its Phase and Shape Components. 1679-1685 - Ngoc-Dung T. Tieu, Huy H. Nguyen

, Fuming Fang, Junichi Yamagishi, Isao Echizen:
An RGB Gait Anonymization Model for Low-Quality Silhouettes. 1686-1693 - Hiroki Tanji, Takahiro Murakami, Hiroyuki Kamata:

A Generalization of Laplace Nonnegative Matrix Factorization and Its Multichannel Extension. 1694-1699 - Yuanlei Qi, Feiran Yang, Jun Yang:

A Late Reverberation Power Spectral Density Aware Approach to Speech Dereverberation Based on Deep Neural Networks. 1700-1703 - Khanh T. K. Nguyen

, Hien M. Nguyen:
A Comparison Study of GRAPPA and Generalized Series Methods for parallel MRI at high acceleration factor. 1704-1709 - Tomonori Maeda, Kiyoshi Nishikawa:

Consideration on application of the concept of Saak transform to convolutional neural networks. 1710-1716 - Wanlu Shi, Yingsong Li, Felix Albu

:
A Norm Penalized Noise-free Maximum Correntropy Criterion Algorithm. 1717-1720 - Kazunori Hayashi

, Kaede Shiohara, Tetsuya Sasaki:
Differentiable Programming based Step Size Optimization for LMS and NLMS Algorithms. 1721-1727 - Qiushi Li

, Zilong Shao, Shunquan Tan, Jishen Zeng, Bin Li:
Non-structured Pruning for Deep-learning based Steganalytic Frameworks. 1735-1739 - Wen-Na Zhang, Yun-Xia Liu, Ze-Yu Zou, Yun-Li Zang, Yang Yang, Bonnie Ngai-Fong Law:

Effective Source Camera Identification based on MSEPLL Denoising Applied to Small Image Patches. 1740-1744 - MaungMaung AprilPyone, Yuma Kinoshita, Hitoshi Kiya:

Filtering Adversarial Noise with Double Quantization. 1745-1749 - Kenta Iida, Hitoshi Kiya:

Image Identification of Grayscale-Based JPEG Images for Privacy-Preserving Photo Sharing Services. 1750-1755 - Warit Sirichotedumrong, Yuma Kinoshita, Hitoshi Kiya:

Privacy-Preserving Deep Neural Networks Using Pixel-Based Image Encryption Without Common Security Keys. 1756-1761 - Koi Yee Ng

, Simying Ong
, KokSheik Wong
:
Delving into the Methods of Coverless Image Steganography. 1763-1772 - Haiwei Wu, Jiantao Zhou, Yuanman Li:

Image Reconstruction from Local Descriptors Using Conditional Adversarial Networks. 1773-1779 - Juqiang Chen

, Xuliang He:
Computational perception of information foci produced by Chinese English learners and American English speakers. 1780-1785 - Jie Hou

, Yu Chen, Yutong Xing, Jianwu Dang:
Acoustic Attributes of Citation Tones in Standard Chinese Produced by Prelingually Deaf Adults. 1786-1790 - Linxuan Wei, Wenwei Dong, Binghuai Lin, Jinsong Zhang:

Multi-Task Based Mispronunciation Detection of Children Speech Using Multi-Lingual Information. 1791-1794 - Bin Li

, Yihan Guan, Si Chen:
Sounds of Personality: Inference from Voices by Non-Native Speakers. 1795-1799 - Xi Chen, Si Chen:

Acquisition and Interpretation of Mandarin Speech Prosody by Native Speakers and Cantonese Learners. 1800-1809 - Yiran Ding, Yanlu Xie, Jinsong Zhang:

Acquisition of L2 Mandarin Rhythm By Russian and Japanese Learners. 1810-1814 - Rong Han, Ming Wu, Kexun Chi, Lan Yin, Hongling Sun, Jun Yang:

A min-max optimization algorithm for global active acoustic radiation control. 1815-1818 - Kenta Iwai

, Takanobu Nishiura:
Audio Integrated Active Noise Control System with Auto Gain Controller. 1819-1823 - Kyosuke Nakagawa, Chuang Shi

, Yoshinobu Kajikawa:
Beam Steering of Portable Parametric Array Loudspeaker. 1824-1827 - Chuang Shi

, Nan Jiang, Rong Xie
, Huiyong Li:
A Simulation Investigation of Modified FxLMS Algorithms for Feedforward Active Noise Control. 1833-1837 - Meixia Fu, Songlin Sun, Kaili Ni, Xiaoying Hou:

Mobile Robot Object Recognition in The Internet of Things based on Fog Computing. 1838-1842 - Haohui Jia, Na Chen

, Takeshi Higashino, Minoru Okada:
Joint Sparse Channel Estimation in Downlink NOMA System. 1843-1846 - Chengbo Liu, Na Chen

, Yafei Hou, Minoru Okada:
Time-Domain Signal Recovery for OFDM System in the Industrial Environment. 1847-1851 - Mau-Luen Tham

, Amjad Iqbal
, Yoong Choon Chang:
Deep Reinforcement Learning for Resource Allocation in 5G Communications. 1852-1855 - Ying Loong Lee

, Donghong Qin:
A Survey on Applications of Deep Reinforcement Learning in Resource Management for 5G Heterogeneous Networks. 1856-1862 - Yukoh Wakabayashi, Nobutaka Ono:

Griffin-Lim phase reconstruction using short-time Fourier transform with zero-padded frame analysis. 1863-1867 - Naoki Makishima, Norihiro Takamune, Hiroshi Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo:

Robust Demixing Filter Update Algorithm Based on Microphone-wise Coordinate Descent for Independent Deeply Learned Matrix Analysis. 1868-1873 - Masakazu Une, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Shoji Makino:

Evaluation of Multichannel Hearing Aid System by Rank-Constrained Spatial Covariance Matrix Estimation. 1874-1879 - Ningning Pan, Jingdong Chen, Biing-Hwang Fred Juang:

Comparative Study of Deep Learning Based and Traditional Single-Channel Noise-Reduction Algorithms. 1880-1884 - Zhi-Wei Tan, Anh H. T. Nguyen, Andy W. H. Khong:

An Efficient Dilated Convolutional Neural Network for UAV Noise Reduction at Low Input SNR. 1885-1892 - Soky Kak

, Sheng Li
, Tatsuya Kawahara
, Sopheap Seng:
Multi-lingual Transformer Training for Khmer Automatic Speech Recognition. 1893-1896 - Zhaodi Qi, Yong Ma, Mingliang Gu:

A Study on Low-resource Language Identification. 1897-1902 - Sardar Parhat, Gao Ting, Mijit Ablimit, Askar Hamdulla:

A morpheme sequence and convolutional neural network based Kazakh text classification. 1903-1906 - Jiawei Yu, Jinsong Zhang:

Zero-resource Language Recognition. 1907-1911 - Qingran Zhan, Petr Motlícek

, Shixuan Du, Yahui Shan, Sifan Ma, Xiang Xie:
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features. 1912-1916 - Zhiyuan Tang, Dong Wang, Liming Song:

AP19-OLR Challenge: Three Tasks and Their Baselines. 1917-1921 - Yi-Hsuan Hsu, Jiun-In Guo:

A Real-time and Online Multiple-Type Object Tracking Method with Deep Features. 1922-1928 - Phuong Le Thi, Tuan Pham, Jia-Ching Wang:

Convolutional Attention Model for Retinal Edema Segmentation. 1929-1932 - Kai-Wen Liang, Yu-Hao Tseng, Pao-Chi Chang:

Parallel Capsule Neural Networks for Sound Event Detection. 1933-1936 - Duc-Quang Vu, Thi-Thu-Trang Phung, Chien-Yao Wang, Jia-Ching Wang:

Age and Gender Recognition Using Multi-task CNN. 1937-1941 - Leong Chee Him, Yu Yang Poh, Lee Wah Pheng:

IoT-based Predictive Maintenance for Smart Manufacturing Systems. 1942-1944 - Seongmin Lee, Woojae Kim, Sewoong Ahn, Jaekyung Kim, Sanghoon Lee:

Physical parameter prediction by embedding human perceptual parameter for 3D garment modeling. 1945-1949 - Zifei Jiang

, Zhen Li, Wei Li, Xueqing Li, Jingliang Peng:
Generic Video-Based Motion Capture Data Retrieval. 1950-1957 - Lulu Guo, Huihui Bai, Yao Zhao:

A Lightweight and Robust Face Recognition Network on Noisy Condition. 1964-1969 - Junheum Park, Chul Lee

, Chang-Su Kim
:
Deep Learning Approach to Video Frame Rate Up-Conversion Using Bilateral Motion Estimation. 1970-1975 - Chia-Hung Yeh, Min-Hui Lin, Wei-Chieh Lu:

3D Reconstruction using HDR-based SLAM. 1976-1980 - Henry Clifton, Alanna Vial, Andrew Miller, Christian Ritz, Matthew Field

, Lois Holloway, Montserrat Ros, Martin Carolan, David Stirling:
Using Machine Learning Applied to Radiomic Image Features for Segmenting Tumour Structures. 1981-1988 - Ricky Sutopo, Ting Yau Teo, Joanne Mun-Yee Lim, KokSheik Wong

:
Computational Intelligence-based Real-time Lane Departure Warning System Using Gabor Features. 1989-1992 - Chung Hou Ng, Wern Han Lim

, Mei Kuan Lim
:
Optimising Search Operations with Swarm Intelligence. 1993-1997 - JunYi Lim, Md Istiaque Al Jobayer, Vishnu Monn Baskaran, Joanne Mun-Yee Lim, KokSheik Wong

, John See
:
Gun Detection in Surveillance Videos using Deep Neural Networks. 1998-2002 - Mahamat Moussa, Chern Hong Lim

:
Interpreting Abnormality of a Complex Static Scene using Generative Adversarial Network. 2003-2007 - Tetsuya Asakawa

, Masaki Aono:
Median based Multi-label Prediction by Inflating Emotions with Dyads for Visual Sentiment Analysis. 2008-2014 - Yupeng Li, Yuxiao Wang, Yongfeng Jiang, Liang Zhang:

Action Recognition using Convolutional Neural Networks with Joint Supervision. 2015-2020 - Zhenqi Fu, Yan Yang

, Feng Shao, Xinghao Ding:
A Study of Perceptual Quality Assessment for Stereoscopic Image Retargeting. 2021-2024 - Yifan Zhao, Jingchun Cheng, Wei Zhou, Chunxi Zhang, Xiong Pan:

Infrared Pedestrian Detection with Converted Temperature Map. 2025-2031 - Binbin Han, Ping Han, Zheng Cheng

:
A Fast and Accurate Cluster Center Initialization Algorithm for PolSAR Superpixel Segmentation. 2032-2037 - Jian Gong, Yameng Yu, William Bellamy, Feng Wang, Xiaoli Ji, Zhenzhen Yang:

Comparing Native Chinese Listeners' Speech Reception Thresholds for Mandarin and English Consonants. 2038-2041 - Jiajing Zhang

, Ying Chen, Jie Cui:
Prosodic Realization of Focus in English by Bidialectal Mandarin Speakers. 2042-2047 - Yating Cao, Hua Chen:

World Englishes and Prosody: Evidence from the Successful Public Speakers. 2048-2052 - Jiangbo Zhang, Aijun Li, Na Zhi:

An Experimental Study on English Majors Weak Form Productions of Prepositions. 2054-2063 - Yixin Zhang, Jinsong Zhang:

Oral Motor Exercises For CSL Learners to Master Productions of Retroflex And Non-Retroflex Consonants. 2064-2069 - Yi Liu, Bairong Zhuang, Zhiyu Li, Takahiro Shinozaki:

Cross-Domain Speaker Recognition using Cycle-Consistent Adversarial Networks. 2070-2074

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














