


default search action
APSIPA 2017: Kuala Lumpur, Malaysia
- 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017. IEEE 2017, ISBN 978-1-5386-1542-3

- Chin-Hui Lee:

Keynote speech 1: An integrated deep learning approach to acoustic signal pre-processing and acoustic modeling with applications to robust automatic speech recognition. v-viii - Yan Chen, Chih-Yu Wang:

Tutorial 1: Sequential decision making: Theories and applications. ix-xii - Binh Trans:

Online learning in the Asia Pacific region. - Jie Yan, Lei Xie, Guangsen Wang, Zhong-Hua Fu:

A segmental DNN/i-vector approach for digit-prompted speaker verification. 1-5 - Szu-Wei Fu, Yu Tsao

, Xugang Lu, Hisashi Kawai:
Raw waveform-based speech enhancement by fully convolutional networks. 6-12 - Jian-Jiun Ding, Shiang-Chih Hua, Ronald Y. Chang

, Yih-Cherng Lee:
Generalized atom and dictionary design and compressive sensing for vocal signal expansion. 13-18 - Chien-Yao Wang

, Andri Santoso
, Jia-Ching Wang:
Acoustic scene classification using self-determination convolutional neural network. 19-22 - I-Hsiang Wang, Jian-Jiun Ding, Hung-Wei Hsu:

Prediction techniques for wavelet based 1-D signal compression. 23-26 - Xiaoming Zhang, Hidetaka Aoki, Akiko Sato, Mohd Amin Abd Majid:

An empirical study on performance optimization at district cooling plant of Universiti Teknologi PETRONAS. 27-32 - Tomoya Sakai, Shun Ogawa, Hiroki Kuhara:

Sequential decomposition of 2D apparent motion fields based on low-rank and sparse approximation. 33-38 - Ettikan Kandasamy Karuppiah:

Internet of Things: Trend, technologies, and evolution. 37-38 - Lounell B. Gueta, Akiko Sato:

Classifying road surface conditions using vibration signals. 39-43 - Ryosuke Kawami, Hidetomo Kataoka, Daichi Kitahara, Akira Hirabayashi, Takashi Ijiri, Shigeharu Shimamura

, Hiroshi Kikuchi, Tomoo Ushio
:
Fast high-quality three-dimensional reconstruction from compressive observation of phased array weather radar. 44-49 - Akie Sakiyama, Yuichi Tanaka

:
Graph reduction method using localization operator and its application to pyramid transform. 50-55 - Vui Ann Shim, Miaolong Yuan, Boon Hwa Tan:

Automatic object searching by a mobile robot with single RGB-D camera. 56-62 - Yan Wu

, Ruohan Wang, Yong Ling Tay, Clarice Jiaying Wong:
Investigation on the roles of human and robot in collaborative storytelling. 63-68 - Gayane Shalunts, Gerhard Backfried, Helmy Syakh Alam

:
Sentiment analysis in Indonesian and French by SentiSAIL. 69-75 - Luis Fernando D'Haro

, Andreea I. Niculescu, Caixia Cai, Suraj Nair, Rafael E. Banchs, Alois C. Knoll, Haizhou Li
:
An integrated framework for multimodal human-robot interaction. 76-82 - Andreea I. Niculescu, Luis Fernando D'Haro

, Rafael E. Banchs:
When industrial robots become social: On the design and evaluation of a multimodal interface for welding robots. 83-89 - Xiao-Zhi Zhang, Ya Li, Bingo Wing-Kuen Ling, Chao Song, Kok Lay Teo:

Spread spectrum compressed sensing magnetic resonance imaging via fractional Fourier transform. 90-93 - Yi-Ping Bao, Yan-Na Zhang

, Yu-E. Song, Bing-Zhao Li, Pei Dang:
Nonuniform sampling theorems for random signals in the offset linear canonical transform domain. 94-99 - Yi-Qian Wang

, Bing-Zhao Li, Qi-Yuan Cheng:
The fractional Fourier transform on graphs. 105-110 - Aykut Koç

, Haldun M. Özaktas, Burak Bartan, Erhan Gundogdu, Tolga Çukur
:
Digital computation of fractional Fourier and linear canonical transforms and sparse image representation. 111-117 - Iman Tabatabaei Ardekani, Xiao Zhang, Hamid R. Sharifzadeh, Jari P. Kaipio:

Maximum a posteriori adjustment of adaptive transversal filters in active noise control. 118-123 - Masato Nakayama, Takanobu Nishiura:

Synchronized amplitude-and-frequency modulation for a parametric loudspeaker. 130-135 - Tomoki Murata, Yoshinobu Kajikawa, Seiji Miyoshi:

Statistical-mechanical analysis of the FXLMS algorithm for multiple-channel active noise control. 136-139 - Michael Anthony, Cheng-Yuan Chang, Sen M. Kuo:

Active noise control for muffler. 140-144 - Nan Chen, Changchun Bao, Xianyun Wang:

Speech enhancement based on binaural cues. 145-148 - Yan Yang, Changchun Bao, Xianyun Wang:

Codebook-driven speech enhancement using DNN and harmonic emphasis. 149-154 - Xin Wang

, Jun Du, Yannan Wang:
A maximum likelihood approach to deep neural network based speech dereverberation. 155-158 - Tohari Ahmad

, Burhanudin Rasyid:
SCFT: Sector-based cancelable fingeprint template. 156-160 - Xiao-Lei Zhang:

Speech separation by cost-sensitive deep learning. 159-162 - Shasha Xia, Hao Li, Xueliang Zhang:

Using optimal ratio mask as training target for supervised speech separation. 163-166 - Minghui Dong, Zhengchen Zhang, Huaiping Ming:

Representing raw linguistic information in chinese text-to-speech system. 167-170 - Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng

:
An end-to-end neural network approach to story segmentation. 171-176 - Dong Wang, Lantian Li

, Zhiyuan Tang, Thomas Fang Zheng:
Deep speaker verification: Do we need end to end? 177-181 - Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Hiroyasu Ando, Kaoru Hiramatsu, Kunio Kashino:

Non-native speech conversion with consistency-aware recursive network and generative adversarial network. 182-188 - Sivanagaraja Tatinati, Mun Kit Ho, Andy W. H. Khong, Yubo Wang

:
End-to-end speech emotion recognition using multi-scale convolution networks. 189-192 - Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki

:
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification. 193-202 - Anu Aryal, Shoko Imaizumi

, Takahiko Horiuchi, Hitoshi Kiya:
Integrated algorithm for block-permutation-based encryption with reversible data hiding. 203-208 - Simying Ong

, KokSheik Wong
, Kiyoshi Tanaka:
Redesigning data hiding: Interpolation-based scrambling-embedding method. 209-213 - KuanYew Tan, KokSheik Wong

, Simying Ong
, Kiyoshi Tanaka:
Rewritable data insertion in encrypted JPEG using coefficient prediction method. 214-219 - Koichi Ito, Takehisa Okano, Takafumi Aoki:

Recent advances in biometrie security: A case study of liveness detection in face recognition. 220-227 - Meng Yang, Nanning Zheng, Fei Wang

, Ce Zhu:
A new bilateral filter for post-removing the noise of synthesis view in 3D video. 228-231 - Hongsheng Liu, Baozhu Guo, Zhizhong Fu, Xiaofeng Li:

A new active contour model based on complexity of textures for segmentation of natural image. 232-236 - Yifan Zhang, Ting Wang, Renjie He, Mingyi He

:
Subpixel mapping of hyperspectral images with hybrid endmember library and optimized abundances. 237-241 - Yifan Zhang, Tuo Zhao, Mingyi He

:
Hyperspectral and multispectral image fusion using local spatial-spectral dictionary pair. 242-246 - Cho-Ying Wu, Jian-Jiun Ding:

A fast non-convex regularizer for low rank matrix completion. 247-250 - Chia-Wei Wang, Tzu-Chieh Yang, Sheng-Ho Chiang, Tsaipei Wang:

Identifying and filling occlusion holes on planar surfaces for 3-D scene editing. 251-254 - Wisarut Chantara, Yo-Sung Ho:

Initial depth estimation using EPIs and structure tensor. 255-258 - Guiqing He, Siyuan Xing, Dandan Dong, Ximei Zhao:

Panchromatic and multi-spectral image fusion method based on two-step sparse representation and wavelet transform. 259-262 - Yuma Kinoshita, Taichi Yoshida, Sayaka Shiota, Hitoshi Kiya:

Pseudo multi-exposure fusion using a single image. 263-269 - Wen-Nung Lie, Chih-Hao Hu, Yi-Kai Chen, Jui-Chiu Chiang:

Multi-layer background sprite model for 2D-to-3D video conversion. 270-274 - Chao Zhang, Ce Zhu, Yipeng Liu, Hongdiao Wen, Zhengtao Wang

:
Image ordinal estimation: Classification and regression benefit each other. 275-278 - Yusaku Akiyoshi, Taichi Sumi, Yoshimitsu Kuroki

:
Dictionary design and disparity interpolation on distributed compressed sensing for light field image. 279-282 - Kwan-Jung Oh, Minsik Park, Jinwoong Kim:

Digital hologram data representation method. 283-286 - Yufei Zhao, Zhizhong Fu, Jin Xu, Linghua Mao:

Image fusion algorithm based on gradient similarity filter. 287-291 - Manoj Ramanathan, Wei-Yun Yau, Eam Khwang Teoh, Nadia Magnenat-Thalmann

:
Pose-invariant kinematic features for action recognition. 292-299 - Tingtian Li

, Daniel Pak-Kong Lun:
Salient object detection using array images. 300-303 - Jia Du, Wei Xiong, Wenyu Chen, Jierong Cheng, Ying Gu:

Accurate subset selection for pose estimation from uncertain points and lines. 304-308 - Xin Rong Soh, Vishnu Monn Baskaran, Adamu Muhammad Buhari

, Raphael C.-W. Phan:
A real time micro-expression detection system with LBP-TOP on a many-core processor. 309-315 - Ryo Miyagi, Masaki Aono:

Sliced voxel representations with LSTM and CNN for 3D shape recognition. 320-323 - Yi Yang Ang, Nam Nguyen, Joni Polili Lie, Woon-Seng Gan

:
Localization of harmonic source using a single moving sensor of known trajectory. 324-328 - Yi Yang Ang, Nam Nguyen, Joni Polili Lie, Woon-Seng Gan

:
Grid-free compressive beamforming using a single moving sensor of known trajectory. 329-332 - Suraj Kumar Nayak, Karan Pande, Pratyush Kumar Patnaik, Shikshya Nayak, Shankar J. Patel, Arfat Anis

, Anilesh Dey
, Kunal Pal
:
Understanding the effect of cannabis abuse on the ANS and cardiac physiology of the Indian women paddy-field workers using RR interval and ECG signal analyses. 333-341 - Phuttapong Sertsi, Surasak Boonkla, Vataya Chunwijitra, Nattapong Kurpukdee, Chai Wutiwiwatchai:

Robust voice activity detection based on LSTM recurrent neural networks and modulation spectrum. 342-346 - Yu-Siang Huang, Szu-Yu Chou, Yi-Hsuan Yang:

Music thumbnailing via neural attention modeling of music emotion. 347-350 - Shohei Mori

, Hideo Saito:
Augmented visualization: Observing as desired. 351-356 - Kazuhisa Yamagishi:

QoE-estimation models for video streaming services. 357-363 - Kazuo Sugimoto, Robert A. Cohen, Dong Tian

, Anthony Vetro:
Trends in efficient representation of 3D point clouds. 364-369 - Yohei Kawaguchi

, Ryoichi Takashima, Takashi Endo, Masahito Togami:
Time-domain subsampling and reconstruction for microphone array. 370-374 - Yiqi Tew

, Tiong Yew Tang
, Yoon-Ket Lee:
A study on enhanced educational platform with adaptive sensing devices using IoT features. 375-379 - Yoon-Ket Lee, Jay Ming Lim, Kok Seng Eu, Yeh Huann Goh, Yiqi Tew

:
Real time image processing based obstacle avoidance and navigation system for autonomous wheelchair application. 380-385 - Jian Han Lim

, Eng Yeow Teh, Ming Han Geh, Chern Hong Lim:
Automated classroom monitoring with connected visioning system. 386-393 - Xin Li, Xueting Wei, Wei Zhou, Zhemin Duan:

Techniques for overheating detection and sensor allocation in a real dual-core processor. 394-400 - Jun Li

, Keng Peng Tee, Lawrence Chen, Kong-Wah Wan
, Wei-Yun Yau:
A perception system for robot arms to convey objects to in-car passengers. 401-408 - Yi Feng, Zhifeng Huang, Yun Zhang:

Motion planning of a 6-Dofs robot arm for bandaging nursing task. 409-413 - Jiadong Wang, Wenjuan Ouyang, Wenchao Gao, Qinyuan Ren

:
Locomotion control of a serpentine crawling robot inspired by central pattern generators. 414-419 - Nicola Catenacci Volpi

, Yan Wu
, Dimitri Ognibene
:
Towards event-based MCTS for autonomous cars. 420-427 - Yuya Chiba, Takashi Nose

, Akinori Ito
:
Analysis of efficient multimodal features for estimating user's willingness to talk: Comparison of human-machine and human-human dialog. 428-431 - Xia Bai, Jiatong Han, Juan Zhao:

Sparse-based disturbance cancellation approach for passive radar. 432-436 - Juan Zhao, Xia Bai:

An improved orthogonal matching pursuit based on randomly enhanced adaptive subspace pursuit. 437-441 - Shiori Mikami, Arata Kawamura, Youji Iiguni:

Residual drum sound estimation for RPCA singing voice extraction. 442-446 - Hyeonggwon Kim, Yoonsik Choe:

Background subtraction via truncated nuclear norm minimization. 447-451 - Yohei Kawaguchi

, Sandra Ramaswami, Ryoichi Takashima, Takashi Endo, Rintaro Ikeshita:
Sub-Nyquist non-uniform sampling for low-cost sound monitoring. 452-456 - Valiantsin Belyi

, Woon-Seng Gan
:
Psychoacoustic subband active noise control algorithm. 457-463 - Shun Hirose, Yoshinobu Kajikawa:

Effectiveness of headrest ANC system with virtual sensing technique for factory noise. 464-468 - Dong-Yuan Shi, Chuang Shi

, Woon-Seng Gan
:
Effect of the audio amplifier's distortion on feedforward active noise control. 469-473 - Caixia Lu, Feiran Yang, Jun Yang:

A frequency-domain adaptive feedback cancellation algorithm based on convex combination. 474-477 - Kouei Yamaoka, Nobutaka Ono

, Shoji Makino, Takeshi Yamada:
Abnormal sound detection by two microphones using virtual microphone technique. 478-482 - Feng Bao, Waleed H. Abdulla:

Signal power estimation based on convex optimization for speech enhancement. 483-487 - Yanhui Tu, Jun Du, Lei Sun, Chin-Hui Lee:

LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement. 488-491 - Zexin Liu, Heather T. Ma, Fei Chen

:
A new data-driven band-weighting function for predicting the intelligibility of noise-suppressed speech. 492-496 - Miao Zhang, Yixiang Chen, Lantian Li

, Dong Wang:
Speaker recognition with cough, laugh and "Wei". 497-501 - Hosana Kamiyama, Atsushi Ando, Satoshi Kobashikawa, Yushi Aono:

Robust children and adults speech identification and confidence measure based on DNN posteriorgram. 502-505 - Feng Li, Huihui Bai, Yao Zhao:

Visual attention guided eye movements for 360 degree images. 506-511 - Cairong Xing, Anhong Wang, Suyue Li, Peihao Li, Jing Zhang:

Random aliasing modulation with decision-directed demodulation. 512-515 - Chang Duan, Yuhuan Shen, Yingying Zhang, Shuai Wang, Ce Zhu, Meng Yang:

Enhancing wedgelet-based depth modeling in 3D-HEVC. 516-519 - Xiaoqiang Cao, Ce Zhu, Minjie Yang, Yongbing Lin, Jianhua Zheng:

A new intra prediction method based on consistent luminance changes. 520-523 - Szu-Wei Fu, Jian-Jiun Ding, Ying-Wun Huang, Ching-Wen Hsiao, Hsin-Hui Chen:

Collagen image compression using the JPEG-based predictive lossless coding scheme. 524-533 - Sze-Teng Liong

, KokSheik Wong
:
Micro-expression recognition using apex frame with phase information. 534-537 - Jierong Cheng, Wei Xiong, Jia Du, Wenyu Chen, Ying Gu:

Detection of meaningful line segment configurations. 538-541 - Jinyoung Jang, Dong-Won Shin, Yo-Sung Ho:

Disparity map refinement method using coarse-to-fine image segmentation. 542-545 - Dong-Won Shin, Yo-Sung Ho:

Local patch descriptor using deep convolutional generative adversarial network for loop closure detection in SLAM. 546-549 - Chen Chen, Shangwen Li, Xiang Fu, Yuzhuo Ren, Yueru Chen, C.-C. Jay Kuo

:
Exploring confusing scene classes for the places dataset: Insights and solutions. 550-558 - Nirmesh J. Shah, Hemant A. Patil:

On the convergence of INCA algorithm. 559-562 - Maulik C. Madhavi

, Hemant A. Patil:
Combining evidences from detection sources for query-by-example spoken term detection. 563-568 - Yuanjun Zhao, Roberto Togneri, Victor Sreeram:

Compressed high dimensional features for speaker spoofing detection. 569-572 - Vishnu Vidyadhara Raju Vegesna, Hari Krishna Vydana, Suryakanth V. Gangashetty

, Anil Kumar Vuppala:
Importance of non-uniform prosody modification for speech recognition in emotion conditions. 573-576 - Chitralekha Gupta

, Haizhou Li
, Ye Wang
:
Perceptual evaluation of singing quality. 577-586 - Kishin Migimatsu, Takuya Wakazono, Isao T. Tokuda:

Experimental study on source-filter interaction using physical model of the vocal folds. 587-590 - Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao

, Hsin-Min Wang
:
Fast locally linear embedding algorithm for exemplar-based voice conversion. 591-595 - Shengke Lin, Takashi Tsunakawa, Masafumi Nishida, Masafumi Nishimura:

DNN-based feature transformation for speech recognition using throat microphone. 596-599 - Hitoshi Yamamoto, Koji Okabe, Takafumi Koshinaka:

Robust i-vector extraction tightly coupled with voice activity detection using deep neural networks. 600-604 - Chen-Yen Lai, Yu-Wen Lo, Yih-Liang Shen, Tai-Shih Chi:

Plastic multi-resolution auditory model based neural network for speech enhancement. 605-609 - Kazuho Morikawa, Tomoki Toda

:
Electrolaryngeal speech modification towards singing aid system for laryngectomees. 610-613 - Peixin Chen, Wu Guo, Qingnan Wang, Yan Song:

Topic classification based on distributed document representation and latent topic information. 614-617 - Michael Hentschel, Atsunori Ogawa, Marc Delcroix

, Tomohiro Nakatani, Yuji Matsumoto:
Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs. 618-621 - Junfeng Hou, Shiliang Zhang, Li-Rong Dai, Hui Jiang:

Feedforward sequential memory networks based encoder-decoder model for machine translation. 622-625 - Yu Chen, Yanting Chen, Hua Lin, Jie Hou, Yutong Xing, Jianwu Dang:

A study of high level tone in standard chinese produced by prelingually deaf adults. 626-629 - Hao Zhang

, Nan Yan, Lan Wang, Manwa L. Ng:
Energy distribution analysis and nonlinear dynamical analysis of phonation in patients with Parkinson's disease. 630-635 - Chuanying Niu, Jinsong Zhang

, Xuesong Yang
, Yanlu Xie:
A study on landmark detection based on CTC and its application to pronunciation error detection. 636-640 - Chin-Hong Shih, Bi-Cheng Yan, Shih-Hung Liu, Berlin Chen:

Investigating Siamese LSTM networks for text categorization. 641-646 - Yu Chen, Jie Hou, Yutong Xing, Yanting Chen, Hua Lin, Jianwu Dang:

The acoustic characteristics of tone 3 in standard chinese produced by prelingually deaf adults. 647-650 - Nina Zhou, Xuancong Wang, AiTi Aw:

Dynamic boundary detection for speech translation. 651-656 - Hung-Wei Hsu, Jian-Jiun Ding:

FasterMDNet: Learning model adaptation by RNN in tracking-by-detection based visual tracking. 657-660 - Zheng-Teng Zhang, Chia-Hung Yeh, Li-Wei Kang, Min-Hui Lin:

Efficient CTU-based intra frame coding for HEVC based on deep learning. 661-664 - Zun-Ci Lee, Raphael C.-W. Phan, Su-Wei Tan, Kuan-Heng Lee:

Multimodal decomposition for enhanced subtle emotion recognition. 665-671 - Jing-Ming Guo, S. Sankarasrinivasan

:
Enhanced block truncation coding image using digital multitone screen. 672-676 - Gayane Shalunts, Martin Cerman, Daniel Albertini:

Detection of sculpted faces on building facades. 677-685 - Yueru Chen, Pranav Aggarwal, Jongmoo Choi, C.-C. Jay Kuo:

A deep learning approach to drone monitoring. 686-691 - Guan-Ting Lin

, Patrisia Sherryl Santoso, Che-Tsung Lin, Chia-Chi Tsai
, Jiun-In Guo:
Stop line detection and distance measurement for road intersection based on deep learning neural network. 692-695 - Takuya Araki, Yuichi Nakamura:

Future trend of deep learning frameworks - From the perspective of big data analytics and HPC. 696-703 - Nam Kyun Kim, Jiwon Lee, Hun Kyu Ha, Geon Woo Lee, Jung Hyuk Lee, Hong Kook Kim:

Speech emotion recognition based on multi-task learning using a convolutional neural network. 704-707 - Jonghee Kim, Jinsu Kim, Seokeon Choi, Muhammad Abul Hasan, Changick Kim:

Robust template matching using scale-adaptive deep convolutional features. 708-711 - Sung-Phil Kim, Jae-Hwan Kang, Young Chang Jo, Ian Oakley:

Development of a multi-modal personal authentication interface. 712-715 - Shohei Ogai, Toshihisa Tanaka:

A drag-and-drop type human computer interaction technique based on electrooculogram. 716-720 - Jaeyoung Shin, Klaus-Robert Müller

, Han-Jeong Hwang:
Hybrid EEG-NIRS brain-computer interface under eyes-closed condition. 721-723 - Sunghan Lee

, Hohyun Cho
, Sung Chan Jun
:
Simultaneous bio-signal measurement system for multiple users - development and validation. 724-727 - Xiaoling Wu, Shuhua Gao, Dong-Yan Huang, Cheng Xiang:

Voichap: A standalone real-time voice change application on iOS platform. 728-732 - Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Shipeng Xu:

Free linguistic and speech resources for Tibetan. 733-736 - Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng:

A multilingual language processing tool for Uyghur, Kazak and Kirghiz. 737-740 - Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng, Guanyu Li, Gegeentana:

Language resource construction for Mongolian. 741-744 - Ying Shi, Askar Hamdullah, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng:

A free Kazakh speech database and a speech recognition baseline. 745-748 - Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen:

AP17-OLR challenge: Data, plan, and baseline. 749-753 - Yeun Lok Lin, Ngai-Fong Law, Chi-Wai Do

:
Portable vision screenings system. 754-759 - Kin-On Cheng, Ngai-Fong Law, Wan-Chi Siu:

Compressing population DNA sequences using multiple reference sequences. 760-764 - Yun-Xia Liu, Yang Yang, Yuehui Chen:

Lung sound classification based on Hilbert-Huang transform features and multilayer perceptron network. 765-768 - Yun-Xia Liu, Yang Yang, Yuehui Chen:

Automatic detection of circulating tumor cells based on microscopic images. 769-773 - Shengyan Li, Bin Li, Shixiong Zhang, Hong Fu

, Wai-Lun Lo, Jie Yu, Cindy H. P. Sit, Ruimin Li:
A markerless visual-motor tracking system for behavior monitoring in DCD assessment. 774-777 - Lei Wang, Zexin Liu, Fei Chen

:
Perceptual roles of temporal and segmentation cues in single-channel noise reduction processing. 778-785 - Alan Kan

:
Improving speech intelligibility for bilateral cochlear implant users using Weiner filters and its impact on cognitive load. 786-792 - Jing Chen, Zhen Fu, Xiuyong Ding, Jiping Wu, Xihong Wu:

Electrically-evoked frequency following responses (EFFRs) and electrically-evoked auditory brainstem responses (EABRs) in guinea pigs. 793-802 - Rebecca E. Millman, Michael A. Stone, Chin-Tuan Tan

:
Objective neurophysiological assessment for sound quality perception by hearing-impaired listeners. 803-807 - Syu-Siang Wang

, Yu Tsao
, Hsiao-Lan Sharon Wang, Ying-Hui Lai, Lieber Po-Hung Li:
A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise. 808-812 - Chih-Chiang Chen, Shang-Ho Lawrence Tsai, Yuan-Pei Lin, Chia-Hua Lin

:
Resource allocation and minimum rate for precoded non-orthogonal multiple access. 813-817 - Tzu-Chiao Lin

, See-May Phoong:
MSE-optimized CP-based CFO estimation in OFDM systems over multipath channels. 818-822 - Y.-W. Peter Hong

, An-An Lee, Yu-An Chen:
Successive MMSE group decoding and max-min power control for uplink multiceli NOMA systems under pilot contamination. 823-831 - Syu-Siang Long, Pei-Yun Tsai, Yuan-Hao Huang, I-Wei Lai:

Trellis coded generalized spatial modulation with spatial multiplexing. 832-837 - Chia-Yang Mei, Wan-Jen Huang:

Low-complexity zero-forcing detector for large-scale MIMO-OFDM systems. 838-841 - Yanhong Wu, Xiaolong Li, Yao Zhao, Rongrong Ni:

A new detector for JPEG decompressed bitmap identification. 842-845 - Omer Hemida, Yaoran Huo, Fan Chen, Hongjie He:

Block-DCT based alterable-coding restorable fragile watermarking scheme with superior localization. 846-851 - Kenta Iida, Hitoshi Kiya:

Robust image identification without any visible information for double-compressed JPEG images. 852-857 - Tatsuya Chuman, Kenta Iida, Hitoshi Kiya:

Image manipulation on social media for encryption-then-compression systems. 858-863 - Liuying Sun, Anthony T. S. Ho, Zhe Xia, Jiageng Chen

, Xuzhe Huang, Yidan Zhang:
Detection and classification of malicious patterns in network traffic using Benford's law. 864-872 - John Håkon Husøy:

On the selection and design of filter banks in normalised subband adaptive filters (NSAF). 877-883 - Li Su

:
Between homomorphic signal processing and deep neural networks: Constructing deep algorithms for polyphonic music transcription. 884-891 - Soichiro Aoki, Hiroki Tanji, Takahiro Murakami:

Array shape calibration using near field pilot sources with unknown distance. 892-896 - Xiangyuan Li, Cheng Cai, Jinrong He

:
Density-based multi-manifold ISOMAP for data classification. 897-903 - Tomoya Wada, Toshihisa Tanaka:

Doubly adaptive kernel filtering. 904-909 - Kenzo Yamamoto, Kenji Suyama:

Active enumeration of local minima for IIR filter design using PSO. 910-917 - Chung-Nan Lee, Sheng-Wei Chu:

A fairness aware and resource reuse algorithm for LTE layered video multicast service. 918-925 - Wei Zhang, Lixia Hao, Yanlu Xie, Jinsong Zhang

:
A study on quantitative computation for prosodie strength of Mandarin speech. 926-930 - Huiyong Li, Zihui Luo, Julan Xie, Jun Li:

Joint estimation of signal and mutual coupling parameters based on spatially spread polarization sensitive array. 931-937 - Shuichi Ohno, M. Rizwan Tariq, Masaaki Nagahara

:
Min-max IIR filter design for feedback quantizers. 938-942 - Ayano Nakai, Kazunori Hayashi

:
Diffusion LMS using consensus propagation. 943-948 - Bandhit Suksiri, Masahiro Fukumoto:

Enhanced array manifold matrices for L-shaped microphone array-based 2-D DOA estimation. 955-960 - Chisa Kodama, Kunihito Kato, Satoshi Tamura, Satoru Hayamizu:

Swallowing function evaluation using deep-learning-based acoustic signal processing. 961-964 - Xiaobai Chen, Jinlong Xu, Zhiyi Yu:

A fast and energy efficient FPGA-based system for real-time object tracking. 965-968 - Yuuki Saito, Akira Tanaka:

Optimal kernel in kernel regression problems with autocorrelation prior. 969-972 - Tatsuya Yokota, Hidekata Hontani:

An efficient method for adapting step-size parameters of primal-dual hybrid gradient method in application to total variation regularization. 973-979 - Surasak Boonkla, Masashi Unoki

, Chai Wutiwiwatchai, Stanislav S. Makhanov
:
F0 estimation using empirical mode decomposition and complex cepstrum analysis in reverberant environments. 980-986 - Andre McDonald, Anton van Wyk:

Construction of semi-Markov ergodic maps with selectable spectral characteristics via the solution of the inverse eigenvalue problem. 987-993 - Yujia Lu, Kazunori Hayashi

:
A new pool control method for Boolean compressed sensing based adaptive group testing. 994-999 - Samad S. Kolahi, Bashar Barmada, Keysha Mudaliar:

Defence mechanisms evaluation against RA flood attacks for Linux-victim node. 1000-1005 - Tatsuya Kawahara

:
Automatic meeting transcription system for the Japanese parliament (diet). 1006-1010 - Zhenzhen Wang, Jingjing Meng, Tan Yu, Junsong Yuan

:
Common visual pattern discovery and search. 1011-1018 - Anthony Kuh, Muhammad Sharif Uddin, Phyllis Ng:

Online unsupervised kernel learning algorithms. 1019-1025 - Shiuan Huang, Hsueh-Ming Hang:

Multi-query image retrieval using CNN and SIFT features. 1026-1034 - Lilei Zheng, Ying Zhang, Vrizlynn L. L. Thing:

Understanding multi-layer perceptrons on spatial image steganalysis features. 1035-1039 - Lantian Li

, Dong Wang, Askar Rozi, Thomas Fang Zheng:
Cross-lingual speaker verification with deep feature learning. 1040-1044 - Shinya Takamaeda-Yamazaki, Kodai Ueyoshi, Kota Ando, Ryota Uematsu, Kazutoshi Hirose, Masayuki Ikebe, Tetsuya Asai, Masato Motomura

:
Accelerating deep learning by binarized hardware. 1045-1051 - Hao Xu, Yueru Chen, Ruiyuan Lin, C.-C. Jay Kuo

:
Understanding CNN via deep features analysis. 1052-1060 - Yuan-Fu Li, Chia-Chi Tsai

, Yi-Ting Lai, Jiun-In Guo:
A multiple-lane vehicle tracking method for forward collision warning system applications. 1061-1064 - Mehrdad Babazadeh, Sokratis Kartakis, Julie A. McCann:

Highly-distributed sensor processing using IoT for critical infrastructure monitoring. 1065-1074 - Kuan-Chung Wang, Yoga Dwi Pranata, Jia-Ching Wang:

Automatic vehicle classification using center strengthened convolutional neural network. 1075-1078 - Cong Lai, Wen Luo, Shiqiang Chen, Qinhua Li, Qingyu Yang, Hongbin Sun, Nanning Zheng:

Zynq-based full HD around view monitor system for intelligent vehicle. 1079-1082 - Huan-Rui Chang, Hsueh-Ming Hang:

Wide angle virtual view synthesis using two-by-two Kinect V2. 1083-1091 - Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla:

Memory-augmented Chinese-Uyghur neural machine translation. 1092-1096 - Elok Cahyaningtyas, Dhany Arifianto:

Development of under-resourced Bahasa Indonesia speech corpus. 1097-1101 - Anocha Rugchatjaroen, Sittipong Saychum, Keiichiro Oura, Keiichi Tokuda:

Generalization of Thai tone contour in HMM-based speech synthesis. 1102-1105 - Aijun Li, Gongping Wang:

The longitudinal development of focus duration of Korean Chinese learners. 1106-1114 - Hankiz Yilahun, Aynur Nurtay, Askar Hamdulla:

Patterns of vowels in Uyghur Tri-syllabic words. 1115-1122 - Ismail M. El-Badawy

, Ashraf M. Aziz
, Zaid Bin Omar
, M. B. Malarvili
:
Correlation between different DNA period-3 signals: An analytical study for exons prediction. 1123-1128 - Nam H. Le, Khang N. Nguyen, Hien M. Nguyen:

Comparison analysis of ICA versus MCA-KSVD blind source separation on task-related fMRI data. 1129-1135 - Yudai Suzuki, Keigo Kawaji

, Amit R. Patel, Satoshi Tamura, Satoru Hayamizu:
Toward effective noise reduction for sub-Nyquist high-frame-rate MRI techniques with deep learning. 1136-1139 - Satoshi Ito:

Compressed sensing reconstruction of MR phase-varied images using multi-scale complex sparsifying transform. 1140-1143 - Masara Yamashita, Tasuku Miura, Shoichi Matsunaga:

Distinction between healthy individuals and patients with confident abnormal respiration. 1144-1147 - Jie Chen

, Yun Ni, Junhui Hou, Lap-Pui Chau
:
Light field scene flow with occlusion regularization. 1148-1151 - Yu Zhou

, Sam Kwong
, Junhui Hou
:
Single image superresolution by multiple geometrical regressors. 1152-1155 - Chia-Chun Hsu, Jian-Jiun Ding, Yih-Cherng Lee:

Efficient edge-oriented based image interpolation algorithm for non-integer scaling factor. 1156-1159 - Wei-Ting Lu, Chien-Wei Lin, Chih-Hung Kuo, Ying-Chan Tung:

Image super-resolution based on error compensation with convolutional neural network. 1160-1163 - Qinhui Fan, Hongsheng Liu, Zhizhong Fu, Xiaofeng Li:

Exemplar-based image inpainting based on pixel inhomogeneity factor. 1164-1168 - Yusuke Sugawara, Sayaka Shiota, Hitoshi Kiya:

A parallel computation algorithm for super-resolution methods using convolutional neural networks. 1169-1173 - Guiqing He, Dandan Dong, Siyuan Xing, Ximei Zhao:

Infrared and visible image fusion based on innovation feature simultaneous decomposition. 1174-1177 - Takuro Yamaguchi, Masaaki Ikehara:

Joint bilateral based image denoising using multi-sized 2D hard threshold. 1178-1181 - ShuMin Liu, Jiaxuan Zhang, Jiajia Chen:

Multi-focus image fusion using Gaussian filter and dynamic programming. 1182-1185 - Hyewon Song, Doyoung Kim, Hyuck-Joo Kwon, Sanghoon Lee

:
Natural scene statistics based publication classification algorithm using convolutional neural network. 1186-1189 - LieLin Pang, KokSheik Wong

, Sze-Teng Liong
:
Data embedding in scalable coded video. 1190-1194 - Kaavya Sriskandaraja

, Gajan Suthokumar, Vidhyasaharan Sethu
, Eliathamby Ambikairajah
:
Investigating the use of scattering coefficients for replay attack detection. 1195-1198 - Masashi Unoki

, Yuta Kashihara, Maori Kobayashi, Masato Akagi:
Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room. 1199-1204 - KokSheik Wong

, Hitoshi Kiya:
Reversible data hiding for compression-friendly image encryption method. 1205-1209 - Kai Liu, Xuan Li, Qiong Zhang, Xiangui Kang:

Multi-channel neural network for steganalysis. 1210-1213 - Qingnan Wang, Wu Guo, Peixin Chen, Yan Song:

Tibetan-Mandarin bilingual speech recognition based on end-to-end framework. 1214-1217 - Huang Chen, Shiliang Zhang, Junfeng Hou, Lirong Dai:

Learning the number of nodes in DNNs with activation mask. 1218-1221 - Hiromitsu Nishizaki:

Data augmentation and feature extraction using variational autoencoder for acoustic modeling. 1222-1227 - Hitoshi Ito, Aiko Hagiwara, Manon Ichiki, Takeshi Mishima, Shoei Sato, Akio Kobayashi:

End-to-end speech recognition for languages with ideographic characters. 1228-1232 - Yuki Yasui, Nakamasa Inoue, Koji Iwano

, Koichi Shinoda
:
Multimodal speech recognition using mouth images from depth camera. 1233-1236 - Yen-Ting Lin, Chen-Yu Chiang:

Deep learning-based speaking rate-dependent hierarchical prosodie model for Mandarin TTS. 1237-1242 - Akira Sasou:

Automatic identification of pathological voice quality based on the GRBAS categorization. 1243-1247 - Kengo Ohta, Rikito Marumoto, Ryota Nishimura

, Norihide Kitaoka:
Selecting type of response for chat-like spoken dialogue systems based on acoustic features of user utterances. 1248-1252 - Katsuki Inoue, Sunao Hara, Masanobu Abe, Nobukatsu Hojo, Yusuke Ijima:

An investigation to transplant emotional expressions in DNN-based TTS synthesis. 1253-1258 - Gaku Kotani, Daisuke Saito, Nobuaki Minematsu:

Voice conversion based on deep neural networks for time-variant linear transformations. 1259-1262 - Hiroto Ashikawa, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa

:
Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations. 1263-1267 - Yanfeng Lu, Chenyu Yang, Minghui Dong:

Word level prosody prediction using large audiobook dataset. 1268-1273 - Patrick Lumban Tobing

, Hirokazu Kameoka, Tomoki Toda
:
Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling. 1274-1277 - Hyungjun Lim

, Younggwan Kim, Yoonhoe Kim, Hoirin Kim:
CNN-based bottleneck feature for noise robust query-by-example spoken term detection. 1278-1281 - Chun-Ting Huang, Yueru Chen, Ruiyuan Lin, C.-C. Jay Kuo

:
Age/gender classification with whole-component convolutional neural networks (WC-CNN). 1282-1285 - Jing Zhang, Yuchao Dai, Fatih Porikli

, Mingyi He
:
Multi-scale salient object detection with pyramid spatial pooling. 1286-1291 - Zeng Peng, Cheng Cai:

An effective segmentation algorithm of apple watercore disease region using fully convolutional neural networks. 1292-1299 - Pei Chee Yong, Kit Yan Chan, Sven Nordholm

:
Utilizing neural network and critical band processing for speech enhancement. 1300-1303 - Conggui Liu, Nakamasa Inoue, Koichi Shinoda

:
A unified network for multi-speaker speech recognition with multi-channel recordings. 1304-1307 - Shinnosuke Takamichi:

Modulation spectrum-based speech parameter trajectory smoothing for DNN-based speech synthesis using FFT spectra. 1308-1311 - Takeshi Hori, Kazuyuki Nakamura

, Shigeki Sagayama:
Music chord recognition from audio data using bidirectional encoder-decoder LSTMs. 1312-1315 - Keisuke Imoto

, Nobutaka Ono
, Masahiro Niitsuma, Yoichi Yamashita:
Online sound structure analysis based on generative model of acoustic feature sequences. 1316-1321 - Nancy F. Chen

, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen
, Xiong Xiao, Sunil Sivadas, Eng Siong Chng
, Bin Ma, Haizhou Li
:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. 1322-1327 - Sunao Hara, Asako Hatakeyama, Shota Kobayashi, Masanobu Abe:

Sound sensing using smartphones as a crowdsourcing approach. 1328-1333 - Akira Tamamori, Tomoki Hayashi, Tomoki Toda

, Kazuya Takeda:
An investigation of recurrent neural network for daily activity recognition using multi-modal signals. 1334-1340 - Tatsuya Komatsu, Masahiro Tani, Takahiro Toizumi, Chaitanya Narisetty

, Masanori Kato, Yumi Arai, Osamu Hoshuyama, Yuzo Senda, Reishi Kondo:
An acoustic monitoring system and its field trials. 1341-1346 - Tin Lay Nwe, Tran Huy Dat, Bin Ma:

Convolutional neural network with multi-task learning scheme for acoustic scene classification. 1347-1350 - Danqing Luo, Yuexian Zou, Dongyan Huang:

Speech emotion recognition via ensembling neural networks. 1351-1355 - Yuanchao Li, Carlos Toshinori Ishi, Nigel G. Ward, Koji Inoue, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara

:
Emotion recognition by combining prosody and sentiment analysis for expressing reactive emotion by humanoid robot. 1356-1359 - Ming Li, Luting Wang, Zhicheng Xu, Danwei Cai:

Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization. 1360-1363 - Rafael E. Banchs:

On the construction of more human-like chatbots: Affect and emotion analysis of movie dialogue data. 1364-1367 - Wan Ding, Dong-Yan Huang, Zhuo Chen, Xinguo Yu, Weisi Lin:

Facial action recognition using very deep networks for highly imbalanced class distribution. 1368-1372 - Felix Albu

, Linh Thi Thuc Tran, Sven Nordholm
:
A combined variable step size strategy for two microphones acoustic feedback cancellation using proportionate algorithms. 1373-1377 - Feiran Yang, Jun Yang:

A fast affine projection algorithm based on a modified Toeplitz matrix. 1378-1381 - Ryo Takehara, Arata Kawamura, Youji Iiguni:

Impulsive noise suppression using interpolated zero phase signal. 1382-1389 - Hala As'ad, Martin Bouchard

, A. Homayoun Kamkar-Parsi:
Binaural beamforming with spatial cues preservation for hearing aids in real-life complex acoustic environments. 1390-1399 - Kiyoshi Nishikawa, Kan Okubo, Yuta Katori, Nobunao Takeuchi:

Application of mean-shift clustering for removing flux trapping noise from geomagnetic field signals measured using HTS-SQUID magnetometers. 1400-1405 - Mohammad Mogharen Askarin

, KokSheik Wong
, Raphael C.-W. Phan:
Reduced contact lifting of latent fingerprint. 1406-1410 - Kong-Yik Chee, Zhe Jin

, Wun-She Yap, Bok-Min Goi:
Two-dimensional winner-takes-all hashing in template protection based on fingerprint and voice feature level fusion. 1411-1419 - Tatsunori Itakura, Toshihisa Tanaka:

Epileptic focus localization based on bivariate empirical mode decomposition and entropy. 1426-1429 - Anand Kumar Mukhopadhyay, Indrajit Chakrabarti, Mrigank Sharad:

Real-time digitized neural-spike storage scheme in multiple channels for biomedical applications. 1430-1435 - Jae Woong Soh, Hyun-Seung Lee, Nam Ik Cho:

An image compression algorithm based on the Karhunen Loève transform. 1436-1439 - Ji-Sang Bae, Jong-Ok Kim:

A rail detection algorithm based on pair particles filtering. 1440-1443 - Eunpil Park, Jae-Young Sim:

Gradient-based contrast enhancement and color correction for underwater images. 1444-1447 - Ji-Eun Lee, Min-Joo Kang, Je-Won Kang:

Ensemble of binary tree structured deep convolutional network for image classification. 1448-1451 - Bee Lim, Kyoung Mu Lee:

Deep recurrent resnet for video super-resolution. 1452-1455 - Chih-Yuan Lo, Yu-Wei Hua, Wei-Chuan Yu, Yu-Min Chuang:

Functional verification and performance testing for OpenAirinterface (OAI) eNodeB. 1456-1459 - Toshiyuki Shizuoka, Osamu Takyu, Mai Ohta, Takeo Fujii:

Multiband hierarchical ad hoc network with wireless environment recognition. 1464-1469 - Wen-Ping Lai, Yong-Hsiang Wang:

On the performance impact of virtual link types to 5G networking. 1470-1474 - Po-Chiang Lin, Sheng-Lun Huang, Xin-Yuan Li:

Teaching and learning next generation mobile communication networks through open source openAirInterface testbeds. 1475-1478 - Hongshen Tang, Rongrong Ni, Yao Zhao, Xiaolong Li:

Detection of various image operations based on CNN. 1479-1485 - Minoru Kuribayashi

, Takahiro Ueda, Nobuo Funabiki:
Secure data management system with traceability against internal leakage. 1486-1494 - Ahmad Akmal Aminuddin Mohd Kamal

, Keiichi Iwamura, Hyunho Kang:
Searchable encryption of image based on secret sharing scheme. 1495-1503 - Hoang-Quoc Nguyen-Son, Ngoc-Dung T. Tieu, Huy H. Nguyen

, Junichi Yamagishi, Isao Echizen:
Identifying computer-generated text using statistical analysis. 1504-1511 - Weiwei Sun, Jiantao Zhou:

Image origin identification for online social networks (OSNs). 1512-1515 - Jongheui Hong, Wonjoon Song:

Delta-modulated cross-correlation method for delay estimation on source localization. 1516-1519 - Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda

, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of how to design control parameters for statistical voice timbre control. 1520-1523 - Decha Moungsri, Tomoki Koriyama

, Takao Kobayashi:
Enhanced F0 generation for GPR-based speech synthesis considering syllable-based prosodic features. 1524-1527 - Nirmesh J. Shah, Pramod B. Bachhav, Hemant A. Patil:

A novel filtering-based F0 estimation algorithm with an application to voice conversion. 1528-1531 - Ming-Hsiang Su, Chung-Hsien Wu

, Kun-Yi Huang, Qian-Bei Hong, Hsin-Min Wang
:
Personality trait perception from speech signals using multiresolution analysis and convolutional neural networks. 1532-1536 - Berrak Sisman

, Haizhou Li
, Kay Chen Tan
:
Transformation of prosody in voice conversion. 1537-1546 - Karthika Vijayan

, Minghui Dong, Haizhou Li
:
A dual alignment scheme for improved speech-to-singing voice conversion. 1547-1555 - Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda

:
Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives. 1556-1564 - Masato Obara, Munehiro Moriya, Ryota Konno, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh:

Acceleration for query-by-example using posteriorgram of deep neural network. 1565-1569 - Zhi Hao Lim, Xiaohai Tian, Wei Rao, Eng Siong Chng

:
An investigation of spectral feature partitioning for replay attacks detection. 1570-1573 - Hanwu Sun, Kong-Aik Lee, Trung Hieu Nguyen, Bin Ma, Haizhou Li

:
I2R-NUS submission to oriental language recognition AP16-OL7 challenge. 1574-1578 - Yue Chen, Yanlu Xie, Jinsong Zhang

:
A comparison study of information contributions of phonemic contrasts in Mandarin. 1579-1582 - Aodong Li, Shiyue Zhang, Dong Wang, Thomas Fang Zheng:

Enhanced neural machine translation by learning from draft. 1583-1587 - Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling. 1588-1591 - Ryoichi Takashima, Yohei Kawaguchi

, Qinghua Sun, Takashi Sumiyoshi, Masahito Togami:
An application of noise-robust speech translation using asynchronous smart devices. 1592-1595 - Zhiping Zeng, Haihua Xu, Tze Yuang Chong, Eng Siong Chng

, Haizhou Li
:
Improving N-gram language modeling for code-switching speech recognition. 1596-1601 - Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng

:
Topic embedding of sentences for story segmentation. 1602-1607 - Xing Wei, Jingping Chen, Wei Wang, Yanlu Xie, Jinsong Zhang

:
A study of automatic annotation of PETs with articulatory features. 1608-1612 - Shumin An, Zhenhua Ling, Lirong Dai:

Emotional statistical parametric speech synthesis using LSTM-RNNs. 1613-1616 - Shogo Hara, Hiromitsu Nishizaki:

Acoustic modeling with a shared phoneme set for multilingual speech recognition without code-switching. 1617-1620 - Narumi Mae, Yoshiki Mitsui, Shoji Makino, Daichi Kitamura, Nobutaka Ono

, Takeshi Yamada, Hiroshi Saruwatari:
Sound source localization using binaural difference for hose-shaped rescue robot. 1621-1627 - Xiaowei Jiang, Shuai Wang, Xu Xiang

, Yanmin Qian:
Integrating online i-vector into GMM-UBM for text-dependent speaker verification. 1628-1632 - Linh Thi Thuc Tran, Henning F. Schepker

, Simon Doclo, Hai Huyen Dam, Sven E. Nordholm
:
Adaptive feedback control using improved variable step-size affine projection algorithm for hearing aids. 1633-1640 - Effrosyni Paschou, Fabian Esqueda, Vesa Välimäki

, John Mourjopoulos:
Modeling and measuring a Moog voltage-controlled filter. 1641-1647 - Kun-Yi Huang, Chung-Hsien Wu

, Ming-Hsiang Su, Chia-Hui Chou:
Mood disorder identification using deep bottleneck features of elicited speech. 1648-1652 - Siying Liu, Karianto Leman

:
Handling small motions without differential approximation. 1653-1656 - Ramanpreet Singh Pahwa, Tian-Tsong Ng

, Minh N. Do
:
Tracking objects using 3D object proposals. 1657-1660 - Sameer Khan

, Suet-Peng Yong:
A deep learning architecture for classifying medical images of anatomy object. 1661-1668 - Chern Hong Lim, Kam Meng Goh

:
Fuzzy qualitative approach for micro-expression recognition. 1669-1674 - Tai-En Wu, Chia-Chi Tsai

, Jiun-In Guo:
LiDAR/camera sensor fusion technology for pedestrian detection. 1675-1678 - Jundai Sun, Mao-shen Jia, Changchun Bao:

Multiple source localization by using energy weighted single source zone detection. 1679-1683 - Shahab Pasha

, Jacob Donley, Christian H. Ritz
:
Blind speaker counting in highly reverberant environments by clustering coherence features. 1684-1687 - Yuexian Zou, Rongzhi Gu, Disong Wang, Aimin Jiang

, Christian H. Ritz
:
Learning a robust DOA estimation model with acoustic vector sensor cues. 1688-1691 - Zhong-Hua Fu, Lei Xie, Peng Li, Jiaen Liang:

Frequency-invariant differential microphone array design in the STFT domain. 1692-1695 - Shahab Pasha

, Christian H. Ritz
, Yue Xian Zou:
Spatial multi-channel linear prediction for dereverberation of ad-hoc microphones. 1696-1700 - Suguru Hirokawa, Shin Kurihara, Hisakazu Kikuchi:

Distributed video coding based on compressive sensing and intra-predictive coding. 1701-1706 - Savath Saypadith

, Watchara Ruangsang
, Supavadee Aramvith
:
Optimized human detection on the embedded computer vision system. 1707-1711 - Yanlong Gao

, Yan Feng:
Classification of spectral compressive hyperspectral images using morphological profiles. 1712-1718 - Yongfei Zhang, Rui Fan, Chao Zhang, Gang Wang, Zhe Li:

SIMD acceleration for HEVC encoding on DSP. 1719-1725 - Seishi Takamura, Atsushi Shimizu:

Efficient video coding using rigid object tracking. 1726-1729 - Soo Hyun Bae, In Kyu Choi, Hyung Yong Kim, Kang Hyun Lee, Nam Soo Kim:

Overlapping acoustic event classification based on joint training with source separation. 1730-1734 - In Kyu Choi, Soo Hyun Bae, Sung Jun Cheon, Won-Ik Cho, Nam Soo Kim:

Weakly labeled acoustic event detection using local detector and global classifier. 1735-1738 - Gen Takahashi, Takeshi Yamada, Nobutaka Ono

, Shoji Makino:
Performance evaluation of acoustic scene classification using DNN-GMM and frame-concatenated acoustic features. 1739-1743 - Nattapong Kurpukdee, Tomoki Koriyama

, Takao Kobayashi, Sawit Kasuriya, Chai Wutiwiwatchai, Poonlap Lamsrichan:
Speech emotion recognition using convolutional long short-term memory neural network and support vector machines. 1744-1749 - Zhichao Peng, Zhi Zhu, Masashi Unoki

, Jianwu Dang, Masato Akagi:
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank. 1750-1755 - Samer Jammal, Tammam Tillo, Jimin Xiao:

Multi-resolution for disparity estimation with convolutional neural networks. 1756-1761 - Han-Ul Kim, Chang-Su Kim

:
PGT: Proposal-guided object tracking. 1762-1767 - Vien Gia An, Chul Lee

:
Single-shot high dynamic range imaging via deep convolutional neural network. 1768-1772 - Eu-Tteum Baek, Yo-Sung Ho:

Stereo matching using relative total variation and entropy. 1773-1776 - Ying Gu, Mark D. Rice, Wei Xiong, Liyuan Li:

A new approach for image segmentation with shape priors based on the Potts model. 1777-1782 - Ryo Hayakawa

, Kazunori Hayashi
:
Binary vector reconstruction via discreteness-aware approximate message passing. 1783-1789 - Kotaro Kihara, Toshihiko Nishimura, Takeo Ohgane, Yasutaka Ogawa:

Signal detection with belief propagation in Faster-than-Nyquist signaling. 1790-1794 - Akihide David Shigyo, Koji Ishibashi:

QR-decomposed generalized belief propagation with smart message reduction for low-complexity MIMO signal detection. 1795-1799 - Takumi Takahashi, Shinsuke Ibi, Seiichi Sampei:

Design of adaptively scaled belief in large MIMO detection for higher-order modulation. 1800-1505 - Shunsuke Imai, Osamu Takyu, Fumihito Sasamori, Shiro Handa:

A study of monitoring system for radio leak with massive radio sensors. 1806-1810 - Kosuke Shimizu, Taizo Suzuki:

Cube-based encryption connected prior to motion JPEG standard. 1811-1814 - Kazuya Kawai, Junya Yamada, Hidekata Hontani, Tatsuya Yokota, Muneyuki Sakata, Yuichi Kimura:

A robust PET image reconstruction using constrained non-negative matrix factorization. 1815-1818 - Fairoza Amira Binti Hamzah, Taichi Yoshida, Masahiro Iwahashi:

Four-dimensional image compression with region of interest based on non-separable double lifting integer wavelet transform. 1819-1823 - Satoshi Nagayama, Shogo Muramatsu, Hiroyoshi Yamada, Yuuichi Sugiyama:

Millimeter wave radar image denoising with complex nonseparable oversampled lapped transform. 1824-1829 - Yusuke Nomura, Ryutaro Ogawa, Seisuke Kyochi, Taizo Suzuki:

Multiscale directional transforms based on cosine-sine modulated filter banks for sparse directional image representation. 1830-1834

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














