default search action

combined dblp search
author search
venue search
publication search

ask others

20th Interspeech 2019: Graz, Austria

> Home > Conferences and Workshops > Interspeech

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/2019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/2019
Gernot Kubin, Zdravko Kacic:
20th Annual Conference of the International Speech Communication Association, Interspeech 2019, Graz, Austria, September 15-19, 2019. ISCA 2019

ISCA Medal 2019 Keynote Speech

- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/Tokuda19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tokuda19
Keiichi Tokuda:
Statistical Approach to Speech Synthesis: Past, Present and Future.

Spoken Language Processing for Children’s Speech

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuGPK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuGPK19
Fei Wu, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network. 1-5
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YeungA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YeungA19
Gary Yeung, Abeer Alwan:
A Frequency Normalization Technique for Kindergarten Speech Recognition Inspired by the Role of f_o in Vowel Perception. 6-10
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaleCDSA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaleCDSA19
Robert Gale, Liu Chen, Jill Dolata, Jan P. H. van Santen, Meysam Asgari:
Improving ASR Systems for Children with Autism and Language Impairment Using Domain-Focused DNN Transfer Techniques. 11-15
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RibeiroERR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RibeiroERR19
Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals:
Ultrasound Tongue Imaging for Diarization and Alignment of Child Speech Therapy Sessions. 16-20
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoukinaKLQGMMZW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoukinaKLQGMMZW19
Anastassia Loukina, Beata Beigman Klebanov, Patrick L. Lange, Yao Qian, Binod Gyawali, Nitin Madnani, Abhinav Misra, Klaus Zechner, Zuowei Wang, John Sabatini:
Automated Estimation of Oral Reading Fluency During Summer Camp e-Book Reading with MyTurnToRead. 21-25
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LopesMC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LopesMC19
Vanessa Lopes, João Magalhães, Sofia Cavaco:
Sustained Vowel Game: A Computer Therapy Game for Children with Dysphonia. 26-30

Dynamics of Emotional Speech Exchanges in Multimodal Communication

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EspositoACRETC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EspositoACRETC19
Anna Esposito, Terry Amorese, Marialucia Cuciniello, Maria Teresa Riviello, Antonietta Maria Esposito, Alda Troncone, Gennaro Cordasco:
The Dependability of Voice on Elders' Acceptance of Humanoid Agents. 31-35
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NiebuhrS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NiebuhrS19
Oliver Niebuhr, Uffe Schjoedt:
God as Interlocutor - Real or Imaginary? Prosodic Markers of Dialogue Speech and Expected Efficacy in Spoken Prayer. 36-40
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CohnZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CohnZ19
Michelle Cohn, Georgia Zellou:
Expressiveness Influences Human Vocal Alignment Toward voice-AI. 41-45
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LaiAMTHF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LaiAMTHF19
Catherine Lai, Beatrice Alex, Johanna D. Moore, Leimin Tian, Tatsuro Hori, Gianpiero Francesca:
Detecting Topic-Oriented Speaker Stance in Conversational Speech. 46-50
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SebastianP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SebastianP19
Jilt Sebastian, Piero Pierucci:
Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts. 51-55
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RajwadiGWCC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RajwadiGWCC19
Marvin Rajwadi, Cornelius Glackin, Julie A. Wall, Gérard Chollet, Nigel Cannings:
Explaining Sentiment Classification. 56-60
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KleinleinJMCF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KleinleinJMCF19
Ricardo Kleinlein, Cristina Luna Jiménez, Juan Manuel Montero, Zoraida Callejas, Fernando Fernández Martínez:
Predicting Group-Level Skin Attention to Short Movies from Audio-Based LSTM-Mixture of Experts Models. 61-65

End-to-End Speech Recognition

- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/Schluter19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Schluter19
Ralf Schlüter:
Survey Talk: Modeling in Automatic Speech Recognition: Beyond Hidden Markov Models.
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PhamNN0W19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PhamNN0W19
Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Alex Waibel:
Very Deep Self-Attention Networks for End-to-End Speech Recognition. 66-70
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLGLKCNG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLGLKCNG19
Jason Li, Vitaly Lavrukhin, Boris Ginsburg, Ryan Leary, Oleksii Kuchaiev, Jonathan M. Cohen, Huyen Nguyen, Ravi Teja Gadde:
Jasper: An End-to-End Convolutional Neural Acoustic Model. 71-75
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoritzHR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoritzHR19
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition. 76-80
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BelinkovAG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BelinkovAG19
Yonatan Belinkov, Ahmed Ali, James R. Glass:
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition. 81-85

Speech Enhancement: Multi-Channel

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TawaraKO19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TawaraKO19
Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa:
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder. 86-90
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TeschRG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TeschRG19
Kristina Tesch, Robert Rehr, Timo Gerkmann:
On Nonlinear Spatial Filtering in Multichannel Speech Enhancement. 91-95
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Martin-DonasHHG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Martin-DonasHHG19
Juan M. Martín-Doñas, Jens Heitkaemper, Reinhold Haeb-Umbach, Angel M. Gomez, Antonio M. Peinado:
Multi-Channel Block-Online Source Extraction Based on Utterance Adaptation. 96-100
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BagheriG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BagheriG19
Saeed Bagheri, Daniele Giacobello:
Exploiting Multi-Channel Speech Presence Probability in Parametric Multi-Channel Wiener Filter. 101-105
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TogamiK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TogamiK19
Masahito Togami, Tatsuya Komatsu:
Variational Bayesian Multi-Channel Speech Dereverberation Under Noisy Environments with Probabilistic Convolutive Transfer Function. 106-110
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakataniK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakataniK19
Tomohiro Nakatani, Keisuke Kinoshita:
Simultaneous Denoising and Dereverberation for Low-Latency Applications Using Frame-by-Frame Online Unified Convolutional Beamformer. 111-115

Speech Production: Individual Differences and the Brain

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SnyderCZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SnyderCZ19
Cathryn Snyder, Michelle Cohn, Georgia Zellou:
Individual Variation in Cognitive Processing Style Predicts Differences in Phonetic Imitation of Device and Human Voices. 116-120
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IllaG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IllaG19
Aravind Illa, Prasanta Kumar Ghosh:
An Investigation on Speaker Specific Articulatory Synthesis with Speaker Independent Articulatory Inversion. 121-125
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangBHLW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangBHLW19
Xiaohan Zhang, Chongke Bi, Kiyoshi Honda, Wenhuan Lu, Jianguo Wei:
Individual Difference of Relative Tongue Size and its Acoustic Effects. 126-130
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YoshinagaNW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YoshinagaNW19
Tsukasa Yoshinaga, Kazunori Nozaki, Shigeo Wada:
Individual Differences of Airflow and Sound Generation in the Vocal Tract of Sibilant /s/. 131-135
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/UttamKSASMS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/UttamKSASMS19
Shashwat Uttam, Yaman Kumar, Dhruva Sahrawat, Mansi Aggarwal, Rajiv Ratn Shah, Debanjan Mahata, Amanda Stent:
Hush-Hush Speak: Speech Reconstruction Using Silent Videos. 136-140
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SahaAF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SahaAF19
Pramit Saha, Muhammad Abdul-Mageed, Sidney S. Fels:
SPEAK YOUR MIND! Towards Imagined Speech Recognition with Hierarchical Deep Learning. 141-145

Speech Signal Characterization 1

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChungHTG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChungHTG19
Yu-An Chung, Wei-Ning Hsu, Hao Tang, James R. Glass:
An Unsupervised Autoregressive Model for Speech Representation Learning. 146-150
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangB19
Feng Huang, Péter Balázs:
Harmonic-Aligned Frame Mask Based on Non-Stationary Gabor Transform with Application to Content-Dependent Speaker Comparison. 151-155
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MRD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MRD19
Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das:
Glottal Closure Instants Detection from Speech Signal by Deep Features Extracted from Raw Speech and Linear Prediction Residual. 156-160
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PascualRSBB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PascualRSBB19
Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio:
Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks. 161-165
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NelloreDNG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NelloreDNG19
Bhanu Teja Nellore, Sri Harsha Dumpala, Karan Nathwani, Suryakanth V. Gangashetty:
Excitation Source and Vocal Tract System Based Acoustic Features for Detection of Nasals in Continuous Speech. 166-170
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChatziagapiPSPN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChatziagapiPSPN19
Aggelina Chatziagapi, Georgios Paraskevopoulos, Dimitris Sgouropoulos, Georgios Pantazopoulos, Malvina Nikandrou, Theodoros Giannakopoulos, Athanasios Katsamanis, Alexandros Potamianos, Shrikanth Narayanan:
Data Augmentation Using GANs for Speech Emotion Recognition. 171-175

Neural Waveform Generation

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KonsSSRH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KonsSSRH19
Zvi Kons, Slava Shechtman, Alexander Sorin, Carmel Rabinovitz, Ron Hoory:
High Quality, Lightweight and Adaptable TTS Using LPCNet. 176-180
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Lorenzo-TruebaD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Lorenzo-TruebaD19
Jaime Lorenzo-Trueba, Thomas Drugman, Javier Latorre, Thomas Merritt, Bartosz Putrycz, Roberto Barra-Chicote, Alexis Moinet, Vatsal Aggarwal:
Towards Achieving Robust Universal Neural Vocoding. 181-185
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NeekharaDPDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NeekharaDPDM19
Paarth Neekhara, Chris Donahue, Miller S. Puckette, Shlomo Dubnov, Julian J. McAuley:
Expediting TTS Synthesis with Adversarial Vocoding. 186-190
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MustafaBBSM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MustafaBBSM19
Ahmed Mustafa, Arijit Biswas, Christian Bergler, Julia Schottenhamml, Andreas K. Maier:
Analysis by Adversarial Synthesis - A Novel Approach for Speech Vocoding. 191-195
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuHTKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuHTKT19
Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation. 196-200
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianC019
Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data. 201-205

Attention Mechanism for Speaker State Recognition

- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/HanPM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HanPM19
Kyu Jeong Han, Ramon Prieto, Tao Ma:
Survey Talk: When Attention Meets Speech Applications: Speech & Speaker Recognition Perspective.
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoB0CWS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoB0CWS19
Ziping Zhao, Zhongtian Bao, Zixing Zhang, Nicholas Cummins, Haishuai Wang, Björn W. Schuller:
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition. 206-210
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiL19
Jeng-Lin Li, Chi-Chun Lee:
Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile. 211-215
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gallardo-Antolin19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gallardo-Antolin19
Ascensión Gallardo-Antolín, Juan Manuel Montero:
A Saliency-Based Attention LSTM Model for Cognitive Load Classification from Speech. 216-220
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Mallol-RagoltaZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Mallol-RagoltaZ19
Adria Mallol-Ragolta, Ziping Zhao, Lukas Stappen, Nicholas Cummins, Björn W. Schuller:
A Hierarchical Attention Network-Based Approach for Depression Detection from Transcribed Clinical Interviews. 221-225

ASR Neural Network Training — 1

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Carmantini0R19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Carmantini0R19
Andrea Carmantini, Peter Bell, Steve Renals:
Untranscribed Web Audio for Low Resource Speech Recognition. 226-230
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuscherBIKMZSN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuscherBIKMZSN19
Christoph Lüscher, Eugen Beck, Kazuki Irie, Markus Kitza, Wilfried Michel, Albert Zeyer, Ralf Schlüter, Hermann Ney:
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention. 231-235
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaHTFNW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaHTFNW19
Naoyuki Kanda, Shota Horiguchi, Ryoichi Takashima, Yusuke Fujita, Kenji Nagamatsu, Shinji Watanabe:
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition. 236-240
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengGLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengGLG19
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. 241-245
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangCW019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangCW019
Peidong Wang, Jia Cui, Chao Weng, Dong Yu:
Large Margin Training for Attention Based End-to-End Speech Recognition. 246-250
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MacCZP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MacCZP19
Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny:
Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition. 251-255

Zero-Resource ASR

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MildeB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MildeB19
Benjamin Milde, Chris Biemann:
SparseSpeech: Unsupervised Acoustic Unit Discovery with Memory-Augmented Sequence Autoencoders. 256-260
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OndelVBC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OndelVBC19
Lucas Ondel, Hari Krishna Vydana, Lukás Burget, Jan Cernocký:
Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery. 261-265
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiguchiTKO19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiguchiTKO19
Yosuke Higuchi, Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa:
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages. 266-270
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PrasadERM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PrasadERM19
Manasa Prasad, Daan van Esch, Sandy Ritchie, Jonas Fromseier Mortensen:
Building Large-Vocabulary ASR Systems for Languages Without Any Audio Training Data. 271-275
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AzuhHG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AzuhHG19
Emmanuel Azuh, David Harwath, James R. Glass:
Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio. 276-280
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FengL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FengL19
Siyuan Feng, Tan Lee:
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation. 281-285

Sociophonetics

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NissenBDD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NissenBDD19
Shawn L. Nissen, Sharalee Blunck, Anita Dromey, Christopher Dromey:
Listeners' Ability to Identify the Gender of Preadolescent Children in Different Linguistic Contexts. 286-290
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AhlersM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AhlersM19
Wiebke Ahlers, Philipp Meer:
Sibilant Variation in New Englishes: A Comparative Sociophonetic Study of Trinidadian and American English /s(tr)/-Retraction. 291-295
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GubianHSSW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GubianHSSW19
Michele Gubian, Jonathan Harrington, Mary Stevens, Florian Schiel, Paul Warren:
Tracking the New Zealand English NEAR/SQUARE Merger Using Functional Principal Components Analysis. 296-300
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GessingerMARS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GessingerMARS19
Iona Gessinger, Bernd Möbius, Bistra Andreeva, Eran Raveh, Ingmar Steiner:
Phonetic Accommodation in a Wizard-of-Oz Experiment: Intonation and Segments. 301-305
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NiebuhrM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NiebuhrM19
Oliver Niebuhr, Jan Michalsky:
PASCAL and DPA: A Pilot Study on Using Prosodic Competence Scores to Predict Communicative Skills for Team Working and Public Speaking. 306-310
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MichalskySS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MichalskySS19
Jan Michalsky, Heike Schoormann, Thomas Schultze:
Towards the Prosody of Persuasion in Competitive Negotiation. The Relationship Between f0 and Negotiation Success in Same Sex Sales Tasks. 311-315

Resources – Annotation – Evaluation

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SagerSRV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SagerSRV19
Jacob Sager, Ravi Shankar, Jacob Reinhold, Archana Venkataraman:
VESUS: A Crowd-Annotated Database to Study Emotion Production and Perception in Spoken English. 316-320
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KohMKAANT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KohMKAANT19
Jia Xin Koh, Aqilah Mislan, Kevin Khoo, Brian Ang, Wilson Ang, Charmaine Ng, Ying-Ying Tan:
Building the Singapore English National Speech Corpus. 321-325
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PichenyTKACS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PichenyTKACS19
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. 326-330
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RamtekeSHNAK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RamtekeSHNAK19
Pravin Bhaskar Ramteke, Sujata Supanekar, Pradyoth Hegde, Hanna Nelson, Venkataraja Aithal, Shashidhar G. Koolagudi:
NITK Kids' Speech Corpus. 331-335
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AliKH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AliKH19
Ahmed Ali, Salam Khalifa, Nizar Habash:
Towards Variability Resistant Dialectal Speech Evaluation. 336-340
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FallgrenME19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FallgrenME19
Per Fallgren, Zofia Malisz, Jens Edlund:
How to Annotate 100 Hours in 45 Minutes. 341-345

Speaker Recognition and Diarization

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DiezBWRC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DiezBWRC19
Mireia Díez, Lukás Burget, Shuai Wang, Johan Rohdin, Jan Cernocký:
Bayesian HMM Based x-Vector Clustering for Speaker Diarization. 346-350
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VestmanLKK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VestmanLKK19
Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of i-Vectors Enabled by GPU Acceleration. 351-355
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShonDRG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShonDRG19
Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation. 356-360
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoSMLJD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoSMLJD19
Zhifu Gao, Yan Song, Ian McLoughlin, Pengcheng Li, Yiheng Jiang, Li-Rong Dai:
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System. 361-365
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LinYLBB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinYLBB19
Qingjian Lin, Ruiqing Yin, Ming Li, Hervé Bredin, Claude Barras:
LSTM Based Similarity Measurement with Spectral Clustering for Speaker Diarization. 366-370
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChungLH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChungLH19
Joon Son Chung, Bong-Jin Lee, Icksang Han:
Who Said That?: Audio-Visual Speaker Diarisation of Real-World Meetings. 371-375
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XieGPK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XieGPK19
Jiamin Xie, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Multi-PLDA Diarization on Children's Speech. 376-380
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/McCreeSG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/McCreeSG19
Alan McCree, Gregory Sell, Daniel Garcia-Romero:
Speaker Diarization Using Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings. 381-385
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GhahabiF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GhahabiF19
Omid Ghahabi, Volker Fischer:
Speaker-Corrupted Embeddings for Online Speaker Diarization. 386-390
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkH0HZGN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkH0HZGN19
Tae Jin Park, Kyu Jeong Han, Jing Huang, Xiaodong He, Bowen Zhou, Panayiotis G. Georgiou, Shrikanth Narayanan:
Speaker Diarization with Lexical Information. 391-395
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShafeySS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShafeySS19
Laurent El Shafey, Hagen Soltau, Izhak Shafran:
Joint Speech Recognition and Speaker Diarization via Sequence Transduction. 396-400
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Cumani19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Cumani19
Sandro Cumani:
Normal Variance-Mean Mixtures for Unsupervised Score Calibration. 401-405
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamamotoLOK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamamotoLOK19
Hitoshi Yamamoto, Kong Aik Lee, Koji Okabe, Takafumi Koshinaka:
Speaker Augmentation and Bandwidth Extension for Deep Speaker Embedding. 406-410
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YilmazDZHB0L19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YilmazDZHB0L19
Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. 411-415
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DubeySH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DubeySH19
Harishchandra Dubey, Abhijeet Sangwan, John H. L. Hansen:
Toeplitz Inverse Covariance Based Robust Speaker Clustering for Naturalistic Audio Streams. 416-420

ASR for Noisy and Far-Field Speech

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KovacsTCL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KovacsTCL19
György Kovács, László Tóth, Dirk Van Compernolle, Marcus Liwicki:
Examining the Combination of Multi-Band Processing and Channel Dropout for Robust Speech Recognition. 421-425
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SoniP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SoniP19
Meet H. Soni, Ashish Panda:
Label Driven Time-Frequency Masking for Robust Continuous Speech Recognition. 426-430
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCWZ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCWZ019
Long Wu, Hangting Chen, Li Wang, Pengyuan Zhang, Yonghong Yan:
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning. 431-435
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MingC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MingC19
Ji Ming, Danny Crookes:
Full-Sentence Correlation: A Method to Handle Unpredictable Noise for Robust Speech Recognition. 436-440
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SoniJP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SoniJP19
Meet H. Soni, Sonal Joshi, Ashish Panda:
Generative Noise Modeling and Channel Simulation for Robust Speech Recognition in Unseen Conditions. 441-445
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumarR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarR19
Shashi Kumar, Shakti P. Rath:
Far-Field Speech Enhancement Using Heteroscedastic Autoencoder for Improved Speech Recognition. 446-450
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DelcroixWOKKON19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DelcroixWOKKON19
Marc Delcroix, Shinji Watanabe, Tsubasa Ochiai, Keisuke Kinoshita, Shigeki Karita, Atsunori Ogawa, Tomohiro Nakatani:
End-to-End SpeakerBeam for Single Channel Target Speech Recognition. 451-455
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HsuJN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HsuJN19
I-Hung Hsu, Ayush Jaiswal, Premkumar Natarajan:
NIESR: Nuisance Invariant End-to-End Speech Recognition. 456-460
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SuzukiOTNN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SuzukiOTNN19
Takahito Suzuki, Jun Ogata, Takashi Tsunakawa, Masafumi Nishida, Masafumi Nishimura:
Knowledge Distillation for Throat Microphone Speech Recognition. 461-465
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuXZCYX019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuXZCYX019
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Improved Speaker-Dependent Separation for CHiME-5 Challenge. 466-470
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangTW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangTW19
Peidong Wang, Ke Tan, DeLiang Wang:
Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling. 471-475
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangW19
Peidong Wang, DeLiang Wang:
Enhanced Spectral Features for Distortion-Independent Acoustic Modeling. 476-480
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NeekharaHPDMK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NeekharaHPDMK19
Paarth Neekhara, Shehzeen Hussain, Prakhar Pandey, Shlomo Dubnov, Julian J. McAuley, Farinaz Koushanfar:
Universal Adversarial Perturbations for Speech Recognition Systems. 481-485
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FujimotoK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FujimotoK19
Masakiyo Fujimoto, Hisashi Kawai:
One-Pass Single-Channel Noisy Speech Recognition Using a Combination of Noisy and Enhanced Features. 486-490
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuNLLYCPL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuNLLYCPL19
Bin Liu, Shuai Nie, Shan Liang, Wenju Liu, Meng Yu, Lianwu Chen, Shouye Peng, Changliang Li:
Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition. 491-495

Social Signals Detection and Speaker Traits Analysis

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangHH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangHH19
Zixiaofan Yang, Bingyan Hu, Julia Hirschberg:
Predicting Humor by Learning from Time-Aligned Comments. 496-500
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Dinkov0KN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Dinkov0KN19
Yoan Dinkov, Ahmed Ali, Ivan Koychev, Preslav Nakov:
Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata Information. 501-505
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AnL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AnL19
Guozhen An, Rivka Levitan:
Mitigating Gender and L1 Differences to Improve State and Trait Recognition. 506-509
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeningerSPWZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeningerSPWZ19
Felix Weninger, Yang Sun, Junho Park, Daniel Willett, Puming Zhan:
Deep Learning Based Mandarin Accent Identification for Accent Robust ASR. 510-514
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GosztolyaT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GosztolyaT19
Gábor Gosztolya, László Tóth:
Calibrating DNN Posterior Probability Estimates of HMM/DNN Models to Improve Social Signal Detection from Audio Data. 515-519
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoriNA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoriNA19
Hiroki Mori, Tomohiro Nagata, Yoshiko Arimoto:
Conversational and Social Laughter Synthesis with WaveNet. 520-523
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LudusanW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LudusanW19
Bogdan Ludusan, Petra Wagner:
Laughter Dynamics in Dyadic Conversations. 524-528
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TruongTJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TruongTJ19
Khiet P. Truong, Jürgen Trouvain, Michel-Pierre Jansen:
Towards an Annotation Scheme for Complex Laughter in Speech Corpora. 529-533
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BairdACSJMBRS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BairdACSJMBRS19
Alice Baird, Shahin Amiriparian, Nicholas Cummins, Sarah Sturmbauer, Johanna Janson, Eva-Maria Meßner, Harald Baumeister, Nicolas Rohleder, Björn W. Schuller:
Using Speech to Predict Sequentially Measured Cortisol Levels During a Trier Social Stress Test. 534-538
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BairdCHS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BairdCHS19
Alice Baird, Eduardo Coutinho, Julia Hirschberg, Björn W. Schuller:
Sincerity in Acted Speech: Presenting the Sincere Apology Corpus and Results. 539-543
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NiebuhrF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NiebuhrF19
Oliver Niebuhr, Kerstin Fischer:
Do not Hesitate! - Unless You Do it Shortly or Nasally: How the Phonetics of Filled Pauses Determine Their Subjective Frequency and Perceived Speaker Performance. 544-548
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Vasquez-CorreaK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Vasquez-CorreaK19
Juan Camilo Vásquez-Correa, Philipp Klumpp, Juan Rafael Orozco-Arroyave, Elmar Nöth:
Phonet: A Tool Based on Gated Recurrent Neural Networks to Extract Phonological Posteriors from Speech. 549-553

Applications of Language Technologies

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangCL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangCL19
Ching-Ting Chang, Shun-Po Chuang, Hung-yi Lee:
Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. 554-558
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MeierMPS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MeierMPS19
Moritz Meier, Celeste Mason, Felix Putze, Tanja Schultz:
Comparative Analysis of Think-Aloud Methods for Everyday Activities in the Context of Cognitive Robotics. 559-563
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BeefermanBR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BeefermanBR19
Doug Beeferman, William Brannon, Deb Roy:
RadioTalk: A Large-Scale Corpus of Talk Radio Transcripts. 564-568
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MdhaffarEHLDQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MdhaffarEHLDQ19
Salima Mdhaffar, Yannick Estève, Nicolas Hernandez, Antoine Laurent, Richard Dufour, Solen Quiniou:
Qualitative Evaluation of ASR Adaptation in a Lecture Context: Application to the PASTEL Corpus. 569-573
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MarinelliCTSFR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MarinelliCTSFR19
Federico Marinelli, Alessandra Cervone, Giuliano Tortoreto, Evgeny A. Stepanov, Giuseppe Di Fabbrizio, Giuseppe Riccardi:
Active Annotation: Bootstrapping Annotation Lexicon and Guidelines for Supervised NLU Learning. 574-578
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DabikeB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DabikeB19
Gerardo Roa Dabike, Jon Barker:
Automatic Lyric Transcription from Karaoke Vocal Tracks: Resources and a Baseline System. 579-583
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangH19
Qiang Huang, Thomas Hain:
Detecting Mismatch Between Speech and Transcription Using Cross-Modal Attention. 584-588
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VidalFB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VidalFB19
Jazmín Vidal, Luciana Ferrer, Leonardo Brambilla:
EpaDB: A Database for Development of Pronunciation Assessment Systems. 589-593
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AngerbauerAV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AngerbauerAV19
Katrin Angerbauer, Heike Adel, Ngoc Thang Vu:
Automatic Compression of Subtitles with Neural Networks and its Effect on User Experience. 594-598
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuoMGKR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuoMGKR19
Hongyin Luo, Mitra Mohtarami, James R. Glass, Karthik Krishnamurthy, Brigitte Richardson:
Integrating Video Retrieval and Moment Detection in a Unified Corpus for Video Question Answering. 599-603

Speech and Audio Characterization and Segmentation

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gutz0YG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gutz0YG19
Sarah E. Gutz, Jun Wang, Yana Yunusova, Jordan R. Green:
Early Identification of Speech Changes Due to Amyotrophic Lateral Sclerosis Using Machine Classification. 604-608
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KR19
Mohamed Ismail Yasar Arafath K, Aurobinda Routray:
Automatic Detection of Breath Using Voice Activity Detection and SVM Classifier with Application on News Reports. 609-613
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeoJSY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeoJSY19
Hee-Soo Heo, Jee-weon Jung, Hye-jin Shim, Ha-Jin Yu:
Acoustic Scene Classification Using Teacher-Student Learning with Soft-Labels. 614-618
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0005J19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0005J19
Yanping Chen, Hongxia Jin:
Rare Sound Event Detection Using Deep Learning and Data Augmentation. 619-623
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sharma019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sharma019
Bidisha Sharma, Haizhou Li:
A Combination of Model-Based and Feature-Based Strategy for Speech-to-Singing Alignment. 624-628
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShremGK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShremGK19
Yosi Shrem, Matthew Goldrick, Joseph Keshet:
Dr.VOT: Measuring Positive and Negative Voice Onset Time in the Wild. 629-633
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuiWCS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuiWCS19
Jun Hui, Yue Wei, Shutao Chen, Richard Hau Yue So:
Effects of Base-Frequency and Spectral Envelope on Deep-Learning Speech Separation and Recognition Models. 634-638
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShahP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShahP19
Nirmesh J. Shah, Hemant A. Patil:
Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion. 639-643
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShankarV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShankarV19
Ravi Shankar, Archana Venkataraman:
Weakly Supervised Syllable Segmentation by Vowel-Consonant Peak Classification. 644-648
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatejuCZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatejuCZ19
Lukás Mateju, Petr Cerva, Jindrich Zdánský:
An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs. 649-653
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TangKHM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TangKHM19
Zhenyu Tang, John D. Kanu, Kevin Hogan, Dinesh Manocha:
Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks. 654-658

Neural Techniques for Voice Conversion and Waveform Generation

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PaulPS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PaulPS19
Dipjyoti Paul, Yannis Pantazis, Yannis Stylianou:
Non-Parallel Voice Conversion Using Weighted Generative Adversarial Networks. 659-663
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChouL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChouL19
Ju-Chieh Chou, Hung-yi Lee:
One-Shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization. 664-668
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuWDLK0M19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuWDLK0M19
Hui Lu, Zhiyong Wu, Dongyang Dai, Runnan Li, Shiyin Kang, Jia Jia, Helen Meng:
One-Shot Voice Conversion with Global Speaker Embeddings. 669-673
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TobingWHKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TobingWHKT19
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder. 674-678
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTH19
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. 679-683
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuritaKTT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuritaKTT19
Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda:
Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds. 684-688
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoNWM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoNWM19
Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks. 689-693
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JuvelaBYA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JuvelaBYA19
Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram. 694-698
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamamotoSK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamamotoSK19
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation. 699-703
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MohammadiK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MohammadiK19
Seyed Hamidreza Mohammadi, Taehwan Kim:
One-Shot Voice Conversion with Disentangled Representations by Leveraging Phonetic Posteriorgrams. 704-708
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangWLTHKT0W19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangWLTHKT0W19
Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion. 709-713
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuCWSLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuCWSLM19
Songxiang Liu, Yuewen Cao, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng:
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams. 714-718
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenL019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenL019
Li-Wei Chen, Hung-yi Lee, Yu Tsao:
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech. 719-723
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingG19
Shaojin Ding, Ricardo Gutierrez-Osuna:
Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion. 724-728
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StephensonKTE19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StephensonKTE19
Cory Stephenson, Gokce Keskin, Anil Thomas, Oguz H. Elibol:
Semi-Supervised Voice Conversion with Amortized Variational Inference. 729-733

Model Adaptation for ASR

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DeyMBD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DeyMBD19
Subhadeep Dey, Petr Motlícek, Trung Bui, Franck Dernoncourt:
Exploiting Semi-Supervised Training Through a Dropout Regularization in End-to-End Speech Recognition. 734-738
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimSGG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimSGG19
Chanwoo Kim, Minkyu Shin, Abhinav Garg, Dhananjaya Gowda:
Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System. 739-743
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuWZ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuWZ019
Han Zhu, Li Wang, Pengyuan Zhang, Yonghong Yan:
Multi-Accent Adaptation Based on Gate Mechanism. 744-748
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuoS019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoS019
Pengcheng Guo, Sining Sun, Lei Xie:
Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition. 749-753
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KitzaGSN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KitzaGSN19
Markus Kitza, Pavel Golik, Ralf Schlüter, Hermann Ney:
Cumulative Adaptation for BLSTM Acoustic Models. 754-758
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XieLLW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XieLLW19
Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features. 759-763
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TsunooKAK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TsunooKAK19
Emiru Tsunoo, Yosuke Kashiwagi, Satoshi Asakawa, Toshiyuki Kumakura:
End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System. 764-768
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SariTH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SariTH19
Leda Sari, Samuel Thomas, Mark A. Hasegawa-Johnson:
Learning Speaker Aware Offsets for Speaker Adaptation of Neural Networks. 769-773
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimZB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimZB19
Khe Chai Sim, Petr Zadrazil, Françoise Beaufays:
An Investigation into On-Device Personalization of End-to-End Automatic Speech Recognition Models. 774-778
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control: