20th SPECOM 2018: Leipzig, Germany
- Alexey Karpov, Oliver Jokisch, Rodmonga Potapova:
Speech and Computer - 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18-22, 2018, Proceedings. Lecture Notes in Computer Science 11096, Springer 2018, ISBN 978-3-319-99578-6
- Oleg Akhtiamov, Vasily Palkov:
Gaze, Prosody and Semantics: Relevance of Various Multimodal Signals to Addressee Detection in Human-Human-Computer Conversations. 1-10
- Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
A Continuous Vocoder Using Sinusoidal Model for Statistical Parametric Speech Synthesis. 11-20
- Sergei Astapov, Aleksandr Lavrentyev, Evgeniy Shuranov:
Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming. 21-31
- Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy:
Exploring End-to-End Techniques for Low-Resource Speech Recognition. 32-41
- Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko, Ekaterina Baeva:
Towards a Description of Pragmatic Markers in Russian Everyday Speech. 42-48
- Christopher G. Buchanan, Matthew P. Aylett, David A. Braude:
Adding Personality to Neutral Speech Synthesis Voices. 49-57
- Martin Bulín, Lubos Smídl, Jan Svec:
Towards Network Simplification for Low-Cost Devices by Removing Synapses. 58-67
- Lukás Bures, Petr Neduchal, Miroslav Hlavác, Marek Hrúz:
Generation of Synthetic Images of Full-Text Documents. 68-75
- Felix Burkhardt, Benjamin Weiss:
Speech Synthesizing Simultaneous Emotion-Related States. 76-85
- Marco Canora, Fernando García-Granada, Emilio Sanchis, Encarna Segarra:
An Approach to Automatic Summarization of Television Programs. 86-93
- George Christodoulides:
The Prosody of Discourse Makers alors and et in French: A Corpus-Based Study on Multiple Speaking Styles. 94-102
- Adam Chýlek, Lubos Smídl, Jakub Nedved:
Choosing a Dialogue System's Modality in Order to Minimize User's Workload. 103-112
- Erik Edwards, Michael Brenndoerfer, Amanda Robinson, Najmeh Sadoughi, Greg P. Finley, Maxim Korenevsky, Nico Axtmann, Mark Miller, David Suendermann-Oeft:
A Free Synthetic Corpus for Speaker Diarization Research. 113-122
- Erik Edwards, Amanda Robinson, Najmeh Sadoughi, Greg P. Finley, Maxim Korenevsky, Michael Brenndoerfer, Nico Axtmann, Mark Miller, David Suendermann-Oeft:
Speaker Diarization: A Top-Down Approach Using Syllabic Phonology. 123-133
- Olga Egorow, Ingo Siegert, Andreas Wendemuth:
Improving Emotion Recognition Performance by Random-Forest-Based Feature Selection. 134-144
- Polina Eismont, Vladislav Metelyagin, Elena I. Riekhakaynen:
Coherence Understanding Through Cohesion Markers: The Case of Child Spoken Language. 145-154
- Dmitrii Fedotov, Heysem Kaya, Alexey Karpov:
Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup. 155-165
- Carlos Ferreira, Bruno Direito, Alexandre Sayal, Marco Simões, Inês Cadório, Paula Martins, Marisa Lousada, Daniela Figueiredo, Miguel Castelo-Branco, António J. S. Teixeira:
Functional Mapping of Inner Speech Areas: A Preliminary Study with Portuguese Speakers. 166-176
- Greg P. Finley, Erik Edwards, Wael Salloum, Amanda Robinson, Najmeh Sadoughi, Nico Axtmann, Maxim Korenevsky, Michael Brenndoerfer, Mark Miller, David Suendermann-Oeft:
Semi-Supervised Acoustic Model Retraining for Medical ASR. 177-187
- Jing Han, Maximilian Schmitt, Björn W. Schuller:
You Sound Like Your Counterpart: Interpersonal Speech Analysis. 188-197
- François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia A. Tomashenko, Yannick Estève:
TED-LIUM 3: Twice as Much Data and Corpus Repartition for Experiments on Speaker Adaptation. 198-208
- Miroslav Hlavác, Ivan Gruber, Milos Zelezný, Alexey Karpov:
LipsID Using 3D Convolutional Neural Networks. 209-214
- Rüdiger Hoffmann, Peter Birkholz, Falk Gabriel, Rainer Jäckel:
From Kratzenstein to the Soviet Vocoder: Some Results of a Historic Research Project in Speech Technology. 215-225
- Marek Hrúz, Miroslav Hlavác:
LSTM Neural Network for Speaker Change Detection in Telephone Conversations. 226-233
- Takuto Isoyama, Masashi Unoki:
Noise Suppression Method Based on Modulation Spectrum Analysis. 234-244
- Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Milos Zelezný:
Designing Advanced Geometric Features for Automatic Russian Visual Speech Recognition. 245-254
- Markéta Juzová:
On the Comparison of Different Phrase Boundary Detection Approaches Trained on Czech TTS Speech Corpora. 255-263
- Tatiana Kachkovskaia, Mayya Nurislamova:
Word-Initial Consonant Lengthening in Stressed and Unstressed Syllables in Russian. 264-273
- Arman Kaliyev, Sergey V. Rybin, Yuri N. Matveev:
Phoneme Duration Prediction for Kazakh Language. 274-280
- Stamatis Karlos, Konstantinos Kaleris, Nikos Fazakis, Vasileios G. Kanas, Sotiris Kotsiantis:
Optimized Active Learning Strategy for Audiovisual Speaker Recognition. 281-290
- Irina S. Kipyatkova:
Improving Russian LVCSR Using Deep Neural Networks for Acoustic and Language Modeling. 291-300
- Daniil Kocharov, Vera Evdokimova, Karina Evgrafova, Mariia Morskovatykh:
Labialization of Unstressed Vowels in Russian: Phonetic and Perceptual Evidence. 301-310
- Liubov Kovriguina, Ivan Shilin, Alina Putintseva, Alexander Shipilo:
Multilevel Annotation in the Corpus for Parsing Russian Spontaneous Speech. 311-320
- Anat Lerner, Oren Miara, Sarit Malayev, Vered Silber-Varod:
The Influence of the Interlocutor's Gender on the Speaker's Role Identification. 321-330
- Tatiana Litvinova, Pavel Seredin, Olga Litvinova, Tatiana Dankova, Olga Zagorovskaya:
On the Stability of Some Idiolectal Features. 331-336
- Boris Lobanov, Vladimir Zhitko, Vadim Zahariev:
A Prototype of the Software System for Study, Training and Analysis of Speech Intonation. 337-346
- Elena E. Lyakso, Olga V. Frolova:
Speech Interaction in "Mother-Child" Dyads with 4-7 Years Old Typically Developing Children and Children with Autism Spectrum Disorders. 347-356
- Elena E. Lyakso, Olga V. Frolova, Aleksey Grigorev, Viktor Gorodnyi, Aleksandr Nikolaev, Yuri N. Matveev:
Speech Features of Adults with Autism Spectrum Disorders and Mental Retardation. 357-366
- Thomas Manzini, Alan W. Black:
Towards Improving Intelligibility of Black-Box Speech Synthesizers in Noise. 367-376
- Nikita Markovnikov, Irina S. Kipyatkova, Elena E. Lyakso:
End-to-End Speech Recognition in Russian. 377-386
- Martin Matura, Markéta Juzová:
Correction of Formal Prosodic Structures in Czech Corpora Using Legendre Polynomials. 387-397
- Martin Matura, Markéta Juzová, Jindrich Matousek:
On the Contribution of Articulatory Features to Speech Synthesis. 398-407
- Martin Meszaros, Franziska Trojahn, Michael Maruschke, Oliver Jokisch:
QuARTCS: A Tool Enabling End-to-Any Speech Quality Assessment of WebRTC-Based Calls. 408-418
- Petr Mizera, Petr Pollák:
Automatic Phonetic Segmentation and Pronunciation Detection with Various Approaches of Acoustic Modeling. 419-429
- Eduardo Mizraji, Andrés Pomi, Juan Lin:
Improving Neural Models of Language with Input-Output Tensor Contexts. 430-440
- Anfisa Naumova:
Sociolinguistic Variability of Predicate Groups in Colloquial Russian Speech. 441-450
- Thai Son Nguyen, Matthias Sperber, Sebastian Stüker, Alex Waibel:
Building Real-Time Speech Recognition Without CMVN. 451-460
- Dariya Novokhrestova, Evgeny Kostyuchenko, Roman V. Meshcheryakov:
Choice of Signal Short-Term Energy Parameter for Assessing Speech Intelligibility in the Process of Speech Rehabilitation. 461-469
- Jaromír Novotný, Pavel Ircing:
The Benefit of Document Embedding in Unsupervised Document Classification. 470-478
- Siham Ouamour, Halim Sayoud:
A Comparative Survey of Authorship Attribution on Short Arabic Texts. 479-489
- Vedhas Pandit, Maximilian Schmitt, Nicholas Cummins, Franz Graf, Lucas Paletta, Björn W. Schuller:
How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers. 490-500
- Olga Perepelkina, Evdokia Kazimirova, Maria Konstantinova:
RAMAS: Russian Multimodal Corpus of Dyadic Interaction for Affective Computing. 501-510
- Gábor Pintér, Mira Schielke, Rico Petrick:
Investigating Word Segmentation Techniques for German Using Finite-State Transducers. 511-521
- Branislav M. Popovic, Edvin Pakoci, Darko Pekar:
A Comparison of Language Model Training Techniques in a Continuous Speech Recognition System for Serbian. 522-531
- Rodmonga Potapova, Liliya Komalova, Vsevolod Potapov:
Perceptual-Auditory Evaluation of the Aggressive Speech Behavior: Gender Aspect (on the Basis of Russian and Spanish Languages). 532-541
- Rodmonga Potapova, Vsevolod Potapov:
Main Determinants of the Acmeologic Personality Profiling. 542-551
- Eran Raveh, Ingmar Steiner, Iona Gessinger, Bernd Möbius:
Studying Mutual Phonetic Influence with a Web-Based Spoken Dialogue System. 552-562
- Najmeh Sadoughi, Greg P. Finley, Erik Edwards, Amanda Robinson, Maxim Korenevsky, Michael Brenndoerfer, Nico Axtmann, Mark Miller, David Suendermann-Oeft:
Detecting Section Boundaries in Medical Dictations: Toward Real-Time Conversion of Medical Dictations to Clinical Reports. 563-573
- Michelina Savino, Loredana Lapertosa, Mario Refice:
Seeing or Not Seeing Your Conversational Partner: The Influence of Interaction Modality on Prosodic Entrainment. 574-584
- Tina Schuh, Stephan Dreiseitl:
Evaluating Novel Features for Aggressive Language Detection. 585-595
- Tatiana Y. Sherstinova:
Quantitative Data on POS Distribution in the Beginnings and the Ends of Utterances in Everyday Russian Speech. 596-605
- Tatiana Shevchenko, Tatiana Sokoreva:
Corpus Data on Adult Life-Long Trajectory of Prosody Development in American English, with Special Reference to Middle Age. 606-614
- Nikolay Shilov, Alexey M. Kashevnik, Sergey Mikhailov:
Context-Aware Generation of Personalized Audio Tours: Approach and Evaluation. 615-624
- Ingo Siegert, Alicia Flores Lotz, Olga Egorow, Susann Wolff:
Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition. 625-635
- Vered Silber-Varod, Anat Lerner, Oliver Jokisch:
Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers' Role. 636-645
- Lubos Smídl, Jan Svec, Ales Prazák, Jan Trmal:
Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition. 646-655
- Anton Stepikhov, Anastassia Loukina:
Personality, Working Memory Capacity and Expert Manual Annotation of German Spontaneous Speech. 656-666
- Mikhail Stolbov, Marina Tatarnikova, Quan Trong The:
Using Dual-Element Microphone Arrays for Automatic Keyword Recognition. 667-675
- Daniel Tihelka, Zdenek Hanzlícek, Markéta Juzová, Jindrich Matousek:
First Steps Towards Hybrid Speech Synthesis in Czech TTS System ARTIC. 676-686
- Maxim Tkachenko, Alexander Yamshinin, Mikhail Kotov, Marina Nastasenko:
Lightweight Embeddings for Speaker Verification. 687-696
- László Tóth, György Kovács, Dirk Van Compernolle:
A Perceptually Inspired Data Augmentation Method for Noise Robust CNN Acoustic Models. 697-706
- Constanze Tschöpe, Frank Duckhorn, Markus Huber, Werner Meyer, Matthias Wolff:
A Cognitive User Interface for a Multi-modal Human-Machine Interaction. 707-717
- Amir Vaheb, Ali Janalizadeh Choobbasti, S. H. E. Mortazavi Najafabadi, Saeid Safavi:
Investigating Language Variability on the Performance of Speaker Verification Systems. 718-727
- Jan Vanek, Josef Michálek, Josef Psutka:
Recurrent DNNs and Its Ensembles on the TIMIT Phone Recognition Task. 728-736
- Alena Velichko, Viktor Budkov, Ildar Kagirov, Alexey A. Karpov:
Comparative Analysis of Classification Methods for Automatic Deception Detection in Speech. 737-746
- Jochen Weiner, Tanja Schultz:
Selecting Features for Automatic Screening for Dementia Based on Speech. 747-756
- Matthias Wolff, Günther Wirsching, Markus Huber, Peter beim Graben, Ronald Römer, Ingo Schmitt:
A Fock Space Toolbox and Some Applications in Computational Cognition. 757-767
- Olga Yakovenko, Ivan Bondarenko, Mariya Borovikova, Daniil Vodolazsky:
Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. 768-777
- Zbynek Zajíc, Lucie Zajícová, Josef V. Psutka, Petr Salajka, Jaromír Novotný, Ales Prazák, Ludek Müller:
First Insight into the Processing of the Language Consulting Center Data. 778-787