20th SPECOM 2018: Leipzig, Germany
- Alexey Karpov, Oliver Jokisch, Rodmonga Potapova:
Speech and Computer - 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18-22, 2018, Proceedings. Lecture Notes in Computer Science 11096, Springer 2018, ISBN 978-3-319-99578-6 - Oleg Akhtiamov, Vasily Palkov:
Gaze, Prosody and Semantics: Relevance of Various Multimodal Signals to Addressee Detection in Human-Human-Computer Conversations. 1-10 - Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
A Continuous Vocoder Using Sinusoidal Model for Statistical Parametric Speech Synthesis. 11-20 - Sergei Astapov, Aleksandr Lavrentyev, Evgeniy Shuranov:
Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming. 21-31 - Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy:
Exploring End-to-End Techniques for Low-Resource Speech Recognition. 32-41 - Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko, Ekaterina Baeva:
Towards a Description of Pragmatic Markers in Russian Everyday Speech. 42-48 - Christopher G. Buchanan, Matthew P. Aylett, David A. Braude:
Adding Personality to Neutral Speech Synthesis Voices. 49-57 - Martin Bulín, Lubos Smídl, Jan Svec:
Towards Network Simplification for Low-Cost Devices by Removing Synapses. 58-67 - Lukás Bures, Petr Neduchal, Miroslav Hlavác, Marek Hrúz:
Generation of Synthetic Images of Full-Text Documents. 68-75 - Felix Burkhardt, Benjamin Weiss:
Speech Synthesizing Simultaneous Emotion-Related States. 76-85 - Marco Canora, Fernando García-Granada, Emilio Sanchis, Encarna Segarra:
An Approach to Automatic Summarization of Television Programs. 86-93 - George Christodoulides:
The Prosody of Discourse Markers alors and et in French: A Corpus-Based Study on Multiple Speaking Styles. 94-102 - Adam Chýlek, Lubos Smídl, Jakub Nedved:
Choosing a Dialogue System's Modality in Order to Minimize User's Workload. 103-112 - Erik Edwards, Michael Brenndoerfer, Amanda Robinson, Najmeh Sadoughi, Greg P. Finley, Maxim Korenevsky, Nico Axtmann, Mark Miller, David Suendermann-Oeft:
A Free Synthetic Corpus for Speaker Diarization Research. 113-122 - Erik Edwards, Amanda Robinson, Najmeh Sadoughi, Greg P. Finley, Maxim Korenevsky, Michael Brenndoerfer, Nico Axtmann, Mark Miller, David Suendermann-Oeft:
Speaker Diarization: A Top-Down Approach Using Syllabic Phonology. 123-133 - Olga Egorow, Ingo Siegert, Andreas Wendemuth:
Improving Emotion Recognition Performance by Random-Forest-Based Feature Selection. 134-144 - Polina Eismont, Vladislav Metelyagin, Elena I. Riekhakaynen:
Coherence Understanding Through Cohesion Markers: The Case of Child Spoken Language. 145-154 - Dmitrii Fedotov, Heysem Kaya, Alexey Karpov:
Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup. 155-165 - Carlos Ferreira, Bruno Direito, Alexandre Sayal, Marco Simões, Inês Cadório, Paula Martins, Marisa Lousada, Daniela Figueiredo, Miguel Castelo-Branco, António J. S. Teixeira:
Functional Mapping of Inner Speech Areas: A Preliminary Study with Portuguese Speakers. 166-176 - Greg P. Finley, Erik Edwards, Wael Salloum, Amanda Robinson, Najmeh Sadoughi, Nico Axtmann, Maxim Korenevsky, Michael Brenndoerfer, Mark Miller, David Suendermann-Oeft:
Semi-Supervised Acoustic Model Retraining for Medical ASR. 177-187 - Jing Han, Maximilian Schmitt, Björn W. Schuller:
You Sound Like Your Counterpart: Interpersonal Speech Analysis. 188-197 - François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia A. Tomashenko, Yannick Estève:
TED-LIUM 3: Twice as Much Data and Corpus Repartition for Experiments on Speaker Adaptation. 198-208 - Miroslav Hlavác, Ivan Gruber, Milos Zelezný, Alexey Karpov:
LipsID Using 3D Convolutional Neural Networks. 209-214 - Rüdiger Hoffmann, Peter Birkholz, Falk Gabriel, Rainer Jäckel:
From Kratzenstein to the Soviet Vocoder: Some Results of a Historic Research Project in Speech Technology. 215-225 - Marek Hrúz, Miroslav Hlavác:
LSTM Neural Network for Speaker Change Detection in Telephone Conversations. 226-233 - Takuto Isoyama, Masashi Unoki:
Noise Suppression Method Based on Modulation Spectrum Analysis. 234-244 - Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Milos Zelezný:
Designing Advanced Geometric Features for Automatic Russian Visual Speech Recognition. 245-254 - Markéta Juzová:
On the Comparison of Different Phrase Boundary Detection Approaches Trained on Czech TTS Speech Corpora. 255-263 - Tatiana Kachkovskaia, Mayya Nurislamova:
Word-Initial Consonant Lengthening in Stressed and Unstressed Syllables in Russian. 264-273 - Arman Kaliyev, Sergey V. Rybin, Yuri N. Matveev:
Phoneme Duration Prediction for Kazakh Language. 274-280 - Stamatis Karlos, Konstantinos Kaleris, Nikos Fazakis, Vasileios G. Kanas, Sotiris Kotsiantis:
Optimized Active Learning Strategy for Audiovisual Speaker Recognition. 281-290 - Irina S. Kipyatkova:
Improving Russian LVCSR Using Deep Neural Networks for Acoustic and Language Modeling. 291-300 - Daniil Kocharov, Vera Evdokimova, Karina Evgrafova, Mariia Morskovatykh:
Labialization of Unstressed Vowels in Russian: Phonetic and Perceptual Evidence. 301-310 - Liubov Kovriguina, Ivan Shilin, Alina Putintseva, Alexander Shipilo:
Multilevel Annotation in the Corpus for Parsing Russian Spontaneous Speech. 311-320 - Anat Lerner, Oren Miara, Sarit Malayev, Vered Silber-Varod:
The Influence of the Interlocutor's Gender on the Speaker's Role Identification. 321-330 - Tatiana Litvinova, Pavel Seredin, Olga Litvinova, Tatiana Dankova, Olga Zagorovskaya:
On the Stability of Some Idiolectal Features. 331-336 - Boris Lobanov, Vladimir Zhitko, Vadim Zahariev:
A Prototype of the Software System for Study, Training and Analysis of Speech Intonation. 337-346 - Elena E. Lyakso, Olga V. Frolova:
Speech Interaction in "Mother-Child" Dyads with 4-7 Years Old Typically Developing Children and Children with Autism Spectrum Disorders. 347-356 - Elena E. Lyakso, Olga V. Frolova, Aleksey Grigorev, Viktor Gorodnyi, Aleksandr Nikolaev, Yuri N. Matveev:
Speech Features of Adults with Autism Spectrum Disorders and Mental Retardation. 357-366 - Thomas Manzini, Alan W. Black:
Towards Improving Intelligibility of Black-Box Speech Synthesizers in Noise. 367-376 - Nikita Markovnikov, Irina S. Kipyatkova, Elena E. Lyakso:
End-to-End Speech Recognition in Russian. 377-386 - Martin Matura, Markéta Juzová:
Correction of Formal Prosodic Structures in Czech Corpora Using Legendre Polynomials. 387-397 - Martin Matura, Markéta Juzová, Jindrich Matousek:
On the Contribution of Articulatory Features to Speech Synthesis. 398-407 - Martin Meszaros, Franziska Trojahn, Michael Maruschke, Oliver Jokisch:
QuARTCS: A Tool Enabling End-to-Any Speech Quality Assessment of WebRTC-Based Calls. 408-418 - Petr Mizera, Petr Pollák:
Automatic Phonetic Segmentation and Pronunciation Detection with Various Approaches of Acoustic Modeling. 419-429 - Eduardo Mizraji, Andrés Pomi, Juan Lin:
Improving Neural Models of Language with Input-Output Tensor Contexts. 430-440 - Anfisa Naumova:
Sociolinguistic Variability of Predicate Groups in Colloquial Russian Speech. 441-450 - Thai Son Nguyen, Matthias Sperber, Sebastian Stüker, Alex Waibel:
Building Real-Time Speech Recognition Without CMVN. 451-460 - Dariya Novokhrestova, Evgeny Kostyuchenko, Roman V. Meshcheryakov:
Choice of Signal Short-Term Energy Parameter for Assessing Speech Intelligibility in the Process of Speech Rehabilitation. 461-469 - Jaromír Novotný, Pavel Ircing:
The Benefit of Document Embedding in Unsupervised Document Classification. 470-478 - Siham Ouamour, Halim Sayoud:
A Comparative Survey of Authorship Attribution on Short Arabic Texts. 479-489 - Vedhas Pandit, Maximilian Schmitt, Nicholas Cummins, Franz Graf, Lucas Paletta, Björn W. Schuller:
How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers. 490-500 - Olga Perepelkina, Evdokia Kazimirova, Maria Konstantinova:
RAMAS: Russian Multimodal Corpus of Dyadic Interaction for Affective Computing. 501-510 - Gábor Pintér, Mira Schielke, Rico Petrick:
Investigating Word Segmentation Techniques for German Using Finite-State Transducers. 511-521 - Branislav M. Popovic, Edvin Pakoci, Darko Pekar:
A Comparison of Language Model Training Techniques in a Continuous Speech Recognition System for Serbian. 522-531 - Rodmonga Potapova, Liliya Komalova, Vsevolod Potapov:
Perceptual-Auditory Evaluation of the Aggressive Speech Behavior: Gender Aspect (on the Basis of Russian and Spanish Languages). 532-541 - Rodmonga Potapova, Vsevolod Potapov:
Main Determinants of the Acmeologic Personality Profiling. 542-551 - Eran Raveh, Ingmar Steiner, Iona Gessinger, Bernd Möbius:
Studying Mutual Phonetic Influence with a Web-Based Spoken Dialogue System. 552-562 - Najmeh Sadoughi, Greg P. Finley, Erik Edwards, Amanda Robinson, Maxim Korenevsky, Michael Brenndoerfer, Nico Axtmann, Mark Miller, David Suendermann-Oeft:
Detecting Section Boundaries in Medical Dictations: Toward Real-Time Conversion of Medical Dictations to Clinical Reports. 563-573 - Michelina Savino, Loredana Lapertosa, Mario Refice:
Seeing or Not Seeing Your Conversational Partner: The Influence of Interaction Modality on Prosodic Entrainment. 574-584 - Tina Schuh, Stephan Dreiseitl:
Evaluating Novel Features for Aggressive Language Detection. 585-595 - Tatiana Y. Sherstinova:
Quantitative Data on POS Distribution in the Beginnings and the Ends of Utterances in Everyday Russian Speech. 596-605 - Tatiana Shevchenko, Tatiana Sokoreva:
Corpus Data on Adult Life-Long Trajectory of Prosody Development in American English, with Special Reference to Middle Age. 606-614 - Nikolay Shilov, Alexey M. Kashevnik, Sergey Mikhailov:
Context-Aware Generation of Personalized Audio Tours: Approach and Evaluation. 615-624 - Ingo Siegert, Alicia Flores Lotz, Olga Egorow, Susann Wolff:
Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition. 625-635 - Vered Silber-Varod, Anat Lerner, Oliver Jokisch:
Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers' Role. 636-645 - Lubos Smídl, Jan Svec, Ales Prazák, Jan Trmal:
Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition. 646-655 - Anton Stepikhov, Anastassia Loukina:
Personality, Working Memory Capacity and Expert Manual Annotation of German Spontaneous Speech. 656-666 - Mikhail Stolbov, Marina Tatarnikova, Quan Trong The:
Using Dual-Element Microphone Arrays for Automatic Keyword Recognition. 667-675 - Daniel Tihelka, Zdenek Hanzlícek, Markéta Juzová, Jindrich Matousek:
First Steps Towards Hybrid Speech Synthesis in Czech TTS System ARTIC. 676-686 - Maxim Tkachenko, Alexander Yamshinin, Mikhail Kotov, Marina Nastasenko:
Lightweight Embeddings for Speaker Verification. 687-696 - László Tóth, György Kovács, Dirk Van Compernolle:
A Perceptually Inspired Data Augmentation Method for Noise Robust CNN Acoustic Models. 697-706 - Constanze Tschöpe, Frank Duckhorn, Markus Huber, Werner Meyer, Matthias Wolff:
A Cognitive User Interface for a Multi-modal Human-Machine Interaction. 707-717 - Amir Vaheb, Ali Janalizadeh Choobbasti, S. H. E. Mortazavi Najafabadi, Saeid Safavi:
Investigating Language Variability on the Performance of Speaker Verification Systems. 718-727 - Jan Vanek, Josef Michálek, Josef Psutka:
Recurrent DNNs and Its Ensembles on the TIMIT Phone Recognition Task. 728-736 - Alena Velichko, Viktor Budkov, Ildar Kagirov, Alexey A. Karpov:
Comparative Analysis of Classification Methods for Automatic Deception Detection in Speech. 737-746 - Jochen Weiner, Tanja Schultz:
Selecting Features for Automatic Screening for Dementia Based on Speech. 747-756 - Matthias Wolff, Günther Wirsching, Markus Huber, Peter beim Graben, Ronald Römer, Ingo Schmitt:
A Fock Space Toolbox and Some Applications in Computational Cognition. 757-767 - Olga Yakovenko, Ivan Bondarenko, Mariya Borovikova, Daniil Vodolazsky:
Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. 768-777 - Zbynek Zajíc, Lucie Zajícová, Josef V. Psutka, Petr Salajka, Jaromír Novotný, Ales Prazák, Ludek Müller:
First Insight into the Processing of the Language Consulting Center Data. 778-787