


23rd SPECOM 2021: St. Petersburg, Russia
- Alexey Karpov, Rodmonga Potapova: Speech and Computer - 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021, Proceedings. Lecture Notes in Computer Science 12997, Springer 2021, ISBN 978-3-030-87801-6
- Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang: Text-Independent Speaker Verification Employing CNN-LSTM-TDNN Hybrid Networks. 1-13
- Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang: End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics. 14-25
- Nuno Almeida, Conceição Cunha, Samuel S. Silva, António Teixeira: Assessing Velar Gestures Timing in European Portuguese Nasal Vowels with RT-MRI Data. 26-35
- Nuno Almeida, Diogo Cunha, Samuel S. Silva, António Teixeira: Designing and Deploying an Interaction Modality for Articulatory-Based Audiovisual Speech Synthesis. 36-49
- Arash Amani, Mohammad MohammadAmini, Hadi Veisi: Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding. 50-57
- Yu Bai, Cristian Tejedor García, Ferdy Hubers, Catia Cucchiarini, Helmer Strik: An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders. 58-69
- Peter Birkholz, Christian Kleiner: Velocity Differences Between Velum Raising and Lowering Movements. 70-80
- Natalia Bogdanova-Beglarian, Olga Blinova, Tatiana Y. Sherstinova, Tatiana Sulimova: Pragmatic Markers of Russian Everyday Speech: Invariants in Dialogue and Monologue. 81-90
- Vincent Brignatz, Jarod Duret, Driss Matrouf, Mickael Rouvier: Language Adaptation for Speaker Recognition Systems Using Contrastive Learning. 91-99
- Pierre Champion, Denis Jouvet, Anthony Larcher: Evaluating X-Vector-Based Speaker Anonymization Under White-Box Assessment. 100-111
- Myrsini Christidou, Alexandra Vioni, Nikolaos Ellinas, Georgios Vamvoukakis, Konstantinos Markopoulos, Panos Kakoulidis, June Sig Sung, Hyoungmin Park, Aimilios Chalamandaris, Pirros Tsiakoulis: Improved Prosodic Clustering for Multispeaker and Speaker-Independent Phoneme-Level Prosody Control. 112-123
- Adam Chýlek, Jan Svec, Lubos Smídl: Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives. 124-133
- Debadatta Dash, Paul Ferrari, Karinne Berstis, Jun Wang: Imagined, Intended, and Spoken Speech Envelope Synthesis from Neuromagnetic Signals. 134-145
- Maria Dayter, Elena I. Riekhakaynen: What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms. 146-156
- Mikhail Dolgushin, Dayana Ismakova, Yuliya Bidulya, Igor Krupkin, Galina Barskaya, Anastasiya Lesiv: Toxic Comment Classification Service in Social Network. 157-165
- Denis Dresvyanskiy, Wolfgang Minker, Alexey Karpov: Deep Learning Based Engagement Recognition in Highly Imbalanced Data. 166-178
- Anna Dunashova: Intraspeaker Variability of a Professional Lecturer: Ageing, Genre, Pragmatics vs. Voice Acting (Case Study). 179-189
- Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang: An Ensemble Approach for the Diagnosis of COVID-19 from Speech and Cough Sounds. 190-201
- Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian, Yannick Estève: Where Are We in Semantic Concept Extraction for Spoken Language Understanding? 202-213
- Parismita Gogoi, Sishir Kalita, Wendy Lalhminghlui, Priyankoo Sarmah, S. R. M. Prasanna: Learning Mizo Tones from F0 Contours Using 1D-CNN. 214-225
- Ivan Gruber, Marek Hrúz, Pavel Ircing, Petr Neduchal, Tomás Zítka, Miroslav Hlavác, Zbynek Zajíc, Jan Svec, Martin Bulín: OCR Improvements for Images of Multi-page Historical Documents. 226-237
- Ivan Gruber, Marek Hrúz, Milos Zelezný, Alexey Karpov: X-Bridge: Image-to-Image Translation with Reconstruction Capabilities. 238-249
- Hien Thi Ha, Ales Horák: Who is Selling to Whom - Feature Evaluation for Multi-block Classification in Invoice Information Extraction. 250-261
- Abner Hernandez, Seung Hee Yang: Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning. 262-270
- Juan Hussain, Christian Huber, Sebastian Stüker, Alexander Waibel: Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition. 271-278
- Anosha Ignatius, Uthayasanker Thayasivam: Speaker-Invariant Speech-to-Intent Classification for Low-Resource Languages. 279-290
- Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Alexey M. Kashevnik: Speaker-Dependent Visual Command Recognition in Vehicle Cabin: Methodology and Evaluation. 291-302
- Joshua Jansen van Vueren, Thomas Niesler: Optimised Code-Switched Language Model Data Augmentation in Four Under-Resourced South African Languages. 303-316
- Virender Kadyan, Hemant Kumar Kathania, Prajjval Govil, Mikko Kurimo: Synthesis Speech Based Data Augmentation for Low Resource Children ASR. 317-326
- Irina S. Kipyatkova: End-to-End Russian Speech Recognition Models with Multi-head Attention. 327-335
- Konstantinos Klapsas, Nikolaos Ellinas, June Sig Sung, Hyoungmin Park, Spyros Raptis: Word-Level Style Control for Expressive, Non-attentive Speech Synthesis. 336-347
- Liliya Komalova, Diana Kulagina: Perceiving Speech Aggression with and without Textual Context on Twitter Social Network Site. 348-359
- Roman Korostik, Javier Latorre, Sivanand Achanta, Yannis Stylianou: Assessing Speaker Interpolation in Neural Text-to-Speech. 360-371
- Denis Likhachov, Maxim Vashkevich, Elias Azarov, Katsiaryna Malhina, Yuliya Rushkevich: A Mobile Application for Detection of Amyotrophic Lateral Sclerosis via Voice Analysis. 372-383
- Elena E. Lyakso, Olga V. Frolova, Nersisson Ruban, A. Mary Mekala: Child's Emotional Speech Classification by Human Across Two Languages: Russian & Tamil. 384-396
- Olesia Makhnytkina, Aleksey Grigorev, Aleksander Nikolaev: Analysis of Dialogues of Typically Developing Children, Children with Down Syndrome and ASD Using Machine Learning Methods. 397-406
- Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó: Speaker Adaptation with Continuous Vocoder-Based DNN-TTS. 407-416
- Yuri Matveev, Anton Matveev, Olga V. Frolova, Elena E. Lyakso: Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development. 417-425
- Salima Mdhaffar, Marc Tommasi, Yannick Estève: Study on Acoustic Model Personalization in a Context of Collaborative Learning Constrained by Privacy Preservation. 426-436
- Muhammadjon Musaev, Saida Mussakhojayeva, Ilyos Khujayorov, Yerbolat Khassanov, Mannon Ochilov, Huseyin Atakan Varol: USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments. 437-447
- Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol: A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English. 448-459
- Sergis Nicolaou, Lambros Mavrides, Georgina Tryfou, Kyriakos Tolias, Konstantinos P. Panousis, Sotirios Chatzis, Sergios Theodoridis: Dialog Speech Sentiment Classification for Imbalanced Datasets. 460-471
- Tijana V. Nosek, Sinisa Suzic, Mia Vujovic, Darko Pekar, Milan Secujski, Vlado Delic: Explicit Control of the Level of Expressiveness in DNN-Based Speech Synthesis by Embedding Interpolation. 472-482
- Dariya Novokhrestova, Evgeny Kostuchenko, Ilya A. Hodashinsky, Lidiya N. Balatskaya: Experimental Analysis of Expert and Quantitative Estimates of Syllable Recordings in the Process of Speech Rehabilitation. 483-491
- Edvin Pakoci, Branislav M. Popovic: Methods for Using Class Based N-gram Language Models in the Kaldi Toolkit. 492-503
- Ankur T. Patil, Harsh Kotta, Rajul Acharya, Hemant A. Patil: Spectral Root Features for Replay Spoof Detection in Voice Assistants. 504-515
- Rodmonga Potapova, Tatyana Agibalova, Vsevolod Potapov, Olga Tuchina: Influence of the Aggressive Internet Environment on Cognitive Personality Disorders (in Relation to the Russian Young Generation of Users). 516-527
- Rodmonga Potapova, Vsevolod Potapov, Nataliya Lebedeva, Ekaterina Karimova, Nikolay Bobrov: Media Content vs Nature Stimuli Influence on Human Brain Activity. 528-539
- Valeriya Prokaeva, Elena I. Riekhakaynen, Vladislav I. Zubov: Can Your Eyes Tell Us Why You Hesitate? Comparing Reading Aloud in Russian as L1 and Japanese as L2. 540-552
- Josef V. Psutka, Ales Prazák, Jan Vanek: Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures. 553-564
- Mathias Quillot, Richard Dufour, Jean-François Bonastre: Assessing Speaker-Independent Character Information for Acted Voices. 565-576
- Mathias Quillot, Jarod Duret, Richard Dufour, Mickael Rouvier, Jean-François Bonastre: Influence of Speaker Pre-training on Character Voice Representation. 577-588
- Ilyos Rabbimov, Sami Kobilov, Iosif Mporas: Opinion Classification via Word and Emoji Embedding Models with LSTM. 589-601
- Aku Rouhe, Astrid Van Camp, Mittul Singh, Hugo Van hamme, Mikko Kurimo: An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR. 602-613
- Lyudmila V. Savchenko, Andrey V. Savchenko: Speaker-Aware Training of Speech Emotion Classifier with Speaker Recognition. 614-625
- Andrey V. Savinkov, Vladimir V. Bochkarev, Anna V. Shevlyakova, Stanislav Khristoforov: Neural Network Recognition of Russian Noun and Adjective Cases in the Google Books Ngram Corpus. 626-637
- Vered Silber-Varod, Mária Gósy, Anat Lerner: Is It a Filler or a Pause? A Quantitative Analysis of Filled Pauses in Hebrew. 638-648
- Shrishti Singh, Kuldeep Khoria, Hemant A. Patil: Modified Group Delay Function Using Different Spectral Smoothing Techniques for Voice Liveness Detection. 649-659
- Tatiana Sokoreva, Tatiana Shevchenko, Mariya Chyrvonaya: Complex Rhythm Adjustments in Multilingual Code-Switching Across Mandarin, English and Russian. 660-669
- Mohammad Soleymanpour, Michael T. Johnson, Jeffrey Berry: Increasing the Precision of Dysarthric Speech Intelligibility and Severity Level Estimate. 670-679
- Lauri Tavi, Tomi Kinnunen, Einar Meister, Rosa González Hautamäki, Anton Malmi: Articulation During Voice Disguise: A Pilot Study. 680-691
- Elena Timofeeva, Elena Evseeva, Valeriia Zaluskaia, Vlada Kapranova, Sergei Astapov, Vladimir Kabarov: Improvement of Speaker Number Estimation by Applying an Overlapped Speech Detector. 692-703
- Paras Tiwari, Sawan Rai: Mind Your Tweet: Abusive Tweet Detection. 704-715
- Marián Trnka, Sakhia Darjaa, Milan Rusko, Meilin Schaper, Tim H. Stelkens-Kobsch: Speaker Authorization for Air Traffic Control Security. 716-725
- Ana Rita Valente, Catarina Oliveira, Luciana Albuquerque, António Teixeira, Plínio A. Barbosa: Prosodic Changes with Age: A Longitudinal Study on a Famous European Portuguese Native Speaker. 726-736
- Loes van Bemmel, Wieke Harmsen, Catia Cucchiarini, Helmer Strik: Automatic Selection of the Most Characterizing Features for Detecting COPD in Speech. 737-748
- Ewald van der Westhuizen, Trideba Padhi, Thomas Niesler: Multilingual Training Set Selection for ASR in Under-Resourced Malian Languages. 749-760
- Jan Volín, Markéta Rezácková, Jindrich Matousek: Human and Transformer-Based Prosodic Phrasing in Two Speech Genres. 761-772
- Roman Vygon, Nikolay Mikhaylovskiy: Learning Efficient Representations for Keyword Spotting with Triplet Loss. 773-785
- Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll: Regularized Forward-Backward Decoder for Attention Models. 786-794
- Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll: Induced Local Attention for Transformer Models in Speech Recognition. 795-806
- Zbynek Zajíc, Marie Kunesová, Ludek Müller: Applying EEND Diarization to Telephone Recordings from a Call Center. 807-817
- Svetlana Zimina, Vera Evdokimova: Acoustic Characteristics of Speech Entrainment in Dialogues in Similar Phonetic Sequences. 818-825
- Ismail Rasim Ülgen, Mustafa Erden, Levent M. Arslan: Predicting Biometric Error Behaviour from Speaker Embeddings and a Fast Score Normalization Scheme. 826-836
