


default search action
IEEE Transactions on Audio, Speech & Language Processing, Volume 18
Volume 18, Number 1, January 2010
- Ali H. Sayed:
Free Electronic Access to SP Publications. 1 - Dmitry N. Zotkin, Ramani Duraiswami
, Nail A. Gumerov:
Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays. 2-16 - Ramdas Kumaresan, Nitesh Panchal:
Encoding Bandpass Signals Using Zero/Level Crossings: A Model-Based Approach. 17-33 - Péter Balázs
, Bernhard Laback
, Gerhard Eckel, Werner A. Deutsch:
Time-Frequency Sparsity by Removing Perceptually Irrelevant Components Using a Simple Model of Simultaneous Masking. 34-49 - Antti J. Eronen, Anssi Klapuri:
Music Tempo Estimation With k -NN Regression. 50-57 - Jean-Marc Valin, Timothy B. Terriberry, Christopher Montgomery, Gregory Maxwell:
A High-Quality Speech and Audio Codec With Less Than 10-ms Delay. 58-67 - Martin Raspaud
, Harald Viste, Gianpaolo Evangelista
:
Binaural Source Localization by Joint Estimation of ILD and ITD. 68-77 - Konrad Kowalczyk
, Maarten van Walstijn:
Wideband and Isotropic Room Acoustics Simulation Using 2-D Interpolated FDTD Schemes. 78-89 - Tiago H. Falk
, Wai-Yip Chan:
Modulation Spectral Features for Robust Far-Field Speaker Identification. 90-100 - Vaninirappuputhenpurayil Gopalan Reju
, Soo Ngee Koh, Ing Yann Soon:
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking. 101-116 - Ian McLoughlin
:
Vowel Intelligibility in Chinese. 117-125 - Bernd Matschkal, Johannes B. Huber:
Spherical Logarithmic Quantization. 126-140 - Shih-Sian Cheng, Hsin-Min Wang
, Hsin-Chia Fu:
BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization. 141-157 - Emanuël Anco Peter Habets
, Jacob Benesty
, Israel Cohen, Sharon Gannot
, Jacek Dmochowski:
New Insights Into the MVDR Beamformer in Room Acoustics. 158-170 - Tianyu T. Wang, Thomas F. Quatieri:
High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch. 171-186 - Feifan Liu, Yang Liu:
Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries. 187-196 - Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Efficient and Robust Music Identification With Weighted Finite-State Transducers. 197-207
Volume 18, Number 2, February 2010
- Hüseyin Hacihabiboglu
, Banu Gunel
, Zoran Cvetkovic:
Simulation of Directional Microphones in Digital Waveguide Mesh-Based Models of Room Acoustics. 213-223 - Claudius Gläser, Martin Heckmann
, Frank Joublin, Christian Goerick:
Combining Auditory Preprocessing and Bayesian Estimation for Robust Formant Tracking. 224-236 - Damián Marelli, Péter Balázs
:
On Pole-Zero Model Estimation Methods Minimizing a Logarithmic Criterion for Speech Analysis. 237-248 - Alfred Mertins, Tiemin Mei
, Markus Kallinger:
Room Impulse Response Shortening/Reshaping With Infinity- and p -Norm Optimization. 249-259 - Mehrez Souden, Jacob Benesty
, Sofiène Affes
:
On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction. 260-276 - Avram Levi, Harvey F. Silverman:
A Robust Method to Extract Talker Azimuth Orientation Using a Large-Aperture Microphone Array. 277-285 - Roberto Napoli, Luigi Piroddi
:
Nonlinear Active Noise Control With NARX Models. 286-295 - Luis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega
, Eduardo Lleida
:
Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition. 296-309 - Chao-Ling Hsu, Jyh-Shing Roger Jang:
On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset. 310-319 - Ümit Güz, Sébastien Cuendet, Dilek Hakkani-Tür, Gökhan Tür:
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech. 320-329 - Vinay Melkote, Kenneth Rose:
Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding. 330-341 - Bram Cornelis, Simon Doclo
, Tim Van den Bogaert, Marc Moonen, Jan Wouters
:
Theoretical Analysis of Binaural Multimicrophone Noise Reduction Techniques. 342-355 - Wen Jin, Xin Liu, Michael S. Scordilis, Lu Han:
Speech Enhancement Using Harmonic Emphasis and Adaptive Comb Filtering. 356-368 - Nathalie Camelin
, Frédéric Béchet, Géraldine Damnati, Renato de Mori:
Detection and Interpretation of Opinion Expressions in Spoken Surveys. 369-381 - Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis:
Model-Based Expectation-Maximization Source Separation and Localization. 382-394 - Shinji Watanabe
, Atsushi Nakamura:
Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale. 395-406 - Alexandros Nanopoulos, Dimitrios Rafailidis
, Panagiotis Symeonidis
, Yannis Manolopoulos:
MusicBox: Personalized Music Recommendation Based on Cubic Analysis of Social Tags. 407-412
Volume 18, Number 3, March 2010
- Bertrand David, Masataka Goto
, Laurent Daudet, Paris Smaragdis:
Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds. 417-419 - Vittoria Bruni, Silvia Marconi
, Domenico Vitulano:
Time-Scale Atoms Chains for Transients Detection in Audio Signals. 420-433 - Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Audio Signal Representations for Indexing in the Transform Domain. 434-446 - Nicolás Ruiz-Reyes
, Pedro Vera-Candeas
:
Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding. 447-460 - Bob L. Sturm, John J. Shynk:
Sparse Approximation and the Pursuit of Meaningful Signal Models With Interference Adaptation. 461-472 - Julio J. Carabias-Orti
, Pedro Vera-Candeas
, Francisco J. Cañadas-Quesada
, Nicolás Ruiz-Reyes
:
Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection. 473-486 - Johan Xi Zhang, Mads Græsbøll Christensen
, Søren Holdt Jensen, Marc Moonen:
A Robust and Computationally Efficient Subspace-Based Fundamental Frequency Estimator. 487-497 - Jeremy Wells, Damian T. Murphy:
A Comparative Evaluation of Techniques for Single-Frame Discrimination of Nonstationary Sinusoids. 498-508 - Mathieu Lagrange, Gary P. Scavone, Philippe Depalle:
Analysis/Synthesis of Sounds Generated by Sustained Contact Between Rigid Objects. 509-518 - Paul H. Peeling, Ali Taylan Cemgil
, Simon J. Godsill:
Generative Spectrogram Factorization Models for Polyphonic Piano Transcription. 519-527 - Emmanuel Vincent, Nancy Bertin, Roland Badeau:
Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation. 528-537 - Nancy Bertin, Roland Badeau, Emmanuel Vincent:
Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription. 538-549 - Alexey Ozerov, Cédric Févotte:
Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation. 550-563 - Jean-Louis Durrieu, Gaël Richard, Bertrand David, Cédric Févotte:
Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals. 564-575 - Yannis Panagakis
, Constantine Kotropoulos, Gonzalo R. Arce
:
Non-Negative Multilinear Principal Component Analysis of Auditory Temporal Modulations for Music Genre Classification. 576-588 - Onur Dikmen
, Ali Taylan Cemgil
:
Gamma Markov Random Fields for Audio Source Modeling. 589-601 - Luke Barrington, Antoni B. Chan
, Gert R. G. Lanckriet:
Modeling Music as a Dynamic Texture. 602-612 - Anssi Klapuri, Tuomas Virtanen
:
Representing Musical Sounds With an Interpolating State Model. 613-624 - Kris West, Stephen Cox:
Incorporating Cultural Representations of Features Into Audio Music Similarity Estimation. 625-637 - Hiromasa Fujihara, Masataka Goto
, Tetsuro Kitahara, Hiroshi G. Okuno
:
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval. 638-648 - Meinard Müller
, Sebastian Ewert
:
Towards Timbre-Invariant Audio Features for Harmony-Based Music. 649-662 - Juan José Burred, Axel Röbel, Thomas Sikora:
Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds. 663-674 - Geoffroy Peeters, Emmanuel Deruty:
Sound Indexing Using Morphological Description. 675-687 - Gordon Wichern, Jiachen Xue, Harvey D. Thornburg, Brandon Mechtley, Andreas Spanias:
Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds. 688-707
Volume 18, Number 4, May 2010
- Vesa Välimäki
, Federico Fontana
, Julius O. Smith III
, Udo Zölzer:
Introduction to the Special Issue on Virtual Analog Audio Effects and Musical Instruments. 713-714 - Giovanni De Sanctis, Augusto Sarti:
Virtual Analog Modeling in the Wave-Digital Domain. 715-727 - David T. Yeh, Jonathan S. Abel, Julius O. Smith III
:
Automated Physical Modeling of Nonlinear Audio Circuits For Real-Time Audio Effects - Part I: Theoretical Development. 728-737 - Jyri Pakarinen, Matti Karjalainen:
Enhanced Wave Digital Triode Model for Real-Time Tube Amplifier Emulation. 738-746 - Thomas Hélie:
Volterra Series and State Transformation for Real-Time Simulations of Audio Circuits Including Saturations: Application to the Moog Ladder Filter. 747-759 - Federico Fontana
, Marco Civolani:
Modeling of the EMS VCS3 Voltage-Controlled Filter as a Nonlinear Filter Network. 760-772 - Juhan Nam
, Vesa Välimäki
, Jonathan S. Abel, Julius O. Smith III
:
Efficient Antialiasing Oscillator Algorithms Using Low-Order Fractional Delay Filters. 773-785 - Vesa Välimäki
, Juhan Nam
, Julius O. Smith III
, Jonathan S. Abel:
Alias-Suppressed Oscillators Based on Differentiated Polynomial Waveforms. 786-798 - Stefan Bilbao, Julian Parker:
A Virtual Model of Spring Reverberation. 799-808 - Balázs Bank
, Stefano Zambon, Federico Fontana
:
A Modal-Based Real-Time Piano Synthesizer. 809-821 - Gianpaolo Evangelista
, Fredrik Eckerholm:
Player-Instrument Interaction Models for Digital Waveguide Synthesis of Guitar: Touch and Collisions. 822-832 - Nelson Lee, Julius O. Smith III
, Vesa Välimäki
:
Analysis and Synthesis of Coupled Vibrating Strings Using a Hybrid Modal-Waveguide Synthesis Model. 833-842 - Rémi Mignot, Thomas Hélie, Denis Matignon:
Digital Waveguide Modeling for Wind Instruments: Building a State-Space Representation Based on the Webster-Lokshin Model. 843-854 - Esteban Maestre
, Merlijn Blaauw
, Jordi Bonada
, Enric Guaus
, Alfonso Pérez
:
Statistical Modeling of Bowing Control Applied to Violin Sound Synthesis. 855-871 - Stefan Bilbao:
Percussion Synthesis Based on Models of Nonlinear Shell Vibration. 872-880 - Rudolf Rabenstein, Tilman Koch, Christian Popp:
Tubular Bells: A Physical and Algorithmic Model. 881-890 - Federico Avanzini
, Riccardo Marogna:
A Modular Physically Based Approach to the Sound Synthesis of Membrane Percussion Instruments. 891-902
Volume 18, Number 5, July 2010
- Yannis Stylianou, Tomoki Toda
, Chung-Hsien Wu
, Alexander Kain, Olivier Rosec:
Introduction to the Special Section on Voice Transformation. 909-911 - Elina Helander
, Tuomas Virtanen
, Jani Nurminen, Moncef Gabbouj
:
Voice Conversion Using Partial Least Squares Regression. 912-921 - Daniel Erro, Asunción Moreno, Antonio Bonafonte
:
Voice Conversion Based on Weighted Frequency Warping. 922-931 - Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang:
Supervisory Data Alignment for Text-Independent Voice Conversion. 932-943 - Daniel Erro, Asunción Moreno, Antonio Bonafonte
:
INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora. 944-953 - Srinivas Desai, Alan W. Black, B. Yegnanarayana, Kishore Prahallad:
Spectral Mapping Using Artificial Neural Networks for Voice Conversion. 954-964 - Oytun Türk, Marc Schröder:
Evaluation of Expressive Speech Synthesis With Voice Conversion and Copy Resynthesis Techniques. 965-973 - Daniel Erro, Eva Navas
, Inmaculada Hernáez
, Ibon Saratxaga
:
Emotion Conversion Based on Prosodic Unit Selection. 974-983 - Junichi Yamagishi, Bela Usabaev, Simon King
, Oliver Watts, John Dines, Jilei Tian, Yong Guan, Rile Hu, Keiichiro Oura, Yi-Jian Wu, Keiichi Tokuda, Reima Karhila, Mikko Kurimo:
Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora. 984-1004 - Oliver Watts, Junichi Yamagishi, Simon King
, Kay Berkling:
Synthesis of Child Speech With HMM Adaptation and Voice Conversion. 1005-1016 - Purvis Bedenbaugh, Diana K. Sarko, Heidi L. Roth, Eugene M. Martin:
Prosody-Preserving Voice Transformation to Evaluate Brain Representations of Speech Sounds. 1017-1029 - Daniel Felps, Ricardo Gutierrez-Osuna
:
Developing Objective Measures of Foreign-Accent Conversion. 1030-1040 - Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán
, Claudio Garretón, Jorge Wuth:
Maximum Entropy-Based Reinforcement Learning Using a Confidence Measure in Speech Recognition for Telephone Speech. 1041-1052 - Xugang Lu, Jianwu Dang:
Vowel Production Manifold: Intrinsic Factor Analysis of Vowel Articulation. 1053-1062 - S. Abdallah:
Comment on "Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection". 1063-1065 - Cong-Thanh Do, Dominique Pastor
, André Goalic:
On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR. 1065-1068 - Parham Mokhtari, Hironori Takemoto, Ryouichi Nishimura, Hiroaki Kato:
Optimum Loss Factor for a Perfectly Matched Layer in Finite-Difference Time-Domain Acoustic Simulation. 1068-1071 - Mehrez Souden, Jingdong Chen, Jacob Benesty
, Sofiène Affes
:
Gaussian Model-Based Multichannel Speech Presence Probability. 1072-1077 - Stas Tiomkin, David Malah
, Slava Shechtman:
Statistical Text-to-Speech Synthesis Based on Segment-Wise Representation With a Norm Constraint. 1077-1082 - Claudio Garretón, Néstor Becerra Yoma, Matias Torres:
Channel Robust Feature Transformation Based on Filter-Bank Energy Filtering. 1082-1086
Volume 18, Number 6, August 2010
- Hamed Ketabdar, Hervé Bourlard:
Enhanced Phone Posteriors for Improving Speech Recognition Systems. 1094-1106 - Sergio Canazza, Giovanni De Poli
, Gian Antonio Mian:
Restoration of Audio Documents by Means of Extended Kalman Filter. 1107-1115 - Chunghsin Yeh, Axel Röbel, Xavier Rodet:
Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals. 1116-1126 - Jiucang Hao, Te-Won Lee, Terrence J. Sejnowski:
Speech Enhancement Using Gaussian Scale Mixture Models. 1127-1136 - Romain Serizel, Marc Moonen, Jan Wouters
, Søren Holdt Jensen:
Integrated Active Noise Control and Noise Reduction in Hearing Aids. 1137-1146 - Justin Jian Zhang, Ricky Ho Yin Chan, Pascale Fung:
Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling. 1147-1157 - Xiong Xiao, Jinyu Li
, Engsiong Chng
, Haizhou Li
, Chin-Hui Lee:
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition. 1158-1169 - Chung-Hsien Wu
, Chao-Hong Liu
, Matthew Harris, Liang-Chih Yu:
Sentence Correction Incorporating Relative Position and Parse Template Language Models. 1170-1181 - Robbie Vogt, Sridha Sridharan, Michael Mason
:
Making Confident Speaker Verification Decisions With Minimal Speech. 1182-1192 - Dimitri Nion, Kleanthis N. Mokios, Nicholas D. Sidiropoulos
, Alexandros Potamianos:
Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures. 1193-1207 - Vaclav Eksler, Milan Jelinek:
Glottal-Shape Codebook to Improve Robustness of CELP Codecs. 1208-1217 - Moo Young Kim, W. Bastiaan Kleijn
:
Reduction of the Impact of Distortion Outliers and Source Mismatch in Resolution-Constrained Quantization. 1218-1227 - Maurice F. Fallon, Simon J. Godsill:
Acoustic Source Localization and Tracking Using Track Before Detect. 1228-1242 - Xiaoqiang Xiao, Robert M. Nickel
:
Speech Enhancement With Inventory Style Speech Resynthesis. 1243-1257 - Angel M. Gomez
, José L. Carmona, Antonio M. Peinado
, Victoria E. Sánchez:
A Multipulse-Based Forward Error Correction Technique for Robust CELP-Coded Speech Transmission Over Erasure Channels. 1258-1268 - Matt Gibson, Thomas Hain
:
Error Approximation and Minimum Phone Error Acoustic Model Estimation. 1269-1279 - Matthias Mauch, Simon Dixon:
Simultaneous Estimation of Chords and Musical Context From Audio. 1280-1289 - Jingen Ni, Feng Li:
A Variable Step-Size Matrix Normalized Subband Adaptive Filter. 1290-1299 - Chang Huai You, Kong-Aik Lee
, Haizhou Li
:
GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition. 1300-1312 - Ilknur Durgar El-Kahlout, Kemal Oflazer
:
Exploiting Morphology and Local Word Reordering in English-to-Turkish Phrase-Based Statistical Machine Translation. 1313-1322 - Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthew Y. Ma:
Active Learning With Sampling by Uncertainty and Density for Data Annotations. 1323-1331 - Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang:
Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information. 1332-1340 - José L. Carmona, Antonio M. Peinado
, José L. Pérez-Córdoba
, Angel M. Gomez
:
MMSE-Based Packet Loss Concealment for CELP-Coded Speech Recognition. 1341-1353 - Kentaro Ishizuka, Shoko Araki
, Tatsuya Kawahara
:
Speech Activity Detection for Multi-Party Conversation Analyses Based on Likelihood Ratio Test on Spatial Magnitude. 1354-1365 - Marc Ferras, Cheung-Chi Leung, Claude Barras, Jean-Luc Gauvain:
Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition. 1366-1378 - Hynek Boril, John H. L. Hansen:
Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments. 1379-1393 - Chung-Hsien Wu
, Chi-Chun Hsia, Chung-Han Lee, Mai-Chun Lin:
Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Speech Synthesis. 1394-1405 - Keansub Lee, Daniel P. W. Ellis:
Audio-Based Semantic Concept Classification for Consumer Video. 1406-1416 - Charturong Tantibundhit, Franz Pernkopf
, Gernot Kubin:
Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement. 1417-1428 - Eric A. Lehmann
, Anders M. Johansson:
Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses. 1429-1439 - Øystein Birkenes, Tomoko Matsui, Kunio Tanabe, Sabato Marco Siniscalchi
, Tor André Myrvoll, Magne Hallstein Johnsen:
Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition. 1440-1454 - Jerome R. Bellegarda:
A Dynamic Cost Weighting Framework for Unit Selection Text-to-Speech Synthesis. 1455-1463 - Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier:
A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor. 1464-1475 - Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino
:
Blind Source Separation With Parameter-Free Adaptive Step-Size Method for Robot Audition. 1476-1485 - Murat Akbacak, John H. L. Hansen:
Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations. 1486-1495 - Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan:
Data-Driven Background Dataset Selection for SVM-Based Speaker Verification. 1496-1506 - Hirokazu Kameoka, Nobutaka Ono
, Shigeki Sagayama:
Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency. 1507-1516 - Andre Holzapfel, Yannis Stylianou, Ali Cenk Gedik
, Baris Bozkurt
:
Three Dimensions of Pitched Instrument Onset Detection. 1517-1527 - Panikos Heracleous, V.-A. Tran, Takayuki Nagai, Kiyohiro Shikano:
Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information. 1528-1538 - Yuya Akita, Tatsuya Kawahara
:
Statistical Transformation of Language and Pronunciation Models for Spontaneous Speech Recognition. 1539-1549 - Charles Verron, Mitsuko Aramaki, Richard Kronland-Martinet
, Grégory Pallone:
A 3-D Immersive Synthesizer for Environmental Sounds. 1550-1561 - Yi-Cheng Pan, Lin-Shan Lee:
Performance Analysis for Lattice-Based Speech Indexing Approaches Using Words and Subword Units. 1562-1574 - Mehrez Souden, Jacob Benesty
, Sofiène Affes
:
Broadband Source Localization From an Eigenanalysis Perspective. 1575-1587 - Péter Mihajlik
, Zoltán Tüske, Balázs Tarján
, Bottyán Németh, Tibor Fegyó:
Improved Recognition of Spontaneous Hungarian Speech - Morphological and Acoustic Modeling Techniques for a Less Resourced Task. 1588-1600 - Gökhan Tür, Andreas Stolcke, L. Lynn Voss, Stanley Peters, Dilek Hakkani-Tür, John Dowding, Benoît Favre, Raquel Fernández, Matthew Frampton, Michael W. Frandsen, Clint Frederickson, Martin Graciarena, Donald Kintzing, Kyle Leveque, Shane Mason, John Niekrasz, Matthew Purver
, Korbinian Riedhammer
, Elizabeth Shriberg, Jing Tien, Dimitra Vergyri, Fan Yang:
The CALO Meeting Assistant System. 1601-1611 - Bengt J. Borgstrom, Abeer Alwan:
HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition. 1612-1623 - Sriram Ganapathy, Petr Motlícek
, Hynek Hermansky
:
Autoregressive Models of Amplitude Modulations in Audio Compression. 1624-1631 - Hyeon-Jin Jeon, Tae-Gyu Chang, Sen M. Kuo:
Analysis of Frequency Mismatch in Narrowband Active Noise Control. 1632-1642 - Valentin Emiya
, Roland Badeau, Bertrand David:
Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle. 1643-1654 - Thushara D. Abhayapala
, Aastha Gupta:
Spherical Harmonic Analysis of Wavefields Using Multiple Circular Sensor Arrays. 1655-1666
Volume 18, Number 7, September 2010
- Tomohiro Nakatani, Walter Kellermann, Patrick A. Naylor
, Masato Miyoshi, Biing-Hwang Juang:
Introduction to the Special Issue on Processing Reverberant Speech: Methodologies and Applications. 1673-1675 - Armin Sehr, Roland Maas, Walter Kellermann:
Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition. 1676-1691 - Alexander Krueger, Reinhold Haeb-Umbach
:
Model-Based Feature Enhancement for Reverberant Speech Recognition. 1692-1707 - Randy Gomez, Tatsuya Kawahara
:
Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood. 1708-1716 - Tomohiro Nakatani
, Takuya Yoshioka, Keisuke Kinoshita
, Masato Miyoshi, Biing-Hwang Juang:
Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction. 1717-1731 - Marco Jeub, Magnus Schäfer, Thomas Esch, Peter Vary:
Model-Based Dereverberation Preserving Binaural Cues. 1732-1745 - Jan S. Erkelens, Richard Heusdens:
Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments. 1746-1765 - Tiago H. Falk
, Chenxi Zheng, Wai-Yip Chan:
A Non-Intrusive Quality and Intelligibility Measure of Reverberant and Dereverberated Speech. 1766-1774 - Takayuki Arai, Nao Hodoshima
, Keiichi Yasu:
Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners. 1775-1780 - Flavio P. Ribeiro, Cha Zhang, Dinei A. F. Florêncio, Demba E. Ba:
Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization. 1781-1792 - Yan-Chen Lu, Martin Cooke:
Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources. 1793-1805 - Fotios Talantzis:
An Acoustic Source Localization and Tracking Framework Using Particle Filtering and Information Theory. 1806-1817 - Matthieu Kowalski, Emmanuel Vincent, Rémi Gribonval:
Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation. 1818-1829 - Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval:
Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model. 1830-1840 - Alireza Masnadi-Shirazi, Wenyi Zhang, Bhaskar D. Rao:
Glimpsing IVA: A Framework for Overcomplete/Complete/Undercomplete Convolutive Source Separation. 1841-1855 - John Woodruff, DeLiang Wang:
Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization. 1856-1866 - Chris Hummersone, Russell Mason
, Tim Brookes
:
Dynamic Precedence Effect Modeling for Source Separation in Reverberant Environments. 1867-1871 - Michael I. Mandel, Scott Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis:
Evaluating Source Separation Algorithms With Reverberant Speech. 1872-1883
Volume 18, Number 8, November 2010
- Ozlem Kalinli, Michael L. Seltzer, Jasha Droppo
, Alex Acero:
Noise Adaptive Training for Robust Automatic Speech Recognition. 1889-1901 - Georgios N. Lilis, Daniele Angelosante, Georgios B. Giannakis
:
Sound Field Reproduction using the Lasso. 1902-1912 - Wenyi Zhang, Bhaskar D. Rao:
A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources. 1913-1928 - Damián Marelli, Mitsuko Aramaki, Richard Kronland-Martinet
, Charles Verron:
Time-Frequency Synthesis of Noisy Sounds With Narrow Spectral Components. 1929-1940 - Songfang Huang, Steve Renals
:
Hierarchical Bayesian Language Models for Conversational Speech Recognition. 1941-1954 - Emmanouil Benetos
, Constantine Kotropoulos
:
Non-Negative Tensor Factorization Applied to Music Genre Classification. 1955-1967 - Emmanouil Benetos
, Yannis Stylianou:
Auditory Spectrum-Based Pitched Instrument Onset Detection. 1968-1977 - Jian Liu, Yegui Xiao, Jinwei Sun, Li Xu:
Analysis of Online Secondary-Path Modeling With Auxiliary Noise Scaled by Residual Noise Signal. 1978-1993 - Chi-Chun Hsia, Chung-Hsien Wu
, Jung-Yun Wu:
Exploiting Prosody Hierarchy and Dynamic Features for Pitch Modeling and Generation in HMM-Based Speech Synthesis. 1994-2003 - Huijun Ding, Ing Yann Soon, Chai Kiat Yeo
:
Over-Attenuated Components Regeneration for Speech Enhancement. 2004-2014 - Aarthi M. Reddy, Richard C. Rose:
Integration of Statistical Models for Dictation of Document Translations in a Machine-Aided Human Translation Task. 2015-2027 - David Imseng, Gerald Friedland:
Tuning-Robust Initialization Methods for Speaker Diarization. 2028-2037 - Jens Ahrens
, Sascha Spors
:
Sound Field Reproduction Using Planar and Linear Arrays of Loudspeakers. 2038-2050 - Gregory Sell, Malcolm Slaney
:
Solving Demodulation as an Optimization Problem. 2051-2066 - Guoning Hu, DeLiang L. Wang:
A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation. 2067-2079 - Gibak Kim, Philipos C. Loizou:
Improving Speech Intelligibility in Noise Using Environment-Optimized Algorithms. 2080-2090 - Maor Kleider, Boaz Rafaely
, Barak Weiss, Eitan Bachmat
:
Golden-Ratio Sampling for Scanning Circular Microphone Arrays. 2091-2098 - Seokhwan Jo, Chang D. Yoo:
Psychoacoustically Constrained and Distortion Minimized Speech Enhancement. 2099-2110 - Wooil Kim, John H. L. Hansen:
Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions. 2111-2120 - Zhiyao Duan
, Bryan Pardo, Changshui Zhang:
Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions. 2121-2133 - Nikoletta Bassiou, Vassiliki Moschou, Constantine Kotropoulos
:
Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles. 2134-2144 - Vishweshwara Rao, Preeti Rao:
Vocal Melody Extraction in the Presence of Pitched Accompaniment in Polyphonic Music. 2145-2154 - Brady Laska
, Miodrag Bolic, Rafik A. Goubran:
Particle Filter Enhancement of Speech Spectral Amplitudes. 2155-2167

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.