Ralf Schlüter

Publications of Ralf Schlüter (inverse chronological order)

Printable version of list of publications (PDF)

Interspeech 2019 Survey: Modeling in Automatic Speech Recognition: Beyond Hidden Markov Models (video, slides)

End-to-End ASR Survey (2023)

Link to Google Scholar

2025

B. Hilmes, N. Rossenbach, and R. Schlüter. Analyzing the Importance of Blank for CTC-Based Knowledge Distillation. In Interspeech, Rotterdam, The Netherlands, August 2025.
P. Vieting, M. Kannen, B. Hilmes, R. Schlüter, and H. Ney. Regularizing Learnable Feature Extraction for Automatic Speech Recognition. In Interspeech, Rotterdam, The Netherlands, August 2025. To appear. Preprint arXiv:2506.09804.
Z. Yang, M. Phan, R. Schlüter, and H. Ney. Label-Context-Dependent Internal Language Model Estimation for CTC. In Interspeech, Rotterdam, The Netherlands, August 2025. To appear. Preprint arXiv:2506.06096.
T. Raissi, R. Schlüter, and H. Ney. Right Label Context in End-to-End Training of Time-Synchronous ASR Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, April 2025. Preprint arXiv:2501.04521.
R. Schmitt, A. Zeyer, M. Zeineldeen, R. Schlüter, and H. Ney. The Conformer Encoder May Reverse the Time Dimension. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, April 2025. Preprint arXiv:2501.04521.
J. Xu, E. Beck, Z. Yang, and R. Schlueter. Efficient Supernet Training with Orthogonal Softmax for Scalable ASR Model Compression. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, April 2025. Preprint arXiv:2501.18895.
Z. Yang, V. Eminyan, R. Schlüter, and H. Ney. Classification Error Bound for Low Bayes Error Conditions in Machine Learning. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, April 2025. Preprint arXiv:2501.15977.
K. Le-Duc, D. Thulke, H. Tran, L. Vo-Dang, K. Nguyen, T. S. Hy, and R. Schlüter. Medical Spoken Named Entity Recognition. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track) (NAACL 2025 Industry Track), pages 724-783, Albuquerque, New Mexico, April 2025.

2024

P. Vieting, S. Berger, T. v. Neumann, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach. Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. In IEEE Spoken Language Technology Workshop (SLT), Macao, China, December 2024. Preprint arXiv:2309.08454.
Z. Yang, V. Eminyan, R. Schlüter, and H. Ney. Refined Statistical Bounds for Classification Error Mismatches with Constrained Bayes Error. In 2024 IEEE Information Theory Workshop (ITW 2024), Shenzhen, China, November 2024. Preprint arXiv:2409.01309.
J. Xu, W. Zhou, Z. Yang, E. Beck, and R. Schlueter. Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition. In Interspeech, Kos, Greece, September 2024. Preprint arXiv:2407.18930.
T. Raissi, C. Lüscher, S. Berger, R. Schlüter, and H. Ney. Investigating the Effect of Label Topology and Training Criterion on ASR Performance and Alignment Quality. In Interspeech, Kos, Greece, September 2024. Preprint arXiv:2407.11641.
B. Hilmes, N. Rossenbach, and R. Schlüter. On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures. In Synthetic Data's Transformative Role in Foundational Speech Models (SynData4GenAI), Kos, Greece, August 2024.
N. Rossenbach, R. Schlüter, and S. Sakti. On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition. In SynData4GenAI, Kos, Greece, August 2024.
B. Hilmes, N. Rossenbach, P. Bahar, and R. Schlüter. Towards Efficient Speech Translation: Degradation-less 8-bit Quantization. In International Conference on Neuromorphic Computing (ICNCE), Aachen, Germany, June 2024.
Z. Yang, W. Zhou, R. Schlüter, and H. Ney. On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, Korea, April 2024. Preprint arXiv:2309.14130.
M. Zeineldeen, A. Zeyer, R. Schlüter, and H. Ney. Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, Korea, April 2024. Preprint arXiv:2309.08436.

2023

Z. Yang, W. Zhou, R. Schlüter, and H. Ney. Investigating the Effect of Language Models in Sequence Discriminative Training for Neural Transducers. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Taipei, Taiwan, December 2023.
N. Rossenbach, B. Hilmes, and R. Schlüter. On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Taipei, Taiwan, December 2023.
R. Prabhavalkar, T. Hori, T. N. Sainath, R. Schlüter, and S. Watanabe. End-to-End Speech Recognition: A Survey. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 32, October 2023. Preprint arXiv:2303.03329, early access: ieeexplore.ieee.org/document/10301513.
D. Mann, T. Raissi, W. Michel, R. Schlüter, and H. Ney. End-to-End Training of a Neural HMM with Label and Transition Probabilities. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), October 2023. Preprint arXiv:2310.02724.
C. Lüscher, M. Zeineldeen, Z. Yang, T. Raissi, P. Vieting, K. Le-Duc, W. Wang, R. Schlüter, and H. Ney. Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech. In ITG Conference on Speech Communication, pages 161-165, Aachen, Germany, September 2023.
C. Lüscher, J. Xu, M. Zeineldeen, R. Schlüter, and H. Ney. Analyzing And Improving Neural Speaker Embeddings for ASR. In ITG Conference on Speech Communication, pages 205-209, Aachen, Germany, September 2023.
P. Vieting, R. Schlüter, and H. Ney. Comparative Analysis of the wav2vec 2.0 Feature Extractor. In ITG Conference on Speech Communication, pages 131-135, Aachen, Germany, September 2023.
W. Zhou, E. Beck, S. Berger, R. Schlüter, and H. Ney. RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition. In Interspeech, pages 4094-4098, Dublin, Ireland, August 2023. [slides].
S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach. Mixture Encoder for Joint Speech Separation and Recognition. In Interspeech, Dublin, Ireland, August 2023.
P. Vieting, C. Lüscher, J. Dierkes, R. Schlüter, and H. Ney. Efficient Utilization of Large Pre-Trained Models for Low Resource ASR. In IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSP SASB), pages 1-5, Rhodes, Greece, June 2023.
W. Zhou, H. Wu, J. Xu, M. Zeineldeen, C. Lüscher, R. Schlüter, and H. Ney. Enhancing and Adversarial: Improve ASR with Speaker Labels. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, June 2023. [poster].
Z. Yang, W. Zhou, R. Schlüter, and H. Ney. Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, June 2023.
T. Raissi, W. Zhou, S. Berger, R. Schlüter, and H. Ney. HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch. In IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar, January 2023.
A. Zeyer, R. Schmitt, W. Zhou, R. Schlüter, and H. Ney. Monotonic segmental attention for automatic speech recognition. In IEEE Spoken Language Technology Workshop (SLT), pages 229-236, Doha, Qatar, January 2023. Preprint arXiv:2210.14742.
T. Raissi, C. Lüscher, M. Gunz, R. Schlüter, and H. Ney. Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think. In Interspeech, Dublin, Ireland, 2023.

2022

F. Meyer, W. Michel, M. Zeineldeen, R. Schlüter, and H. Ney. Automatic Learning of Subword Dependent Model Scales. In Interspeech, Incheon, Korea, September 2022.
W. Zhou, W. Michel, R. Schlüter, and H. Ney. Efficient Training of Neural Transducer for Speech Recognition. In Interspeech, pages 2058-2062, Incheon, Korea, September 2022. [poster].
M. Zeineldeen, J. Xu, C. Lüscher, R. Schlüter, and H. Ney. Improving the Training Recipe for a Robust Conformer-based Hybrid Model. In Interspeech, Incheon, Korea, September 2022. [poster].
Z. Yang, Y. Gao, A. Gerstenberger, J. Jiang, R. Schlüter, and H. Ney. Self-Normalized Importance Sampling for Neural Language Modeling. In Interspeech, Incheon, Korea, September 2022.
W. Zhou, Z. Zheng, R. Schlüter, and H. Ney. On Language Model Integration for RNN Transducer based Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 8407-8411, Singapore, May 2022. [poster] [slides].
N. Wynands, W. Michel, J. Rosendahl, R. Schlüter, and H. Ney. Efficient Sequence Training of Attention Models using Approximative Recombination. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Singapore, May 2022. Preprint arXiv:2110.09245.
M. Zeineldeen, J. Xu, C. Lüscher, W. Michel, A. Gerstenberger, R. Schlüter, and H. Ney. Conformer-based Hybrid ASR System for Switchboard Dataset. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Singapore, May 2022. [poster].
M. Gansen, J. Lou, F. Freye, T. Gemmeke, F. Merchant, A. Zeyer, M. Zeineldeen, R. Schlüter, and X. Fan. Discrete Steps towards Approximate Computing. In International Symposium on Quality Electronic Design (ISQED), pages 1-6, April 2022. DOI: 10.1109/ISQED54688.2022.9806215.
T. Raissi, E. Beck, R. Schlüter, and H. Ney. Improving Factored Hybrid HMM Acoustic Modeling without State Tying. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Singapore, 2022. To appear. Preprint arXiv:2201.09692.

2021

P. Vieting, C. Lüscher, W. Michel, R. Schlüter, and H. Ney. On Architectures and Training for Raw Waveform Feature Extraction in ASR. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 267-274, Cartagena, Colombia, December 2021.
N. Rossenbach, M. Zeineldeen, B. Hilmes, R. Schlüter, and H. Ney. Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, Colombia, December 2021.
Yu Qiao, Sourabh Zanwar, Rishab Bhattacharyya, Daniel Wiechmann, Wei Zhou, Elma Kerz, and Ralf Schlüter. Prediction of Listener Perception of Argumentative Speech in a Crowdsourced Dataset Using (Psycho-)Linguistic and Fluency Features. November 2021.
A. Zeyer, A. Merboldt, W. Michel, R. Schlüter, and H. Ney. Librispeech Transducer Model with Internal Language Model Prior Correction. In Interspeech, pages 2052-2056, August 2021. arXiv:2104.03006.
W. Zhou, A. Zeyer, A. Merboldt, R. Schlüter, and H. Ney. Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept. In Interspeech, pages 2891-2895, August 2021. [slides].
W. Zhou, M. Zeineldeen, Z. Zheng, R. Schlüter, and H. Ney. Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition. In Interspeech, pages 2886-2890, August 2021. [slides].
Y. Qiao, W. Zhou, E. Kerz, and R. Schlüter. The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech. In Interspeech, pages 4453-4457, August 2021.
M. Zeineldeen, A. Glushko, W. Michel, A. Zeyer, R. Schlüter, and H. Ney. Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models. In Interspeech, pages 2856-2860, August 2021. [slides] [video].
Y. Gao, D. Thulke, A. Gerstenberger, K. V. Tran, R. Schlüter, and H. Ney. On Sampling-Based Training Criteria for Neural Language Modeling. In Interspeech, August 2021.
W. Zhou, S. Berger, R. Schlüter, and H. Ney. Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5644-5648, June 2021. [poster].
Albert Zeyer, Ralf Schlüter, and Hermann Ney. Why does CTC result in peaky behavior? May 2021. Preprint arXiv:2105.14849.
Albert Zeyer, Ralf Schlüter, and Hermann Ney. A study of latent monotonic attention variants. March 2021. Preprint arXiv:2103.16710.
P. Bahar, T. Bieschke, R. Schlüter, and H. Ney. Tight Integrated End-to-End Training for Cascaded Speech Translation. In IEEE Spoken Language Technology Workshop (SLT), pages 950-957, Shenzhen, China, January 2021.
T. Raissi, E. Beck, R. Schlüter, and H. Ney. Towards Consistent Hybrid HMM Acoustic Modeling. Internal Report, Lehrstuhl für Informatik 6, RWTH Aachen University, 2021. arXiv:2104.02387.

2020

Mohammad Zeineldeen, Albert Zeyer, Wei Zhou, Thomas Ng, Ralf Schlüter, and Hermann Ney. A Systematic Comparison of Grapheme-based vs. Phoneme-based Label Units for Encoder-Decoder-Attention Models. November 2020. Preprint arXiv:2005.09336.
A. Zeyer, N. Rossenbach, P. Bahar, A. Merboldt, and R. Schlüter. Efficient and Flexible Implementation of Machine Learning for ASR and MT. In Interspeech, Shanghai, China, October 2020. Tutorial slides.
J. Huo, Y. Gao, W. Wang, R. Schlüter, and H. Ney. Investigation of Large-Margin Softmax in Neural Language Modeling. In Interspeech, Shanghai, China, October 2020.
W. Michel, R. Schlüter, and H. Ney. Early Stage LM Integration Using Local and Global Log-Linear Combination. In Interspeech, pages 3605-3609, Shanghai, China, October 2020.
W. Zhou, R. Schlüter, and H. Ney. Robust Beam Search for Encoder-Decoder Attention Based Speech Recognition without Length Bias. In Interspeech, pages 1768-1772, Shanghai, China, October 2020. [slides].
A. Zeyer, A. Merboldt, R. Schlüter, and H. Ney. A New Training Pipeline for an Improved Neural Transducer. In Interspeech, Shanghai, China, October 2020. [slides].
E. Beck, R. Schlüter, and H. Ney. LVCSR with Transformer Language Models. In Interspeech, pages 1798-1802, Shanghai, China, October 2020.
T. Raissi, E. Beck, R. Schlüter, and H. Ney. Context-Dependent Acoustic Modeling without Explicit Phone Clustering. In Interspeech, Shanghai, China, October 2020.
M. Zeineldeen, A. Zeyer, R. Schlüter, and H. Ney. Layer-normalized LSTM for Hybrid-HMM and End-to-End ASR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7679-7683, Barcelona, Spain, May 2020. [slides].
N. Rossenbach, A. Zeyer, R. Schlüter, and H. Ney. Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7069-7073, Barcelona, Spain, May 2020.
W. Zhou, R. Schlüter, and H. Ney. Full-Sum Decoding for Hybrid HMM based Speech Recognition using LSTM Language Model. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7834-7838, Barcelona, Spain, May 2020.
W. Zhou, W. Michel, K. Irie, M. Kitza, R. Schlüter, and H. Ney. The RWTH ASR system for TED-LIUM release 2: Improving Hybrid HMM with SpecAugment. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7839-7843, Barcelona, Spain, May 2020.
P. Bahar, N. Makarov, A. Zeyer, R. Schlüter, and H. Ney. Exploring A Zero-Order Direct HMM based on Latent Attention for Automatic Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7854-7858, Barcelona, Spain, May 2020.
W. Michel, R. Schlüter, and H. Ney. Frame-Level MMI as a Sequence Discriminative Training Criterion for LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6904-6908, Barcelona, Spain, May 2020.
V. Bozheniuk, A. Zeyer, R. Schlüter, and H. Ney. A comprehensive study of Residual CNNs for acoustic modeling in ASR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7674-7678, Barcelona, Spain, May 2020.
K. Irie, A. Gerstenberger, R. Schlüter, and H. Ney. How Much Self-Attention Do We Need? Trading Attention for Feed-Forward Layers. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6154-6158, Barcelona, Spain, May 2020. [slides].

2019

K. Irie, A. Zeyer, R. Schlüter, and H. Ney. Training Language Models for Long-Span Cross-Sentence Evaluation. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 419-426, Sentosa, Singapore, December 2019. [poster].
A. Zeyer, P. Bahar, K. Irie, R. Schlüter, and H. Ney. A comparison of Transformer and LSTM encoder decoder models for ASR. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 8-15, Sentosa, Singapore, December 2019.
P. Bahar, A. Zeyer, R. Schlüter, and H. Ney. On Using SpecAugment for End-to-End Speech Translation. In International Workshop on Spoken Language Translation (IWSLT), Hong Kong, China, November 2019.
T. Menne, I. Sklyar, R. Schlüter, and H. Ney. Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech. In Interspeech, pages 2638-2642, Graz, Austria, September 2019.
K. Irie, A. Zeyer, R. Schlüter, and H. Ney. Language Modeling with Deep Transformers. In Interspeech, pages 3905-3909, Graz, Austria, September 2019. ISCA Best Student Paper Award. [slides].
W. Michel, R. Schlüter, and H. Ney. Comparison of Lattice-Free and Lattice-Based Sequence Discriminative Training Criteria for LVCSR. In Interspeech, pages 1601-1605, Graz, Austria, September 2019.
C. Lüscher, E. Beck, K. Irie, M. Kitza, W. Michel, A. Zeyer, R. Schlüter, and H. Ney. RWTH ASR Systems for LibriSpeech: Hybrid vs Attention. In Interspeech, pages 231-235, Graz, Austria, September 2019.
A. Merboldt, A. Zeyer, R. Schlüter, and H. Ney. An analysis of local monotonic attention variants. In Interspeech, pages 1398-1402, Graz, Austria, September 2019.
M. Kitza, P. Golik, R. Schlüter, and H. Ney. Cumulative Adaptation for BLSTM Acoustic Models. In Interspeech, pages 754-758, Graz, Austria, September 2019.
A. Piunova, E. Beck, R. Schlüter, and H. Ney. Rescoring Keyword Search Confidence Estimates with Graph-based Re-ranking Using Acoustic Word Embeddings. In Interspeech, Graz, Austria, September 2019.
Eugen Beck, Wei Zhou, Ralf Schlüter, and Hermann Ney. LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring. July 2019. https://arxiv.org/abs/1907.01030.
M. A. Tahir, H. Huang, A. Zeyer, R. Schlüter, and H. Ney. Training of reduced-rank linear transformations for multi-layer polynomial acoustic features for speech recognition. Speech Communication, volume 110, pages 56-63, July 2019.
P. Bahar, A. Zeyer, R. Schlüter, and H. Ney. On using 2D sequence-to-sequence models for speech recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5671-5675, Brighton, UK, May 2019.
T. Menne, R. Schlüter, and H. Ney. Investigation into Joint Optimization of Single Channel Speech Enhancement and Acoustic Modeling for Robust ASR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6660-6664, Brighton, UK, May 2019.
K. Beneš, K. Irie, E. Beck, R. Schlüter, and H. Ney. Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources. In Deutsche Jahrestagung für Akustik (DAGA), pages 954-957, Rostock, Germany, March 2019.
R. Schlüter, E. Beck, and H. Ney. Upper and Lower Tight Error Bounds for Feature Omission with an Extension to Context Reduction. IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 41, number 2, pages 502-514, February 2019. DOI 10.1109/TPAMI.2017.2788434.

2018

A. Zeyer, A. Merboldt, R. Schlüter, and H. Ney. A comprehensive analysis on attention models. In Interpretability and Robustness in Audio, Speech, and Language (IRASL) Workshop, Conference on Neural Information Processing Systems (NeurIPS) (NIPS IRASL), Montreal, Canada, December 2018.
T. Menne, R. Schlüter, and H. Ney. Speaker Adapted Beamforming for Multi-Channel Automatic Speech Recognition. In International Workshop On Spoken Language Technology (SLT), pages 535-541, Athens, Greece, December 2018.
E. Beck, A. Zeyer, P. Doetsch, A. Merboldt, R. Schlüter, and H. Ney. Sequence Modeling and Alignment for LVCSR-Systems. In ITG Conference on Speech Communication, Oldenburg, October 2018.
K. Irie, Z. Lei, L. Deng, R. Schlüter, and H. Ney. Investigation on Estimation of Sentence Probability By Combining Forward, Backward and Bi-directional LSTM-RNNs. In Interspeech, pages 392-395, Hyderabad, India, September 2018.
A. Zeyer, K. Irie, R. Schlüter, and H. Ney. Improved training of end-to-end attention models for speech recognition. In Interspeech, Hyderabad, India, September 2018.
E. Beck, M. Hannemann, P. Doetsch, R. Schlüter, and H. Ney. Segmental Encoder-Decoder Models for Large Vocabulary Automatic Speech Recognition. In Interspeech, Hyderabad, India, September 2018.
M. Kitza, W. Michel, C. Boeddeker, J. Heitkaemper, T. Menne, R. Schlüter, H. Ney, J. Schmalenstroeer, L. Drude, J. Heymann, and R. Haeb-Umbach. The RWTH/UPB System Combination for the CHiME 2018 Workshop. In The 5th International Workshop on Speech Processing in Everyday Environments (CHiME-5), pages 53-57, Hyderabad, India, September 2018.
M. Kitza, R. Schlüter, and H. Ney. Comparison of BLSTM-Layer-Specific Affine Transformations for Speaker Adaptation. In Interspeech, pages 877-881, Hyderabad, India, September 2018.
Z. Tüske, R. Schlüter, and H. Ney. Investigation on LSTM Recurrent N-gram Language Models for Speech Recognition. In Interspeech, pages 3358-3362, Hyderabad, India, September 2018.
K. Irie, Z. Lei, R. Schlüter, and H. Ney. Prediction of LSTM-RNN Full Context States as a Subtask for N-gram Feedforward Language Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6104-6108, Calgary, Canada, April 2018. IEEE Spoken Language Processing Student Travel Grant. Best Student Paper Award.
Z. Tüske, R. Schlüter, and H. Ney. Acoustic modeling of Speech Waveform based on Multi-Resolution, Neural Network Signal Processing. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4859-4863, Calgary, Canada, April 2018.
R. Schlüter, P. Doetsch, P. Golik, M. Kitza, T. Menne, K. Irie, Z. Tüske, and A. Zeyer. Neuronale Netze in der automatischen Spracherkennung - ein Paradigmenwechsel? In 44. Jahrestagung für Akustik der Deutschen Gesellschaft für Akustik, pages 15-28, Munich, Germany, March 2018.
T. Menne, Z. Tüske, R. Schlüter, and H. Ney. Learning Acoustic Features from the Raw Waveform for Automatic Speech Recognition. In 44. Jahrestagung für Akustik der Deutschen Gesellschaft für Akustik, pages 1533-1536, Munich, Germany, March 2018.

2017

P. Doetsch, M. Hannemann, R. Schlueter, and H. Ney. Inverted Alignments for End-to-End Automatic Speech Recognition. IEEE Journal of Selected Topics in Signal Processing, volume 11, number 8, pages 1265-1273, December 2017.
P. Golik, Z. Tüske, K. Irie, E. Beck, R. Schlüter, and H. Ney. The 2016 RWTH Keyword Search System for Low-Resource Languages. In International Conference Speech and Computer (SPECOM), Lecture Notes in Computer Science, Subseries Lecture Notes in Artificial Intelligence, volume 10458, pages 719-730, Hatfield, UK, September 2017.
A. Zeyer, E. Beck, R. Schlüter, and H. Ney. CTC in the Context of Generalized Full-Sum HMM Training. In Interspeech, pages 944-948, Stockholm, Sweden, August 2017.
Z. Tüske, W. Michel, R. Schlüter, and H. Ney. Parallel Neural Network Features for Improved Tandem Acoustic Modeling. In Interspeech, pages 1651-1655, Stockholm, Sweden, August 2017.
P. Doetsch, A. Zeyer, P. Voigtlaender, I. Kulikov, R. Schlüter, and H. Ney. RETURNN: the RWTH extensible training framework for universal recurrent neural networks. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5345-5349, New Orleans, LA, USA, March 2017.
K. Irie, P. Golik, R. Schlüter, and H. Ney. Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5740-5744, New Orleans, LA, USA, March 2017.
A. Zeyer, I. Kulikov, R. Schlüter, and H. Ney. Faster sequence training. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5285-5289, New Orleans, LA, USA, March 2017.
A. Zeyer, P. Doetsch, P. Voigtlaender, R. Schlüter, and H. Ney. A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 2462-2466, New Orleans, LA, USA, March 2017.
M. Nussbaum-Thom, R. Schlüter, V. Goel, and H. Ney. Noisy Objective Functions based on the f-Divergence. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 2327-2331, New Orleans, LA, USA, March 2017.

2016

P. Doetsch, S. Hegselmann, R. Schlüter, and H. Ney. Inverted HMM - a Proof of Concept. In Neural Information Processing Systems Workshop (NIPS Workshop), Barcelona, Spain, December 2016.
W. Michel, Z. Tüske, M. A. B. Shaik, R. Schlüter, and H. Ney. The RWTH Aachen LVCSR system for IWSLT-2016 German Skype conversation recognition task. In International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 2016.
M. Kitza, J. Heymann, A. Zeyer, R. Schlüter, and R. Häb-Umbach. Robust Online Multi-Channel Speech Recognition. In ITG Conference on Speech Communication, pages 347-351, Paderborn, Germany, October 2016.
K. Irie, Z. Tüske, T. Alkhouli, R. Schlüter, and H. Ney. LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition. In Interspeech, pages 3519-3523, San Francisco, CA, USA, September 2016.
A. Zeyer, R. Schlüter, and H. Ney. Towards online-recognition with deep bidirectional LSTM acoustic models. In Interspeech, pages 3424-3428, San Francisco, CA, USA, September 2016.
T. Menne, J. Heymann, A. Alexandridis, K. Irie, A. Zeyer, M. Kitza, P. Golik, I. Kulikov, L. Drude, R. Schlüter, H. Ney, R. Haeb-Umbach, and A. Mouchtaris. The RWTH/UPB/FORTH System Combination for the 4th CHiME Challenge Evaluation. In The 4th International Workshop on Speech Processing in Everyday Environments, pages 39-44, San Francisco, CA, USA, September 2016.
R. Schlüter, P. Doetsch, P. Golik, M. Kitza, T. Menne, K. Irie, Z. Tüske, and A. Zeyer. Automatic Speech Recognition Based on Neural Networks. In International Conference Speech and Computer (SPECOM), Lecture Notes in Computer Science, Subseries Lecture Notes in Artificial Intelligence, volume 9811, pages 3-17, Budapest, Hungary, August 2016.
Z. Tüske, K. Irie, R. Schlüter, and H. Ney. Investigation on log-linear interpolation of multi-domain neural network language model. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6005-6009, Shanghai, China, March 2016.

2015

Z. Tüske, P. Golik, R. Schlüter, and H. Ney. Speaker Adaptive Joint Training of Gaussian Mixture Models and Bottleneck Features. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 596-603, Scottsdale, AZ, USA, December 2015.
J. Cui, B. Kingsbury, B. Ramabhadran, A. Sethy, K. Audhkhasi, X. Cui, E. Kislal, L. Mangu, M. Nussbaum-Thom, M. Picheny, Z. Tüske, P. Golik, R. Schlüter, H. Ney, M. J. F. Gales, K. M. Knill, A. Ragni, H. Wang, and P. Woodland. Multilingual Representations for Low Resource Speech Recognition and Keyword Search. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 259-266, Scottsdale, AZ, USA, December 2015.
M. A. B. Shaik, Z. Tüske, M. A. Tahir, M. Nussbaum-Thom, R. Schlüter, and H. Ney. Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, Urdu, and Arabic. In Interspeech, pages 3154-3157, Dresden, Germany, September 2015.
P. Golik, Z. Tüske, R. Schlüter, and H. Ney. Convolutional Neural Networks for Acoustic Modeling of Raw Time Signal in LVCSR. In Interspeech, pages 26-30, Dresden, Germany, September 2015.
P. Golik, Z. Tüske, R. Schlüter, and H. Ney. Multilingual Features Based Keyword Search for Very Low-Resource Languages. In Interspeech, pages 1260-1264, Dresden, Germany, September 2015.
K. Irie, R. Schlüter, and H. Ney. Bag-of-Words Input for Long History Representation in Neural Network-based Language Models for Speech Recognition. In Interspeech, pages 2371-2375, Dresden, Germany, September 2015.
E. Beck, R. Schlüter, and H. Ney. Error Bounds for Context Reduction and Feature Omission. In Interspeech, pages 1280-1284, Dresden, Germany, September 2015.
P. Voigtlaender, P. Doetsch, S. Wiesler, R. Schlüter, and H. Ney. Sequence-Discriminative Training of Recurrent Neural Networks. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 2100-2104, Brisbane, Australia, April 2015.
J. Heymann, R. Haeb-Umbach, P. Golik, and R. Schlüter. Unsupervised Adaptation of a Denoising Autoencoder by Bayesian Feature Enhancement for Reverberant ASR under Mismatch Conditions. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5053-5057, Brisbane, Australia, April 2015.
S. Wiesler, P. Golik, R. Schlüter, and H. Ney. Investigations on Sequence Training of Neural Networks. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4565-4569, Brisbane, Australia, April 2015. IEEE Spoken Language Processing Student Travel Grant.
Z. Tüske, M. A. Tahir, R. Schlüter, and H. Ney. Integrating Gaussian Mixtures into Deep Neural Networks: Softmax Layer with Hidden Variables. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4285-4289, Brisbane, Australia, April 2015.
M. A. B. Shaik, A. El-Desoky Mousa, S. Hahn, R. Schlüter, and H. Ney. Improved strategies for a zero OOV rate LVCSR system. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5048-5052, Brisbane, Australia, April 2015.
M. A. Tahir, S. Wiesler, R. Schlüter, and H. Ney. Investigation of Mixture Splitting Concept for Training Linear Bottlenecks of Deep Neural Network Acoustic Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4614-4618, Brisbane, Australia, April 2015.
M. Sundermeyer, H. Ney, and R. Schlüter. From Feedforward to Recurrent LSTM Neural Networks for Language Modeling. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 23, number 3, pages 517-529, March 2015.

2014

M. Kozielski, M. Matysiak, P. Doetsch, R. Schlueter, and H. Ney. Open-lexicon Language Modeling Combining Word and Character Levels. International Conference on Frontiers in Handwriting Recognition, pages 343-348, Crete, Greece, September 2014.
Z. Tüske, P. Golik, R. Schlüter, and H. Ney. Acoustic Modeling with Deep Neural Networks Using Raw Time Signal for LVCSR. In Interspeech, pages 890-894, Singapore, September 2014. ISCA best student paper award Interspeech 2014.
Z. Tüske, P. Golik, D. Nolden, R. Schlüter, and H. Ney. Data Augmentation, Feature Combination, and Multilingual Neural Networks to Improve ASR and KWS Performance for Low-resource Languages. In Interspeech, pages 1420-1424, Singapore, September 2014.
M. A. B. Shaik, Z. Tüske, M. A. Tahir, M. Nussbaum-Thom, R. Schlüter, and H. Ney. RWTH LVCSR Systems for Quaero and EU-Bridge: German, Polish, Spanish and Portuguese. In Interspeech, pages 973-977, Singapore, September 2014.
M. Sundermeyer, Z. Tüske, R. Schlüter, and H. Ney. Lattice Decoding and Rescoring with Long-Span Neural Network Language Models. In Interspeech, pages 661-665, Singapore, September 2014.
M. Sundermeyer, R. Schlüter, and H. Ney. rwthlm - The RWTH Aachen University Neural Network Language Modeling Toolkit. In Interspeech, pages 2093-2097, Singapore, September 2014.
D. Nolden, R. Schlüter, and H. Ney. Word Pair Approximation for More Efficient Decoding with High-Order Language Models. In Interspeech, pages 646-650, Singapore, September 2014.
S. Wiesler, A. Richard, R. Schlüter, and H. Ney. Mean-normalized stochastic gradient for large-scale deep learning. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 180-184, Florence, Italy, May 2014. IBM Research Spoken Language Processing Student Travel Grant Award.
S. Wiesler, A. Richard, P. Golik, R. Schlüter, and H. Ney. RASR/NN: The RWTH neural network toolkit for speech recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 3313-3317, Florence, Italy, May 2014.
S. Wiesler, K. Irie, Z. Tüske, R. Schlüter, and H. Ney. The RWTH English lecture recognition system. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 3322-3326, Florence, Italy, May 2014.
Z. Tüske, D. Nolden, R. Schlüter, and H. Ney. Multilingual MRASTA Features for Low-resource Keyword Search and Speech Recognition Systems. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7904-7908, Florence, Italy, May 2014.
M. Nussbaum-Thom, X. Cui, R. Schlüter, V. Goel, and H. Ney. A family of discriminative training criteria based on the F-divergence for deep neural networks. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5612-5616, Florence, Italy, May 2014.

2013

M. A. B. Shaik, Z. Tüske, S. Wiesler, M. Nussbaum-Thom, S. Peitz, R. Schlüter, and H. Ney. The RWTH Aachen German and English LVCSR systems for IWSLT-2013. In International Workshop on Spoken Language Translation (IWSLT), pages 120-127, Heidelberg, Germany, December 2013.
G. Heigold, H. Ney, and R. Schlüter. Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs. IEEE Transactions on Audio, Speech, and Language Processing, volume 21, number 12, pages 2616-2626, December 2013.
D. Nolden, R. Schlüter, and H. Ney. Efficient Nearly Error-Less LVCSR Decoding Based on Incremental Forward and Backward Passes. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 66-71, Olomouc, Czech Republic, December 2013.
R. Schlüter, M. Nußbaum-Thom, E. Beck, T. Alkhouli, and H. Ney. Novel Tight Classification Error Bounds under Mismatch Conditions based on f-Divergence. In IEEE Information Theory Workshop, pages 432-436, Sevilla, Spain, September 2013.
P. Golik, Z. Tüske, R. Schlüter, and H. Ney. Development of the RWTH Transcription System for Slovenian. In Interspeech, pages 3107-3111, Lyon, France, August 2013.
A. El-Desoky Mousa, M. A. B. Shaik, R. Schlüter, and H. Ney. Morpheme Level Hierarchical Pitman-Yor Class-based Language Models for LVCSR of Morphologically Rich Languages. In Interspeech, pages 3409-3413, Lyon, France, August 2013.
M. A. B. Shaik, A. El-Desoky Mousa, R. Schlüter, and H. Ney. Feature-rich sub-lexical language models using a maximum entropy approach for German LVCSR. In Interspeech, pages 3404-3408, Lyon, France, August 2013.
M. A. Tahir, H. Huang, R. Schlüter, H. Ney, L. t. Bosch, B. Cranen, and L. Boves. Training Log-Linear Acoustic Models in Higher-Order Polynomial Feature Space for Speech Recognition. In Interspeech, pages 3352-3355, Lyon, France, August 2013.
S. Hahn, P. Lehnen, S. Wiesler, R. Schlüter, and H. Ney. Improving LVCSR with Hidden Conditional Random Fields for Grapheme-to-Phoneme Conversion. In Interspeech, pages 495-499, Lyon, France, August 2013.
Z. Tüske, R. Schlüter, and H. Ney. Multilingual Hierarchical MRASTA Features for ASR. In Interspeech, pages 2222-2226, Lyon, France, August 2013.
M. Nußbaum-Thom, E. Beck, T. Alkhouli, R. Schlüter, and H. Ney. Relative Error Bounds for Statistical Classifiers Based on the f-Divergence. In Interspeech, pages 2197-2201, Lyon, France, August 2013.
J. Mamou, J. Cui, X. Cui, M. Gales, B. Kingsbury, K. Knill, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, M. Saraclar, R. Schlüter, A. Sethy, and P. Woodland. Developing Keyword Search under the IARPA Babel Program. AFEKA Speech Processing Conference, Tel Aviv, Israel, July 2013.
D. Rybach, H. Ney, and R. Schlüter. Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR. IEEE Transactions on Audio, Speech, and Language Processing, volume 21, number 6, pages 1295-1307, June 2013.
M. Kozielski, D. Rybach, S. Hahn, R. Schlüter, and H. Ney. Open Vocabulary Handwriting Recognition Using Combined Word-Level and Character-Level Language Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 8257-8261, Vancouver, Canada, May 2013. IBM Research Spoken Language Processing Student Travel Grant Award.
M. Sundermeyer, I. Oparin, J. Gauvain, B. Freiberg, R. Schlüter, and H. Ney. Comparison of Feedforward and Recurrent Neural Network Language Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 8430-8434, Vancouver, Canada, May 2013.
Z. Tüske, R. Schlüter, and H. Ney. Deep hierarchical bottleneck MRASTA features for LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6970-6974, Vancouver, Canada, May 2013.
Z. Tüske, J. Pinto, D. Willett, and R. Schlüter. Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 7349-7353, Vancouver, Canada, May 2013.
S. Wiesler, A. Richard, R. Schlüter, and H. Ney. A Critical Evaluation of Stochastic Algorithms for Convex Optimization. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6955-6959, Vancouver, Canada, May 2013.
C. Plahl, M. Kozielski, R. Schlüter, and H. Ney. Feature Combination and Stacking of Recurrent and Non-recurrent Neural Networks for LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6714-6718, Vancouver, Canada, May 2013.
D. Nolden, R. Schlüter, and H. Ney. Advanced Search Space Pruning with Acoustic Look-Ahead for WFST Based LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6734-6738, Vancouver, Canada, May 2013.
B. Kingsbury, J. Cui, X. Cui, M. Gales, K. Knill, J. Mamou, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schlüter, A. Sethy, and P. Woodland. A high-performance Cantonese keyword search system. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 8277-8281, Vancouver, Canada, May 2013.
J. Mamou, J. Cui, X. Cui, M. Gales, B. Kingsbury, K. Knill, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schlüter, A. Sethy, and P. Woodland. System Combination and Score Normalization for Spoken Term Detection. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 8272-8276, Vancouver, Canada, May 2013.

2012

G. Heigold, R. Schlüter, H. Ney, and S. Wiesler. Discriminative Training for Automatic Speech Recognition: Modeling, Criteria, Optimization, Implementation, and Performance. IEEE Signal Processing Magazine, volume 29, number 6, pages 58-69, November 2012.
M. A. B. Shaik, A. El-Desoky Mousa, R. Schlüter, and H. Ney. Investigation of Maximum Entropy Hybrid Language Models for Open Vocabulary German and Polish LVCSR. In Interspeech, pages 1071-1074, Portland, OR, USA, September 2012.
A. El-Desoky Mousa, M. A. B. Shaik, R. Schlüter, and H. Ney. Morpheme Level Feature-based Language Models for German LVCSR. In Interspeech, pages 170-173, Portland, OR, USA, September 2012.
M. A. B. Shaik, D. Rybach, S. Hahn, R. Schlüter, and H. Ney. Hierarchical Hybrid Language models for Open Vocabulary Continuous Speech Recognition using WFST. In Workshop on Statistical and Perceptual Audition (SAPA), pages 46-51, Portland, OR, USA, September 2012.
M. A. Tahir, M. Nußbaum, R. Schlüter, and H. Ney. Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition. In Interspeech, pages 571-574, Portland, OR, USA, September 2012.
M. Sundermeyer, R. Schlüter, and H. Ney. LSTM Neural Networks for Language Modeling. In Interspeech, pages 194-197, Portland, OR, USA, September 2012.
S. Wiesler, R. Schlüter, and H. Ney. Accelerated Batch Learning of Convex Log-linear Models for LVCSR. In Interspeech, pages 1207-1210, Portland, OR, USA, September 2012.
Z. Tüske, M. Sundermeyer, R. Schlüter, and H. Ney. Context-Dependent MLPs for LVCSR: TANDEM, Hybrid or Both?. In Interspeech, pages 18-21, Portland, OR, USA, September 2012.
Z. Tüske, F. R. Drepper, and R. Schlüter. Non-Stationary Signal Processing and its Application in Speech Recognition. In Workshop on Statistical and Perceptual Audition (SAPA), Portland, OR, USA, September 2012.
D. Nolden, R. Schlüter, and H. Ney. Search Space Pruning Based on Anticipated Path Recombination in LVCSR. In Interspeech, pages 1014-1017, Portland, OR, USA, September 2012.
M. Nussbaum-Thom, Z. Tüske, G. Heigold, R. Schlüter, and H. Ney. Posterior Scaled MPE: Novel Discriminative Training Criteria. In Interspeech, pages 2614-2617, Portland, OR, USA, September 2012.
Z. Tüske, F. R. Drepper, and R. Schlüter. Phase Difference of Filter-stable Part-tones as Acoustic Feature. In IEEE Statistical Signal Processing Workshop (SSP), pages 365-368, Ann Arbor, MI, USA, August 2012.
D. Rybach, R. Schlüter, and H. Ney. Silence is Golden: Modeling Non-speech Events in WFST-based Dynamic Network Decoders. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4205-4208, Kyoto, Japan, March 2012.
A. El-Desoky Mousa, R. Schlüter, and H. Ney. Investigations on the Use of Morpheme Level Features In Language Models for Arabic LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5021-5024, Kyoto, Japan, March 2012.
Y. Kubo, S. Watanabe, A. Nakamura, S. Wiesler, R. Schlüter, and H. Ney. Basis Vector Orthogonalization for an Improved Kernel Gradient Matching Pursuit Method. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 1909-1912, Kyoto, Japan, March 2012.
Z. Tüske, R. Schlüter, and H. Ney. Comparison and combination of different CRBE based MLP features for LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4081-4084, Kyoto, Japan, March 2012.
D. Nolden, D. Rybach, R. Schlüter, and H. Ney. Joining Advantages of Word-conditioned and Token-passing Decoding. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4425-4428, Kyoto, Japan, March 2012.
D. Nolden, R. Schlüter, and H. Ney. Extended Search Space Pruning in LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4429-4432, Kyoto, Japan, March 2012.
R. Schlüter, M. Nußbaum-Thom, and H. Ney. Does the Cost Function Matter in Bayes Decision Rule?. IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 34, number 2, pages 292-301, February 2012.
B. Hoffmeister, G. Heigold, D. Rybach, R. Schlüter, and H. Ney. WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 20, number 2, pages 551-564, February 2012.

2011

L. Lamel, S. Courcinous, J. Despres, J. Gauvain, Y. Josse, K. Kilgour, F. Kraft, V. B. Le, H. Ney, M. Nußbaum-Thom, I. Oparin, T. Schlippe, R. Schlüter, T. Schultz, T. Fraga da Silva, S. Stüker, M. Sundermeyer, B. Vieru, N. T. Vu, A. Waibel, and C. Woehrling. Speech Recognition for Machine Translation in Quaero. In International Workshop on Spoken Language Translation (IWSLT), pages 121-128, San Francisco, CA, USA, December 2011.
S. Wiesler, R. Schlüter, and H. Ney. A Convergence Analysis of Log-Linear Training and its Application to Speech Recognition. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 1-6, Waikoloa, HI, USA, December 2011.
D. Rybach, S. Hahn, P. Lehnen, D. Nolden, M. Sundermeyer, Z. Tüske, S. Wiesler, R. Schlüter, and H. Ney. RASR - The RWTH Aachen University Open Source Speech Recognition Toolkit. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Waikoloa, HI, USA, December 2011.
C. Plahl, R. Schlüter, and H. Ney. Cross-lingual Portability of Chinese and English Neural Network Features for French and German LVCSR. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 371-376, Waikoloa, HI, USA, December 2011.
M. A. Tahir, R. Schlüter, and H. Ney. Discriminative Splitting of Gaussian/Log-Linear Mixture HMMs for Speech Recognition. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 7-11, Waikoloa, HI, USA, December 2011.
M. Sundermeyer, R. Schlüter, and H. Ney. On the Estimation of Discount Parameters for Language Model Smoothing. In Interspeech, pages 1433-1436, Florence, Italy, August 2011.
M. Nußbaum-Thom, A. El-Desoky Mousa, R. Schlüter, and H. Ney. Compound Word Recombination for German LVCSR. In Interspeech, pages 1449-1452, Florence, Italy, August 2011.
Z. Tüske, C. Plahl, and R. Schlüter. A study on speaker normalized MLP features in LVCSR. In Interspeech, pages 1089-1092, Florence, Italy, August 2011.
C. Plahl, R. Schlüter, and H. Ney. Improved Acoustic Feature Combination for LVCSR by Neural Networks. In Interspeech, pages 1237-1240, Florence, Italy, August 2011.
M. A. Tahir, R. Schlüter, and H. Ney. Log-linear Optimization of Second-order Polynomial Features with Subsequent Dimension Reduction for Speech Recognition. In Interspeech, pages 1705-1708, Florence, Italy, August 2011.
M. A. B. Shaik, A. El-Desoky Mousa, R. Schlüter, and H. Ney. Hybrid Language Models Using Mixed Types of Sub-lexical Units for Open Vocabulary German LVCSR. In Interspeech, pages 1441-1444, Florence, Italy, August 2011.
A. El-Desoky Mousa, M. A. B. Shaik, R. Schlüter, and H. Ney. Morpheme Based Factored Language Models for German LVCSR. In Interspeech, pages 1445-1448, Florence, Italy, August 2011.
D. Nolden, R. Schlüter, and H. Ney. Acoustic Look-Ahead for More Efficient Decoding in LVCSR. In Interspeech, pages 893-896, Florence, Italy, August 2011.
G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schlüter. Equivalence of Generative and Log-Linear Models. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 19, number 5, pages 1138-1148, July 2011.
R. Schlüter, M. Nußbaum-Thom, and H. Ney. On the Relationship between Bayes Risk and Word Error Rate in ASR. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 19, number 5, pages 1103-1112, July 2011.
S. Wiesler, A. Richard, Y. Kubo, R. Schlüter, and H. Ney. Feature Selection for Log-linear Acoustic Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5324-5327, Prague, Czech Republic, May 2011.
M. A. B. Shaik, A. El-Desoky Mousa, R. Schlüter, and H. Ney. Using Morpheme and Syllable Based Sub-words for Polish LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4680-4683, Prague, Czech Republic, May 2011.
D. Rybach, R. Schlüter, and H. Ney. A Comparative Analysis of Dynamic Network Decoding. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5184-5187, Prague, Czech Republic, May 2011.
M. Sundermeyer, M. Nußbaum-Thom, S. Wiesler, C. Plahl, A. El-Desoky Mousa, S. Hahn, D. Nolden, R. Schlüter, and H. Ney. The RWTH 2010 Quaero ASR Evaluation System for English, French, and German. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 2212-2215, Prague, Czech Republic, May 2011.
Z. Tüske, P. Golik, R. Schlüter, and F. R. Drepper. Non-stationary feature extraction for automatic speech recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5204-5207, Prague, Czech Republic, May 2011.
Y. Kubo, S. Wiesler, R. Schlüter, H. Ney, S. Watanabe, A. Nakamura, and T. Kobayashi. Subspace Pursuit Method for Kernel-log-linear Models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4500-4503, Prague, Czech Republic, May 2011.
D. Nolden, H. Ney, and R. Schlüter. Exploiting Sparseness of Backing-Off Language Models for Efficient Look-Ahead in LVCSR. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4684-4687, Prague, Czech Republic, May 2011.
C. Plahl, B. Hoffmeister, R. Schlueter, and H. Ney. Multiple Feature Speech Recognition. In J. Olive, C. Christianson, and J. McCary (Eds.): Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation (GALE) Chp. 3.2.4, pages 415-421, Springer, New York, NY, USA, 2011.
D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlueter, K. Kirchhoff, A. Faria, and N. Morgan. Development of SRI/NIGHTINGALE Arabic ASR System. In J. Olive, C. Christianson, and J. McCary (Eds.): Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation (GALE) Chp. 3.6.7, pages 561-568, Springer, New York, NY, USA, 2011.
G. Heigold, T. Deselaers, R. Schlüter, H. Ney, G. Saon, and D. Povey. Integration of large margin concept into standard discriminative training criteria. In J. Olive, C. Christianson, and J. McCary (Eds.): Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation (GALE) Chp. 3.3.4, pages 445-452, Springer, New York, NY, USA, 2011.

2010

G. Heigold, P. Dreuw, S. Hahn, R. Schlüter, and H. Ney. Margin-Based Discriminative Training for String Recognition. IEEE Journal of Selected Topics in Signal Processing - Statistical Learning Methods for Speech and Language Processing, volume 4, number 6, pages 917-925, Aachen, Germany, December 2010.
A. El-Desoky Mousa, M. A. B. Shaik, R. Schlüter, and H. Ney. Sub-lexical Language Models for German LVCSR. In IEEE Workshop on Spoken Language Technology (SLT), pages 159-164, Berkeley, CA, USA, December 2010.
J. Lööf, D. Falavigna, R. Schlüter, R. Gretter, D. Giuliani, and H. Ney. Evaluation of Automatic Transcription Systems for the Judicial Domain. In IEEE Workshop on Spoken Language Technology (SLT), pages 194-199, Berkeley, CA, USA, December 2010.
R. Schlüter, M. Nußbaum-Thom, and H. Ney. Simulation of Fixed Length Word String Probability Distributions. Internal Report, Lehrstuhl für Informatik 6, RWTH Aachen University, pages 1-12, December 2010. Corrected version (April 19, 2012).
N. Parihar, R. Schlüter, D. Rybach, and E. A. Hansen. Parallel Lexical-tree Based LVCSR on Multi-core Processors. In Interspeech, pages 1485-1488, Makuhari, Japan, September 2010.
C. Plahl, R. Schlüter, and H. Ney. Hierarchical Bottle Neck Features for LVCSR. In Interspeech, pages 1197-1200, Makuhari, Japan, September 2010.
S. Wiesler, G. Heigold, M. Nußbaum-Thom, R. Schlüter, and H. Ney. A Discriminative Splitting Criterion for Phonetic Decision Trees. In Interspeech, pages 54-57, Makuhari, Japan, September 2010. Shortlisted for the Best Student Paper Award.
M. Nußbaum-Thom, S. Wiesler, M. Sundermeyer, C. Plahl, S. Hahn, R. Schlüter, and H. Ney. The RWTH 2009 Quaero ASR Evaluation System for English and German. In Interspeech, pages 1517-1520, Makuhari, Japan, September 2010.
J. Lööf, R. Schlüter, and H. Ney. Discriminative Adaptation for Log-linear Acoustic Models. In Interspeech, pages 1648-1651, Makuhari, Japan, September 2010.
R. Schlüter, M. Nußbaum-Thom, and H. Ney. On the Relation of Bayes Risk, Word Error, and Word Posteriors in ASR. In Interspeech, pages 230-233, Makuhari, Japan, September 2010.
D. R. Sanand, R. Schlüter, and H. Ney. Revisiting VTLN Using Linear Transformation on Conventional MFCC. In Interspeech, pages 538-541, Makuhari, Japan, September 2010.
D. Nolden, H. Ney, and R. Schlüter. Time Conditioned Search in Automatic Speech Recognition Reconsidered. In Interspeech, pages 234-237, Makuhari, Japan, September 2010.
A. El-Desoky Mousa, R. Schlüter, and H. Ney. A Hybrid Morphologically Decomposed Factored Language Models for Arabic LVCSR. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT), pages 701-704, Los Angeles, CA, USA, June 2010.
G. Heigold, S. Wiesler, M. Nussbaum, P. Lehnen, R. Schlüter, and H. Ney. Discriminative HMMs, Log-Linear Models, and CRFs: What is the Difference?. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5546-5549, Dallas, TX, USA, March 2010.

2009

M. A. Tahir, G. Heigold, C. Plahl, R. Schlüter, and H. Ney. Log-Linear Framework for Linear Feature Transformations in Speech Recognition. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 76-81, Merano, Italy, December 2009.
S. Wiesler, M. Nußbaum, G. Heigold, R. Schlüter, and H. Ney. Investigations on Features for Log-Linear Acoustic Models in Continuous Speech Recognition. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 52-57, Merano, Italy, December 2009. Best Student Paper Award.
D. Rybach, C. Gollan, G. Heigold, B. Hoffmeister, J. Lööf, R. Schlüter, and H. Ney. The RWTH Aachen University Open Source Speech Recognition System. In Interspeech, pages 2111-2114, Brighton, UK, September 2009.
N. Parihar, R. Schlüter, D. Rybach, and E. A. Hansen. Parallel Fast Likelihood Computation for LVCSR using Mixture Decomposition. In Interspeech, pages 3047-3050, Brighton, UK, September 2009.
G. Heigold, D. Rybach, R. Schlüter, and H. Ney. Investigations on Convex Optimization Using Log-Linear HMMs for Digit String Recognition. In Interspeech, pages 216-219, Brighton, UK, September 2009.
B. Hoffmeister, R. Liang, R. Schlüter, and H. Ney. Log-linear Model Combination with Word-dependent Scaling Factors. In Interspeech, pages 248-251, Brighton, UK, September 2009.
B. Hoffmeister, R. Schlüter, and H. Ney. Bayes Risk Approximations Using Time Overlap with an Application to System Combination. In Interspeech, pages 1191-1194, Brighton, UK, September 2009.
A. El-Desoky Mousa, C. Gollan, D. Rybach, R. Schlüter, and H. Ney. Investigating the Use of Morphological Decomposition and Diacritization for Improving Arabic LVCSR. In Interspeech, pages 2679-2682, Brighton, UK, September 2009.
C. Plahl, B. Hoffmeister, G. Heigold, J. Lööf, R. Schlüter, and H. Ney. Development of the GALE 2008 Mandarin LVCSR System. In Interspeech, pages 2107-2110, Brighton, UK, September 2009.
D. Falavigna, D. Giuliani, R. Gretter, J. Lööf, C. Gollan, R. Schlüter, and H. Ney. Automatic Transcription of Courtroom Recordings in the JUMAS project. In 2nd International Conference on ICT Solutions for Justice (ICT4Justice), pages 65-72, Skopje, Macedonia, September 2009.
D. Rybach, C. Gollan, R. Schlüter, and H. Ney. Audio Segmentation for Speech Recognition using Segment Features. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4197-4200, Taipei, Taiwan, April 2009.
G. Heigold, R. Schlüter, and H. Ney. Modified MPE/MMI in a Transducer-Based Framework. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 3749-3752, Taipei, Taiwan, April 2009.

2008

F. R. Drepper, and R. Schlüter. Non-Stationary, Filter-Stable Acoustic Objects as Atoms of Voiced Speech. In 8. ITG-Fachtagung Sprachkommunikation (ITG 2008), pages 1-4, Aachen, Germany, October 2008.
R. Schlüter, C. Gollan, S. Hahn, G. Heigold, B. Hoffmeister, J. Lööf, C. Plahl, D. Rybach, and H. Ney. Development of Large Vocabulary ASR Systems for Mandarin and Arabic. In ITG-Fachtagung Sprachkommunikation (Sprachkommunikation), Aachen, Germany, October 2008.
D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlüter, K. Kirchhoff, A. Faria, and N. Morgan. Development of the SRI/Nightingale Arabic ASR system. In Interspeech, pages 1437-1440, Brisbane, Australia, September 2008.
G. Heigold, P. Lehnen, R. Schlüter, and H. Ney. On the equivalence of Gaussian and log-linear HMMs. In Interspeech, pages 273-276, Brisbane, Australia, September 2008. ISCA best student paper award.
C. Plahl, B. Hoffmeister, M. Hwang, D. Lu, G. Heigold, J. Lööf, R. Schlüter, and H. Ney. Recent Improvements of the RWTH GALE Mandarin LVCSR System. In Interspeech, pages 2426-2429, Brisbane, Australia, September 2008.
B. Hoffmeister, R. Schlüter, and H. Ney. iCNC and iROVER: The Limits of Improving System Combination with Classification?. In Interspeech, pages 232-235, Brisbane, Australia, September 2008.
G. Heigold, T. Deselaers, R. Schlüter, and H. Ney. Modified MMI/MPE: A Direct Evaluation of the Margin in Speech Recognition. In International Conference on Machine Learning (ICML), pages 384-391, Helsinki, Finland, July 2008. A typo from the original publication was corrected (marked in red).
G. Heigold, T. Deselaers, R. Schlüter, and H. Ney. A GIS-like training algorithm for log-linear models with hidden variables. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4045-4048, Las Vegas, NV, USA, April 2008.
F. R. Drepper, and R. Schlüter. Non-Stationary Acoustic Objects as Atoms of Voiced Speech. In 34. Jahrestagung für Akustik der Deutschen Gesellschaft für Akustik (DAGA), Dresden, Germany, March 2008. [corrected version: Eq. (5)].

2007

D. Rybach, S. Hahn, C. Gollan, R. Schlüter, and H. Ney. Advances in Arabic Broadcast News Transcription at RWTH. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 449-454, Kyoto, Japan, December 2007.
B. Hoffmeister, C. Plahl, P. Fritz, G. Heigold, J. Lööf, R. Schlüter, and H. Ney. Development of the 2007 RWTH Mandarin LVCSR System. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 455-460, Kyoto, Japan, December 2007.
G. Heigold, R. Schlüter, and H. Ney. On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields. In Interspeech, pages 1721-1724, Antwerp, Belgium, August 2007.
J. Lööf, C. Gollan, S. Hahn, G. Heigold, B. Hoffmeister, C. Plahl, D. Rybach, R. Schlüter, and H. Ney. The RWTH 2007 TC-STAR Evaluation System for European English and Spanish. In Interspeech, pages 2145-2148, Antwerp, Belgium, August 2007.
F. Valente, J. Vepa, C. Plahl, C. Gollan, H. Hermansky, and R. Schlüter. Hierarchical Neural Networks Feature Extraction for LVCSR system. In Interspeech, pages 42-45, Antwerp, Belgium, August 2007.
J. Lööf, R. Schlüter, and H. Ney. Efficient Estimation of Speaker-specific Projecting Feature Transforms. In Interspeech, pages 1557-1560, Antwerp, Belgium, August 2007.
C. Gollan, S. Hahn, R. Schlüter, and H. Ney. An Improved Method for Unsupervised Training of LVCSR Systems. In Interspeech, pages 2101-2104, Antwerp, Belgium, August 2007.
A. Zolnay, D. Kocharov, R. Schlüter, and H. Ney. Using multiple acoustic feature sets for speech recognition. Speech Communication, volume 49, number 6, pages 514-525, June 2007.
D. Hillard, B. Hoffmeister, M. Ostendorf, R. Schlüter, and H. Ney. iROVER: Improving System Combination with Classification. In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers, pages 65-68, Rochester, NY, USA, April 2007.
R. Schlüter, I. Bezrukov, H. Wagner, and H. Ney. Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 649-652, Honolulu, HI, USA, April 2007.
B. Hoffmeister, D. Hillard, S. Hahn, R. Schlüter, M. Ostendorf, and H. Ney. Cross-Site and Intra-Site ASR System Combination: Comparisons on Lattice and 1-Best Methods. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 1145-1148, Honolulu, HI, USA, April 2007.

2006

J. Lööf, M. Bisani, C. Gollan, G. Heigold, B. Hoffmeister, C. Plahl, R. Schlüter, and H. Ney. The 2006 RWTH Parliamentary Speeches Transcription System. In Interspeech, pages 105-108, Pittsburgh, PA, USA, September 2006.
B. Hoffmeister, T. Klein, R. Schlüter, and H. Ney. Frame Based System Combination and a Comparison with Weighted ROVER and CNC. In Interspeech, pages 537-540, Pittsburgh, PA, USA, September 2006.
R. Schlüter, A. Zolnay, and H. Ney. Feature Combination using Linear Discriminant Analysis and its Pitfalls. In International Conference on Spoken Language Processing (ICSLP), pages 345-348, September 2006.
J. Lööf, M. Bisani, C. Gollan, G. Heigold, B. Hoffmeister, C. Plahl, R. Schlüter, and H. Ney. The 2006 RWTH Parliamentary Speeches Transcription System. In TC-STAR Workshop on Speech-to-Speech Translation, pages 133-138, Barcelona, Spain, June 2006.

2005

E. Matusov, H. Ney, and R. Schlüter. Phrase-based Translation of Speech Recognizer Word Lattices Using Loglinear Model Combination. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 110-115, San Juan, Puerto Rico, November 2005.
G. Heigold, W. Macherey, R. Schlüter, and H. Ney. Minimum Exact Word Error Training. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pages 186-190, San Juan, Puerto Rico, November 2005.
D. Kocharov, A. Zolnay, R. Schlüter, and H. Ney. Articulatory Motivated Acoustic Features for Speech Recognition. In Interspeech, pages 1101-1104, Lisboa, Portugal, September 2005.
W. Macherey, L. Haferkamp, R. Schlüter, and H. Ney. Investigations on Error Minimizing Training Criteria for Discriminative Training in Automatic Speech Recognition. In Interspeech, pages 2133-2136, Lisboa, Portugal, September 2005.
R. Schlüter, T. Scharrenbach, V. Steinbiss, and H. Ney. Bayes Risk Minimization using Metric Loss Functions (corrected version, corrections in red). In Interspeech, pages 1449-1452, Lisboa, Portugal, September 2005.
C. Gollan, M. Bisani, S. Kanthak, R. Schlüter, and H. Ney. Cross Domain Automatic Transcription on the TC-STAR EPPS Corpus. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), volume 1, pages 825-828, Philadelphia, PA, March 2005.
A. Zolnay, R. Schlüter, and H. Ney. Acoustic Feature Combination for Robust Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), volume 1, pages 457-460, Philadelphia, PA, March 2005.

2004

W. Macherey, R. Schlüter, and H. Ney. Discriminative Training with Tied Covariance Matrices. In Interspeech, volume 1, pages 681-684, Jeju Island, Korea, October 2004.

2003

A. Zolnay, R. Schlüter, and H. Ney. Extraction Methods of Voicing Feature for Robust Speech Recognition. In European Conference on Speech Communication and Technology (EUROSPEECH), volume 1, pages 497-500, Geneva, Switzerland, September 2003.

2002

A. Zolnay, R. Schlüter, and H. Ney. Robust Speech Recognition using a Voiced-Unvoiced Feature. In International Conference on Spoken Language Processing (ICSLP), volume 2, pages 1065-1068, Denver, CO, USA, September 2002.

2001

M. Pitz, S. Molau, R. Schlüter, and H. Ney. Vocal Tract Normalization Equals Linear Transformation in Cepstral Space. In European Conference on Speech Communication and Technology (EUROSPEECH), volume 4, pages 2653-2656, Aalborg, Denmark, September 2001.
R. Schlüter, W. Macherey, B. Müller, and H. Ney. Comparison of Discriminative Training Criteria and Optimization Methods for Speech Recognition. Speech Communication, volume 34, pages 287-310, May 2001. EURASIP Best Paper Award.
F. Wessel, R. Schlüter, and H. Ney. Explicit Word Error Minimization using Word Hypothesis Posterior Probabilities. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 33-36, Salt Lake City, UT, USA, May 2001.
R. Schlüter and H. Ney. Using Phase Spectrum Information for Improved Speech Recognition Performance. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 133-136, Salt Lake City, UT, USA, May 2001.
R. Schlüter and H. Ney. Model-based MCE Bound to the True Bayes' Error. IEEE Signal Processing Letters, volume 8, number 5, pages 131-133, May 2001.
S. Molau, M. Pitz, R. Schlüter, and H. Ney. Computing Mel-Frequency Cepstral Coefficients On The Power Spectrum. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 73-76, Salt Lake City, UT, USA, May 2001.
F. Wessel, R. Schlüter, K. Macherey, and H. Ney. Confidence Measures for Large Vocabulary Continuous Speech Recognition. IEEE Transactions on Speech and Audio Processing, volume 9, number 3, pages 288-298, March 2001.

2000

R. Schlüter, F. Wessel, and H. Ney. Speech Recognition using Context Conditional Word Posterior Probabilities. In International Conference on Spoken Language Processing (ICSLP), pages 923-926, Beijing, China, October 2000.
S. Kanthak, S. Molau, A. Sixtus, R. Schlüter, and H. Ney. The RWTH Large Vocabulary Speech Recognition System for Spontaneous Speech. In Proceedings of the Konvens 2000, pages 249-254, Ilmenau, Germany, October 2000.
R. Schlüter. Investigations on Discriminative Training Criteria. PhD Thesis, Aachen, Germany, September 2000.
S. Kanthak, A. Sixtus, S. Molau, R. Schlüter, and H. Ney. Fast Search for Large Vocabulary Speech Recognition. In Wolfgang Wahlster (ed.): Verbmobil: Foundations of Speech-to-Speech Translation, chapter "From Speech Input to Augmented Word Lattices", pages 63-78, Springer Verlag, Berlin, Heidelberg, New York, July 2000.
A. Sixtus, S. Molau, S. Kanthak, R. Schlüter, and H. Ney. Recent Improvements of the RWTH Large Vocabulary Speech Recognition System on Spontaneous Speech. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 1671-1674, Istanbul, Turkey, June 2000.
F. Wessel, R. Schlüter, and H. Ney. Using Posterior Word Probabilities for Improved Speech Recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 1587-1590, Istanbul, Turkey, June 2000.

1999

R. Schlüter, B. Müller, F. Wessel, and H. Ney. Interdependence of Language Models and Discriminative Training. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), volume 1, pages 119-122, Keystone, CO, December 1999.
J. Dahmen, R. Schlüter, and H. Ney. Discriminative Training of Gaussian Mixture Densities for Image Object Recognition. In Deutsche Arbeitsgemeinschaft für Mustererkennung Symposium (DAGM), Informatik aktuell, pages 205-212, Bonn, Germany, September 1999.
R. Schlüter, W. Macherey, B. Müller, and H. Ney. A Combined Maximum Mutual Information and Maximum Likelihood Approach for Mixture Density Splitting. In European Conference on Speech Communication and Technology (EUROSPEECH), volume 4, pages 1715-1718, Budapest, Hungary, September 1999.
M. Pitz, S. Molau, R. Schlüter, and H. Ney. Automatic Transcription Verification of Broadcast News and Similar Speech Corpora. In Proc. 1999 DARPA Broadcast News Workshop, pages 157-159, Herndon, VA, USA, February 1999.

1998

F. Wessel, K. Macherey, and R. Schlüter. Using Word Probabilities as Confidence Measures. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 225-228, Seattle, WA, USA, May 1998.
R. Schlüter and W. Macherey. Comparison of Discriminative Training Criteria. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 493-496, Seattle, WA, USA, May 1998.

1997

R. Schlüter, W. Macherey, S. Kanthak, H. Ney, and L. Welling. Comparison of Optimization Methods for Discriminative Training Criteria. In European Conference on Speech Communication and Technology (EUROSPEECH), volume 1, pages 15-18, Rhodes, Greece, September 1997.
R. Schlüter, H. Ney, and L. Welling. Diskriminatives Training für Spracherkennung bei kleinem Wortschatz. In Proc. 9. Kolloquium Signaltheorie, volume 9, pages 297-300, Aachen, Germany, March 1997.