- Bachelor students: Einführung in das wissenschaftliche Arbeiten (Proseminar)
- Master students: Bachelor degree
- Diploma students: Vordiplom
- Attendance of the lectures Pattern Recognition and Neural Networks, Speech Recognition or Statistical Methods in Natural Language Processing, or evidence of equivalent knowledge.
- For successful participants of the above lectures, the possibility of a seminar
talk is guaranteed.

initial proposals will be accepted up until the start of the term (October 10, 2011) at the Lehrstuhl für Informatik 6 office or by the relevant supervisor. At this time participants must arrange an appointment with the relevant supervisor. Revised proposals will be accepted up until two weeks after the start of the term.__Proposals:__must be submitted at least 1 month prior to the trial presentation date to either the Lehrstuhl für Informatik 6 office or the relevant supervisor.__Article:__

must be submitted at least 1 week prior to the trial presentation date to either the Lehrstuhl für Informatik 6 office or the relevant supervisor.__Presentation slides:__at least 2 weeks prior to the actual presentation date; refer to the section on topics.__Trial presentations:__the exact dates and plan for the presentation block (expected to be around January 2012) will be arranged and announced for the individual topics.__Seminar presentations:__must be submitted at the latest 4 weeks after the presentation date to either the Lehrstuhl für Informatik 6 office or the relevant supervisor.__Final (possibly corrected) articles and presentation slides:__in order to receive a certificate participants must attend all presentation sessions.__Compulsory attendance:__The Computer Science Department of RWTH Aachen University has adopted ethical guidelines for the authoring of academic work such as seminar reports. Each student has to comply with these guidelines. In this regard, you, as a seminar attendant, have to sign a declaration of compliance, in which you assert that your work complies with the guidelines, that all references used are properly cited, and that the report was done autonomously by yourself. We ask you do download the guidelines and submit the declaration together with your seminar report and talk to your supervisor. You also find a German version of the guidelines and a German version of the declaration you may use as well.__Ethical Guidelines:__

*Note: failure to meet deadlines, absence without
permission from compulsory sessions (presentations and preliminary
meeting as announced by email to each participating student), or
dropping out of the seminar after more than 3 weeks after the
preliminary meeting/topic distribution
results in the grade 5.0/not appeared. *

In general, selected topics from the following general areas of Human Language Technology and Pattern Recognition will be offered:

- Automatic Speech Recognition;
- Machine Translation;
- Pattern Recognition.

**The IBM Translation Models**(Middelburg; Supervisors: Christoph Schmidt)

References:

- Chapter 4 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, and Robert L. Mercer: "The mathematics
of statistical machine translation: Parameter estimation,"
*Computational Linguistics, 19(2):263-311,*1993. - Franz Josef Och and Hermann Ney: "Improved statistical alignment models,"
*Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics,*2000.

- Chapter 4 of P. Koehn: "Statistical Machine Translation," textbook,
**Discriminative Alignment for Machine Translation**(NN; Supervisors: Christoph Schmidt)

References:

- Ben Taskar, Simon Lacoste-Julien, and Dan Klein:
"A Discriminative Matching Approach to Word Alignment,"
pp. 73--80,
*Proc. Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP),*Vancouver, Canada, October 2005. - Abraham Ittycheriah and Salim Roukos:
"A Maximum Entropy Word Aligner for Arabic-English Machine Translation,"
pp. 89-96,
*Proc. Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP),*Vancouver, BC, Canada, October 2005. - Phil Blunsom and Trevor Cohn:
"Discriminative Word Alignment with Conditional Random Fields,"
pp. 65-72,
*Proc. 21*Sydney, Australia, July 2006.^{st}International Conference on Computational Linguistics and 44^{th}Annual Meeting of the Association for Computational Linguistics,

- Ben Taskar, Simon Lacoste-Julien, and Dan Klein:
"A Discriminative Matching Approach to Word Alignment,"
pp. 73--80,
**Decoding for Phrase-Based Statistical Machine Translation**(Stadler; Supervisor: Saab Mansour)

References:

- Chapter 6 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - P. Koehn: "Pharaoh: a Beam Search Decoder for Phrase-Based Statistical Machine Translation Models,"
*Proc. 6th Conference of the Association for Machine Translation in the Americas (AMTA),*Washington, DC, September 2004. - R. C. Moore, C. Quirk: "Faster Beam-Search Decoding
for Phrasal Statistical Machine Translation,"
*Proc. MT Summit XI,*Copenhagen, Denmark, September 2007. - R. Zens, H. Ney: "Improvements in Dynamic Programming Beam Search for Phrase-based Statistical Machine Translation,"
*International Workshop on Spoken Language Translation (IWSLT),*Honolulu, Hawaii, October 2008.

- Chapter 6 of P. Koehn: "Statistical Machine Translation," textbook,
**Machine Translation Evaluation**(Schmitz; Supervisor: Markus Freitag)

References:

- Chapter 8 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - K. Papineni, S. Roukus, T. Ward and W. Zhu: "BLEU: a Method for Automatic Evaluation of Machine Translation," Proc. ACL, pp. 311-318, Philadelphia, PA, July 2002.
- Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, John Makhoul: "A Study of Translation Edit Rate with Targeted Human Annotation," In
*Proceedings AMTA,*August 2006, pp. 223-231. - A. Lavie, A. Agarwal: "METEOR: An Automatic Metric for MT Evaluation with High Levels of Correlation with Human Judgments,"
*Proc. 2nd Workshop on MT*, ACL, pp.228-231, Prague, Czech Republic, June 2007.

- Chapter 8 of P. Koehn: "Statistical Machine Translation," textbook,
**Parameter Tuning for Statistical Machine Translation**(NN; Supervisor: Markus Freitag)

References:

- Chapter 9 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - J. A. Nelder, R. Mead: "The Downhill Simplex Method,"
*Computer Journal,*7:308, 1965. - F. J. Och: "Minimum Error Rate Training for Statistical Machine Translation,"
*Proc. of the 41st Annual Meeting of the Association for Computational Linguistics (ACL),*pp. 160-167, Sapporo, Japan, July 2003. - D. Chiang, Y. Marton, P. Resnik: "Online large-margin
training of syntactic and structural translation features,"
*Proc. Empirical Methods in Natural Language Processing (EMNLP),*pp. 224-233, Honolulu, Hawaii, October 2009. - D. Chiang, K. Knight, and W. Wang:
"11,001 New Features for Statistical Machine Translation,"
*Proc. NAACL HLT,*pp. 218-226, Boulder, Colorado, May 2009.

- Chapter 9 of P. Koehn: "Statistical Machine Translation," textbook,
**Deciphering Foreign Language**(Schamper; Supervisor: Malte Nuhn)

References:

- C. E. Shannon: "Communication Theory of Secrecy Systems,"
*Bell System Technical Journal*1949;28(4):656-715. Available at: http://netlab.cs.ucla.edu/wiki/files/shannon1949.pdf. - K. Knight, A. Nair, N. Rathod, K. Yamada: "Unsupervised analysis for decipherment problems,"
*Proceedings of the COLING/ACL*on Main conference poster sessions, 2006:499-506. - K. Knight: "Attacking decipherment problems optimally with low-order n-gram models,"
*EMNLP 2008 Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing,*Honolulu, Hawaii, USA, October 2008. - S. Ravi, K. Knight: "Deciphering Foreign Language"

- C. E. Shannon: "Communication Theory of Secrecy Systems,"
**Factored Machine Translation**(Prüne; Supervisor: Stephan Peitz)

References:

- Chapter 10 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - P. Koehn, H. Hoang: "Factored Translation Models,"
*EMNLP,*2007. - O. Bojar: "English-to-Czech Factored Machine Translation,"
*Proceedings of the Second Workshop on Statistical Machine Translation,*pp. 232-239, Prague, Czech Republic, 2007. - A. Birch, M. Osborne, and P. Koehn: "CCG Supertags in Factored Statistical Machine Translation,"
*ACL Workshop on Statistical Machine Translation,*2007. - E. Avramidis, P. Koehn: "Enriching Morphologically Poor Languages for Statistical Machine Translation,"
*ACL,*2008. - P. Koehn, M. Federico, W. Shen, N. Bertoldi, O. Bojar, C. Callison-Burch, B. Cowan, C. Dyer, H. Hoang, R. Zens, A. Constantin, C. Moran, and E. Herbst:
"Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Confusion Network Decoding,"
*Technical report, Johns Hopkins University, Center for Speech and Language Processing,*August 2006.

- Chapter 10 of P. Koehn: "Statistical Machine Translation," textbook,
(NN; Supervisor: Jörn Wübker)*ADVANCED:*Phrase Training in Statistical Machine Translation

References:

- J. DeNero, D. Gillick, J. Zhang, D. Klein: "Why Generative
Phrase Models Underperform Surface Heuristics",
*Proc. Workshop on Statistical Machine Translation (WMT06),*pp. 31-38, New York City, USA, June 2006. - D. Marcu, W. Wong: "A phrase-based, joint probability model
for statistical machine translation",
*Proc. Conference on Empirical Methods in Natural Language Processing (EMNLP),*Philadelphia, PA, USA, July 2002. - A. Birch, C. Callison-Burch, M. Osborne, P. Koehn:
"Constraining the phrase-based, joint probability statistical
translation model",
*Proc. Workshop on Statistical Machine Translation (WMT06),*pp. 154-157, New York City, USA, June 2006. - P. Koehn, F. Och, D. Marcu: "Statistical phrase-based translation",
*Proc. North American Chapter of the Association for Computational Linguistics (ACL),*pp. 761-768, Sydney, Australia, 2003.

- J. DeNero, D. Gillick, J. Zhang, D. Klein: "Why Generative
Phrase Models Underperform Surface Heuristics",
(NN; Supervisors: Jörn Wübker)*ADVANCED:*Discriminative Training for Machine Translation

References:

- Chapter 9 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - Abraham Ittycheriah and Salim Roukos:
"Direct Translation Model,"
pp. 57-64,
*Proc. Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT),*Rochester, NY, April 2007. - Percy Liang, Alexandre Bouchard-Cote, Dan Klein, and Ben Taskar:
"An End-to-End Discriminative Approach to Machine Translation,"
pp. 761-768,
*Proc. Joint Conf. of the Int. Committee on Computational Linguistics and the Association for Computational Linguistics (Coling-ACL),*Sydney, Australia, July 2006. - D. Chiang, Y. Marton, Ph. Resnik: "Online large-margin
training of syntactic and structural translation features,"
in
*Proc. Empirical Methods in Natural Language Processing (EMNLP),*pp. 224-233, Honolulu, Hawaii, October 2009.

- Chapter 9 of P. Koehn: "Statistical Machine Translation," textbook,
(NN; Supervisors: Matthias Huck)*ADVANCED:*Reordering for Phrase-Based Statistical Machine Translation

References:

- Chapter 5.4 and 6 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - R. Zens, H. Ney, T. Watanabe, and E. Sumita: "Reordering Constraints for Phrase-Based Statistical Machine Translation,"
*International Conference on Computational Linguistics (CoLing),*pages 205-211, Geneva, Switzerland, August 2004. - Christoph Tillmann: "A unigram orientation model for statistical machine translation,"
*Proceedings of HLT-NAACL 2004:*Short Papers, pages 101-104, 2004. - Yaser Al-Onaizan and Kishore Papineni: "Distortion models for statistical machine translation,"
*Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL (COLING/ACL),*pages 529-536, Morristown, NJ, USA, 2006. - Michel Galley, Christopher Manning: "A Simple and Effective Hierarchical Phrase Reordering Model,"
*Proceedings of EMNLP,*2006.

- Chapter 5.4 and 6 of P. Koehn: "Statistical Machine Translation," textbook,
(NN; Supervisors: Matthias Huck)*ADVANCED:*Hierarchical Phrase-Based Machine Translation

References:

- David Chiang: "Hierarchical Phrase-Based Translation,"
*Computational Linguistics,*33(2):201-228, June 2007. - Liang Huang, David Chiang: "Forest rescoring: faster decoding with integrated language models,"
*Proc. ACL,*2007. - A. de Gispert, G. Iglesias, G. Blackwood, E.R. Banga and W. Byrne:
"Hierarchical Phrase-based Translation with Weighted Finite State Transducers and Shallow-N Grammars,"
*Computational Linguistics,*Volume 36, Number 3, pp. 505-533, 2010.

- David Chiang: "Hierarchical Phrase-Based Translation,"
(NN; Supervisors: Stephan Peitz)*ADVANCED:*Syntax-Oriented Hierarchical Translation

References:

- Chapter 11 of P. Koehn: "Statistical Machine Translation," textbook,
*Cambridge University Press,*January 2010. - Ashish Venugopal, Andreas Zollmann, Noah A. Smith, and Stephan Vogel: "Preference Grammars: Softening Syntactic Constraints to
Improve Statistical Machine Translation,"
*Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics,*Boulder, Colorado, June 2009. - Andreas Zollmann and Stephan Vogel: "A Word-Class Approach to Labeling PSCFG Rules for Machine Translation,"
*Proc. of ACL,*Portland, Oregon, 2011. - Bing Zhao and Yaser Al-Onaizan: "Generalizing Local and Non-Local Word-Reordering Patterns for Syntax-Based Machine Translation,"
*EMNLP,*2008.

- Chapter 11 of P. Koehn: "Statistical Machine Translation," textbook,

- Online LaTeX-Documentation:

- Document Templates:

- Article Template (164kB), contains the template and all necessary files in tar format (or here 53kB in zip format).
- Presentation Slide Template (4.9MB), a zip file containing the template and all necessary graphics as well as the institute’s style template.

- Guidelines for articles and presentation slides:

- The aim of the seminar for the participants is to learn the following:
- to tackle a topic and to expand knowledge
- to critically analyze the literature

- to hold a presentation

- Take notice of references
to other topics in the seminar and discuss topics with one
another!

- Take care to stay within your
own topic. To this end participants should be aware of the other
topics in the seminar. If applicable, cross-reference
other articles and presentations.

*Important*: As part of the introduction, a slide should outline the most important literature used for the presentation. In addition, the presentation should clearly indicate which literature the particular elements of the presentation refer to.

- Take notice of references
to other topics in the seminar and discuss topics with one
another!

- Participants are expected to seek out additional literature on their topic. Assistance with the literature search is available at the faculty’s library. Access to literature is naturally also available at the Lehrstuhl für Informatik 6 library.
- Notation/Mathematical
Formulas: consistent, correct notation
is essential. When necessary, differing notation from various
literature sources is to be modified or standardized in order to be
clear and consistent. The
lectures held by the Lehrstuhl für Informatik 6 should provide a
guide as to what appropriate notation should look like.

- Tables must have titles (appearing above the table).
- Figures
must have captions (appearing below the figure).

- In the case that no adequate translation of an English technical term is available, the term should be used unchanged.
- Articles and presentation slides can also be prepared in
English.

- Completeness:
acknowledge all literature and
sources.

- Referencing must conform to the standard described in the article template.
- Examples should be used to illustrate points.
**Examples**should be as complex as necessary but as simple as possible.- Slides should be used as presentation aids and not to replace the role of the presenter; specifically, slides should:
- illustrate important points and relationships;
- remind the audience (and the presenter) of important aspects
and considerations;

- give the audience an overview
of the presentation.

- Slides should not contain chunks of text or complicated sentences; rather they should consist of succinct words and terms.
- Use illustrations where appropriate - a picture says a thousand words!
**Abbreviations**should be defined at the first usage in the manner demonstrated in the following example:

- Take care to stay within your own topic. To this end participants should be aware of the other topics in the seminar. If applicable, cross-reference other articles and presentations.
- Usage of fonts, typefaces and colors in presentation slides must
be consistent and appropriate. Such means should serve to clarify
points or relationships, not be applied needlessly or at random.

- Care should be taken when selecting fonts for presentation slides (also within diagrams) to ensure legibility on a projector even for those seated far from the screen.

Dr. Ralf Schlüter

RWTH Aachen

Lehrstuhl für Informatik 6

Ahornstr. 55

52056 Aachen

Raum 6125b

Telefon: 0241 / 80-21612

E-Mail: schlueter@cs.rwth-aachen.de