LISTEN Project: Hands-Free Voice-Enabled Interface to Web Applications for Smart Home Environments

LISTEN Workshop / Summer School
July 17-19, 2018
Bonn, Germany

Aus listen-summer-school-wiki
Wechseln zu: Navigation, Suche

Description

The EU project LISTEN (http://www.listen-project.eu/) is concerned with various aspects of far-field large-vocabulary speech recognition. As part of this project, we will organize a three-day workshop in Bonn/Germany, 17-19 July 2018 (Tuesday-Thursday).

The workshop intends to bring together experts from academic and industrial research in this fast developing area of speech recognition. The workshop will include regular talks and keynote talks. In total, we expect an audience of about 50 researchers.

Organizers

RWTH Aachen, Aachen, GermanyHermann Ney, Volker Steinbiss
RWTH Aachen, Aachen, Germany
Foundation for Research and Technology - Hellas Athanasios Mouchtaris
Foundation for Research and Technology - Hellas, Heraklion, Greece
European Media Laboratory GmbH (Heidelberg) Volker Fischer
EML, Heidelberg, Germany
CedatEmilio Giannotti
Cedat85, Rome, Italy

Venue

The workshop will take place in the Ameron Hotel Königshof, Adenauerallee 9, 53111 Bonn, Germany, Tel.: +49 228 260 10, Email: info@hotel-koenigshof-bonn.de.
The hotel is located next to the Rhine river; 20 km distance from airport Cologne/Bonn; 1 km from train station Bonn; 30 km from train station Cologne.

(Tentative) Program

  - Tuesday,   July 17:   9:00-18:30 technical session,
  - Wednesday, July 18:   9:00-13:00 technical session,
                         14:00-20:00 social event,
  - Thursday,  July 19:   9:00-18:00 technical session.

Keynotes

Confirmed keynote talks with (tentative) titles (45 minutes + questions/discussions):

Industry:

  • Alex Acero, Apple, USA: Our work on HomePod.
  • Jasha Droppo, Microsoft, USA: The Microsoft 2017 Conversational Speech Recognition System.
  • Erik McDermott, Google, USA: Next generation acoustic models for automatic speech recognition.
  • Björn Hoffmeister, Amazon, USA: Building far-field speech recognition for Amazon Alexa: Challenges and Solutions

Academia:

  • Chin-Hui Lee, Georgia Tech, USA: Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech or Automatic Speech Recognition Systems Already Trained in Clean Conditions.
  • Jon Barker, U Sheffield, UK: The CHiME-5 challenge for far-field conversational speech recognition in domestic environments.
  • Satoshi Nakamura, NAIST, Japan: Towards listening while speaking: speech chain by deep learning.
  • Reinhold Häb-Umbach, U Paderborn, Germany: Neural Network Supported Acoustic Beamforming and Source Separation for ASR.

Please scroll to the end to find the list of regular talks.

Travel

We encourage the workshop participants to arrive on Monday, July 16, and to leave after the end of the last session.

By plane

Several airports can be used to travel to Bonn:

  • Cologne/Bonn (Köln/Bonn, CGN): 23 km to the venue by car, 25 minutes with bus SB60 from CGN to Bonn main station (4 stops) then 1 km walking or taxi
  • Dusseldorf (Düsseldorf, DUS): 83 km by car, about 1h50 with public transport
  • Frankfurt (FRA): 154 km by car, about 1h15 - 2h00 with public transport

if you need assistance with your travel planning, our secretary will be happy to help you (+49 241 80-21601, e-mail sek@i6.informatik.rwth-aachen.de).

Watch out: There are two airports which have names similar to some of the above but lie badly remote in the countryside, notably Frankfurt-Hahn (HHN) and Düsseldorf Weeze (NRN; also Airport Weeze, Verkehrsflughafen Niederrhein). If you end up there, we will have a good laugh at the workshop.

By train

The hotel is in walking distance to Bonn main station (Bonn Hbf). Bonn is typically reached via Cologne (Köln).

Accomodation

The venue, Ameron Hotel Königshof, also provides accomodation for all workshop participants. (If you intend to stay at another place, please inform our secretariat.)

Hotel Information:

  • Room rate: 129.00 EUR per night for a room "smart standard", which is either single room or double room with single use, incl. breakfast buffet.
    (For use with two people, a double room is 149.00 EUR per night incl. breakfast buffet.)
  • Check-in: from 15:00 (3:00 PM). Important note: If you are likely to check in after 18:00 on arrival day, pls. inform the hotel about it.
  • Check-out: until 12:00 (noon)

Booking Information:

  • Unfortunately, the special rate for the rooms allocated for the workshop cannot be booked online, but you need to book directly with the hotel, either by phone call (+49 228 260 10) or by email. Please mention the booking code "LISTEN Summer School" to access the allocated rooms with the above room rate.
  • In case of problems with your hotel booking, please contact our secretaries, who will be pleased to provide help.
  • If you are a keynote speaker, please contact our secretaries, who will take over the booking of your accomodation.

Regular talks from LISTEN consortium

Confirmed regular talks with (tentative) titles (15 minutes + 5 minutes questions/discussions) (not in temporal order and incomplete yet):

  • Enrico Gianotti or et al., Cedat, Italy: Applications of the technology.
  • Wei Zhou, EML, Germany: Acoustic Modelling for the EML Listen Demonstrator.
  • Jozef Ivanécky, EML, Germany: Architecture of the EML Listen Demonstrator.
  • Nikos Stefanakis , FORTH, Greece: Acoustic Beamforming in Front of a Reflective Plane.
  • Tobias Menne, RWTH, Germany: Speaker-Adaptive Beamforming for ASR.
  • Albert Zeyer, RWTH, Germany: Attention-Based ASR.

Regular talks from workshop participants

Confirmed regular talks with (tentative) titles (15 minutes + 5 minutes questions/discussions) (not in temporal order and incomplete yet):

  • A. Giménez and J.A. Silvestre, UPV, Spain Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks.
  • A. Juan, UPV, Valencia, UPV, Spain: Multilingual Videos for Education.
  • Lauréline Pérotin, INRIA Nancy, France: Multichannel RNN-based separation of overlapping speech.
  • Sunit Sivasankaran, INRIA Nancy, France: Keyword based speaker localization: Localizing a single speaker in multi-speaker environment.
  • Gerasimos Potamianos, University of Thessaly & Athena Research and Innovation Center: Far-Field ASR in Greek for Domestic Environment and Child-Robot-Interaction Applications
  • Igor Szoke, U Brno, Czech Republic: Collection of re-transmitted data and impulse responses and remote ASR and speaker verification.
  • Katerina Zmolikova, U Brno, Czech Republic: tbd.
  • Florian Metze, CMU, USA: Grounded Sequence to Sequence Transduction (multi-modal speech recognition).
  • Sanjeev Khudanpur, JHU, USA: tbd.
  • Marcel Katz and Patrick Naylor, Nuance, UK: Robust Statistical Processing of TDOA Estimates for Distant Speaker Diarization
  • Tim Fingscheidt, TU Braunschweig, Germany: Acoustic Model Fusion for Phoneme Recognition According to the Turbo Principle.
  • Dorothea Kolossa, U Bochum, Germany: Exploiting Structures of Temporal Causality for Robust Speaker Localization in Reverberant Environments
  • Mehdi Zohourian, U Bochum, Germany: Source Localization and Separation for Binaural Hearing Aids
  • Markus Müller, KIT, Germany: tbd.
  • Drude, U Paderborn, Germany: tbd.
  • Mathew Magimai-Doss, IDIAP, Switzerland: Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: Current and Future Perspectives
  • Pranay Dighe, IDIAP, Switzerland: Improving Far-field ASR Using Low-rank and Sparse Models
  • Maurizio Omologo, FBK-IRST, Italy: Realistic Impulse Responses for Distant-Speech Recognition

Leisure activities


The Culture Trip (in English)
Kunstmuseum (art museum) Bonn (in English)
Museum overview on wikipedia (in English)
Traveling Tips of Bonn (in English)
German History Museum (in English)
Events calendar Bonn (in German)
Museumsmeile Bonn (information on the so called "museum mile", a conglomeration of several museums, in German)

Contact


For organizational issues, especially in case of questions w.r.t. travel and accommodation/hotel booking, please contact our secretariat:

Anna Eva Andersen, Stephanie Jansen
Tel.: +49 (241) 80-21601 or -21606


LISTEN Project: Hands-Free Voice-Enabled Interface to Web Applications for Smart Home Environments