June 12th to August 11th
**********************************************************************************************************
Speech translation has been a holy grail in human language technology for several decades; it was specifically called out by Bill Clinton during the 2000 State of the Union Address: “Soon researchers will bring us devices that can translate foreign languages as fast as you can talk”. Recent advances in deep learning have finally resulted in significant improvements in this technology, but research has been concentrated in two scenarios: (a) translation of public speeches in formal settings (e.g., TED Talks, Parliamentary speech) and (b) real-time interpretation.
SCALE’23 will focus on translation in informal conversational speech. In contrast to prepared addresses, translation of interpersonal conversations is challenging due to differences in acoustics, disfluencies, speaking style, vocabulary, and dialectal variation. In real-time interpretation a key objective is minimizing decoding latency, however, we plan to focus on improving translation accuracy.
Our goals at the workshop are to understand how current state-of-the-art technologies work for translation of conversational speech, identify the open areas of research unique to this scenario, and to advance research in this field. Research areas that we will explore include: