09:00 - 09:10 Opening remarks by Atul Kr. Ojha and Chao-Hong Liu (on behalf of Workshop Chairs)
09:10 - 10:10 Invited talk 1: Isaac Caswell, Google Inc.
Chair: TBD
Title: Low-Resource NLP: hot takes and anecdotes from Google Translate
Abstract: Come with me as I opine that Low-Resource NLP is still the best place to be, and then give lots of anecdotes about the things I’ve run into while adding hundreds of languages to Google Translate.
10:10 - 10:30 Session 1: Booster Presentations
Chair: TBD
10:10-10:12 PahGen: Generating Ancient Pahlavi Text via Grammar-guided Zero-shot Translation - Farhan Farsi, Parnian Fazel, Farzaneh Goshtasb, Nadia Hajipour, Sadra Sabouri, Ehsaneddin Asgari, Hossein Sameti
10:12-10:14 Multilingual State Space Models for Structured Question Answering in Indic Languages - Arpita Vats, Rahul Raja, Mrinal Mathur, Aman Chadha, Vinija Jain
10:14-10:16 Parallel Corpora for Machine Translation in Low-Resource Indic Languages: A Comprehensive Review - Rahul Raja, Arpita Vats
10:16-10:18 Comparative Evaluation of Machine Translation Models Using Human-Translated Social Media Posts as References: Human-Translated Datasets - Shareefa Ahmed Al Amer, Mark G. Lee, Phillip Smith
10:18-10:20 Fostering Digital Inclusion for Low-Resource Nigerian Languages: A Case Study of Igbo and Nigerian Pidgin - Ebelechukwu Nwafor, Minh Phuc Nguyen
10:20-10:22 Beyond English: The Impact of Prompt Translation Strategies across Languages and Tasks in Multilingual LLMs - Itai Mondshine, Tzuf Paz-Argaman, Reut Tsarfaty
10:22-10:24 Building Data Infrastructure for Low-Resource Languages - Sarah K. K. Luger, Rafael Mosquera, Pedro Ortiz Suarez
10:24-10:26 From Text to Multi-Modal: Advancing Low-Resource-Language Translation through Synthetic Data Generation and Cross-Modal Alignments - Bushi Xiao, Qian Shen, Daisy Zhe Wang
10:30 - 11:00 COFFEE/TEA BREAK
11:00 - 12:30 Session 2: Scientific Research Papers
Chair: TBD
11:00-11:18 Low-resource Machine Translation: what for? who for? An observational study on a dedicated Tetun language translation service - Raphael Merx, Adérito José Guterres Correia, Hanna Suominen and Ekaterina Vylomova
11:18-11:36 Enhanced Zero-Shot Machine Translation via Fixed Prefix Pair Bootstrapping - Van-Hien Tran and Masao Utiyama
11:36-11:54 Limitations of Religious Data and the Importance of the Target Domain: Towards Machine Translation for Guinea-Bissau Creole - Jacqueline Rowe, Edward Gow-Smith and Mark Hepple
11:54-12:12 Jamo-Level Subword Tokenization in Low-Resource Korean Machine Translation - Junyoung Lee, Marco Cognetta, Sangwhan Moon and Naoaki Okazaki
12:12-12:30 Wenzhou Dialect Speech to Mandarin Text Conversion - Zhipeng Gao, Akihiro Tamura and Tsuneo Kato
12:30 - 14:00 LUNCH
14:00 - 15:00 Invited talk 2: Arturo Oncevay, JP Morgan AI
Chair: TBD
Title: Low-resource MT: A perspective from the Americas
Abstract: Exploring the challenges and opportunities of MT for Indigenous languages in the Americas through lessons from organizing shared tasks at AmericasNLP.
15:00 - 16:00 Session 3: Poster Session
Chair: TBD
PahGen: Generating Ancient Pahlavi Text via Grammar-guided Zero-shot Translation - Farhan Farsi, Parnian Fazel, Farzaneh Goshtasb, Nadia Hajipour, Sadra Sabouri, Ehsaneddin Asgari, Hossein Sameti
Multilingual State Space Models for Structured Question Answering in Indic Languages - Arpita Vats, Rahul Raja, Mrinal Mathur, Aman Chadha, Vinija Jain
Parallel Corpora for Machine Translation in Low-Resource Indic Languages: A Comprehensive Review - Rahul Raja, Arpita Vats
Comparative Evaluation of Machine Translation Models Using Human-Translated Social Media Posts as References: Human-Translated Datasets - Shareefa Ahmed Al Amer, Mark G. Lee, Phillip Smith
Fostering Digital Inclusion for Low-Resource Nigerian Languages: A Case Study of Igbo and Nigerian Pidgin - Ebelechukwu Nwafor, Minh Phuc Nguyen
Beyond English: The Impact of Prompt Translation Strategies across Languages and Tasks in Multilingual LLMs - Itai Mondshine, Tzuf Paz-Argaman, Reut Tsarfaty
Building Data Infrastructure for Low-Resource Languages - Sarah K. K. Luger, Rafael Mosquera, Pedro Ortiz Suarez
From Text to Multi-Modal: Advancing Low-Resource-Language Translation through Synthetic Data Generation and Cross-Modal Alignments - Bushi Xiao, Qian Shen, Daisy Zhe Wang
15:30 - 16:00 COFFEE/TEA BREAK
16:00 - 17:30 Session 4: Scientific Research Papers
Chair: TBD
16:00-16:18 ModeLing: A Novel Dataset for Testing Linguistic Reasoning in Language Models - Nathan Andrew Chi, Teodor Malchev, Riley Kong, Ryan Andrew Chi, Lucas Huang, Ethan A Chi, R. Thomas McCoy and Dragomir Radev
16:18-16:36 Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models - Umer Butt, Stalin Varanasi and Günter Neumann
16:36-16:54 Encoder-Aware Sequence-Level Knowledge Distillation for Low-Resource Neural Machine Translation - Menan Velayuthan, Nisansa De Silva and Surangika Ranathunga
16:54-17:12 UTER: Capturing the Human Touch in Evaluating Morphologically Rich and Low-Resource Languages - Samy Ouzerrout
17:13 - 17:23 Closing remarks by Workshop Chairs