11th February2026: Dr Gary Tam
Spring semester, week 3
1.10-2pm [with optional extended discussion time until 2.20pm]
CymruFluency – A Fusion Technique and a 4D Welsh Dataset for Welsh Fluency (Proficiency) Analysis
Dr Gary Tam (Swansea University)
Welsh is a linguistically rich yet under-resourced minority language. Despite its cultural significance, automated fluency assessment remains largely unexplored due to limited datasets and tools. Existing models focus on high-resource languages, leaving Welsh without sufficient multi-modal resources. To address this, we introduce CymruFluency, the first 4D dataset for Welsh fluency assessment, capturing both audio and 3D lip movements with expert-annotated fluency scores. Building on this, we propose a multi-modal fluency classification framework that combines audio features (mel spectrograms) and manually annotated 3D lip landmarks. Our fusion approach significantly improves fluency prediction over unimodal models, emphasizing the critical role of 3D lip dynamics in Welsh learning. This research advances minority language processing by integrating articulatory features into fluency evaluation, offering a powerful tool for Welsh language learning, assessment, and preservation.
This session takes place in Room 3.58 of the John Percival Building [venue currently TBC] at Cardiff University or join us via Teams using this link.