|国家预印本平台
首页|Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models

Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models

Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models

来源:Arxiv_logoArxiv
英文摘要

Speaker-dependent modelling can substantially improve performance in speech-based health monitoring applications. While mixed-effect models are commonly used for such speaker adaptation, they require computationally expensive retraining for each new observation, making them impractical in a production environment. We reformulate this task as a meta-learning problem and explore three approaches of increasing complexity: ensemble-based distance models, prototypical networks, and transformer-based sequence models. Using pre-trained speech embeddings, we evaluate these methods on a large longitudinal dataset of shift workers (N=1,185, 10,286 recordings), predicting time since sleep from speech as a function of fatigue, a symptom commonly associated with ill-health. Our results demonstrate that all meta-learning approaches tested outperformed both cross-sectional and conventional mixed-effects models, with a transformer-based method achieving the strongest performance.

Roseline Polle、Agnes Norbury、Alexandra Livia Georgescu、Nicholas Cummins、Stefano Goria

医学研究方法计算技术、计算机技术

Roseline Polle,Agnes Norbury,Alexandra Livia Georgescu,Nicholas Cummins,Stefano Goria.Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models[EB/OL].(2025-05-29)[2025-07-03].https://arxiv.org/abs/2505.23378.点此复制

评论