LSM-2: Learning from Incomplete Wearable Sensor Data

Source: arXiv
Abstract

Foundation models, a cornerstone of recent advancements in machine learning, have predominantly thrived on complete and well-structured data. Wearable sensor data frequently suffers from significant missingness, posing a substantial challenge for self-supervised learning (SSL) models that typically assume complete data inputs. This paper introduces the second generation of Large Sensor Model (LSM-2) with Adaptive and Inherited Masking (AIM), a novel SSL approach that learns robust representations directly from incomplete data without requiring explicit imputation. AIM's core novelty lies in its use of learnable mask tokens to model both existing ("inherited") and artificially introduced missingness, enabling it to robustly handle fragmented real-world data during inference. Pre-trained on an extensive dataset of 40M hours of day-long multimodal sensor data, our LSM-2 with AIM achieves the best performance across a diverse range of tasks, including classification, regression and generative modeling. Furthermore, LSM-2 with AIM exhibits superior scaling performance, and critically, maintains high performance even under targeted missingness scenarios, reflecting clinically coherent patterns, such as the diagnostic value of nighttime biosignals for hypertension prediction. This makes AIM a more reliable choice for real-world wearable data applications.
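The abstract describes AIM as treating existing ("inherited") and artificially introduced missingness with the same mask tokens. The snippet below is only a minimal NumPy sketch of that idea, not the authors' implementation: the function names, the scalar stand-in for a learnable mask token, the random masking strategy, and the choice to score reconstruction only on artificially hidden entries are all assumptions made for illustration.

```python
import numpy as np

def combine_masks(inherited_missing, artificial_ratio, rng):
    """Return (artificial, total) boolean masks; True means hidden from the model.

    inherited_missing marks entries that were never recorded by the sensor.
    Artificial masking is drawn only from entries that were actually observed.
    """
    observed = ~inherited_missing
    artificial = observed & (rng.random(inherited_missing.shape) < artificial_ratio)
    total = inherited_missing | artificial
    return artificial, total

def apply_mask_token(x, total_mask, mask_token):
    """Replace every hidden entry (inherited or artificial) with the mask token.

    Assumption: a real model would use a learnable vector; a scalar stands in here.
    """
    return np.where(total_mask, mask_token, x)

def reconstruction_loss(pred, target, artificial_mask):
    """Mean squared error over artificially masked entries only.

    Inherited gaps have no ground truth, so they are excluded from the loss.
    """
    if not artificial_mask.any():
        return 0.0
    diff = pred[artificial_mask] - target[artificial_mask]
    return float(np.mean(diff ** 2))
```

In this sketch the model input never contains NaNs, because both kinds of gap are filled with the same token, while the loss is computed only where the true value is known.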

Maxwell A. Xu, Girish Narayanswamy, Kumar Ayush, Dimitris Spathis, Shun Liao, Shyam A. Tailor, Ahmed Metwally, A. Ali Heydari, Yuwei Zhang, Jake Garrison, Samy Abdel-Ghaffar, Xuhai Xu, Ken Gu, Jacob Sunshine, Ming-Zher Poh, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Mark Malhotra, Shwetak Patel, Yuzhe Yang, James M. Rehg, Xin Liu, Daniel McDuff

Subject: Computing technology, computer technology

Maxwell A. Xu, Girish Narayanswamy, Kumar Ayush, Dimitris Spathis, Shun Liao, Shyam A. Tailor, Ahmed Metwally, A. Ali Heydari, Yuwei Zhang, Jake Garrison, Samy Abdel-Ghaffar, Xuhai Xu, Ken Gu, Jacob Sunshine, Ming-Zher Poh, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Mark Malhotra, Shwetak Patel, Yuzhe Yang, James M. Rehg, Xin Liu, Daniel McDuff. LSM-2: Learning from Incomplete Wearable Sensor Data [EB/OL]. (2025-06-05) [2025-06-24]. https://arxiv.org/abs/2506.05321.
