生物医学大数据发展的新挑战与趋势
New Challenges and Trends in Bio-Med Big Data
生物医学数据从PB量级的组学时代进入到EB量级的多维度大数据时代,引发了生物医学研究向数据密集型的第四科学范式的深刻变革。如何将临床数据与研究数据进行高维度多层次的汇交共享,实现从“组学”到临床与健康人群数据的生物医学大数据的综合管理利用,从而使大数据迅速转化为新知识,成为生物医学大数据所面临的挑战。发展以递交为基础、以整合为导向的数据存储技术,以主题为基础、以交互为导向的数据共享技术,以及以传统信息技术为基础、以前沿信息技术为导向的数据分析挖掘技术,并同时开展标准质控相关研究,是生物医学大数据存储、共享和转化的新思路,也是构建新一代生物医学大数据研究中心的技术关键和未来趋势。
he bio-medical data has entered a new era from exabyte-scale of genomic data to petabyte-scale of multi-dimensional big data, transforming the biological and medical research into a data-intensive science that is also referred as the fourth paradigm of discovery. Such transformation presented a set of new challenges: we have to efficiently gather and share high-dimensional and multi-level clinical and research data, further facilitate the comprehensive utilization of various omics data, clinical data, and phenome data of large population, eventually convert big data to new knowledge. Such challenges have to be faced by employing a new series of paradigm shifting ideas. In particular, new frameworks should be developed to improve the current submission-based data storage system to an integration-oriented system; to improve the subjective-based data sharing system to an interactive-oriented system; to integrate the cutting edge information technologies into the current data mining system. At the same time, large efforts have to be invested in developing data standardization guidelines and quality control technologies. These ideas will be critical in order to establish next generation of bio-medical big data centers and will be a new trend of future research.
赵国屏、张国庆、李亦学、王泽峰
生物科学现状、生物科学发展生物科学研究方法、生物科学研究技术计算技术、计算机技术
生物医学大数据整合交互数据挖掘
赵国屏,张国庆,李亦学,王泽峰.生物医学大数据发展的新挑战与趋势[EB/OL].(2023-03-19)[2025-08-02].https://chinaxiv.org/abs/202303.00693.点此复制
评论