机器学习的信息科学原理:基于形式化信息映射的因果链元框架
[Objective] This study focuses on addressing the current lack of a unified formal theoretical framework in machine learning, as well as the deficiencies in interpretability and ethical safety assurance.[Methods] A formal information model is first constructed, utilizing sets of well-formed formulas to explicitly define the ontological states and carrier mappings of typical components in machine learning. Learnable and processable predicates, along with learning and processing functions, are introduced to analyze the logical deduction and constraint rules of the causal chains within models.[Results] A meta-framework for machine learning theory (MLT-MF) is established. Based on this framework, universal definitions for model interpretability and ethical safety are proposed. Furthermore, three key theorems are proved: the equivalence of model interpretability and information recoverability, the assurance of ethical safety, and the estimation of generalization error.[Limitations] The current framework assumes ideal conditions with noiseless information-enabling mappings and primarily targets model learning and processing logic in static scenarios. It does not yet address information fusion and conflict resolution across ontological spaces in multimodal or multi-agent systems.[Conclusions] This work overcomes the limitations of fragmented research and provides a unified theoretical foundation for systematically addressing the critical challenges currently faced in machine learning.
许建峰
上海交通大学凯原法学院,智慧司法研究院,计算机学院
计算技术、计算机技术
机器学习客观信息论形式逻辑系统可解释性伦理安全泛化误差
machine learningobjective information theoryformal logic systemsinterpretabilityethical and safety assurancegeneralization error
许建峰.机器学习的信息科学原理:基于形式化信息映射的因果链元框架[EB/OL].(2025-05-30)[2025-06-06].https://chinaxiv.org/abs/202505.00162.点此复制
评论