National Preprint Platform (国家预印本平台)

Blog Gender Classification Model Based on Sentiment Topic

Abstract

With the development of the Internet, blogs have become familiar to a wide range of users, and major web portals and SNS sites all host their own blog spaces. Blogs have also become a research hotspot in academia, and blog gender classification is an important part of blog research. This paper proposes a sentiment-based blog classification model that addresses the blog gender classification problem through sentiment topics. First, the model presents an LDA-based method for expanding a set of sentiment words. Second, using the sentiment words from WordNet-Affect together with the expanded sentiment words, it derives male and female sentiment topics with the LDA model and proposes a method for filtering them, which yields sentiment topics that discriminate better between genders. Finally, it gives the model's gender computation formula by combining the sentiment topics with an internal dictionary. Experiments show that sentiment topics help improve blog gender classification results.
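The three steps in the abstract can be illustrated with a minimal sketch. The abstract does not give the authors' expansion rule, filtering criterion, or gender formula, so everything below is an assumption made for illustration: the function names, the toy topic-word probabilities, the "shares a top-n list with a seed word" expansion rule, and the "larger summed topic weight wins" scoring rule are all hypothetical. Topic filtering is not shown. The topics are plain dictionaries standing in for the output of an LDA run.

```python
from typing import Dict, List, Set

Topic = Dict[str, float]  # word -> probability within one topic (toy values)


def top_words(topic: Topic, n: int) -> List[str]:
    """Return the n highest-probability words of a topic."""
    return sorted(topic, key=topic.get, reverse=True)[:n]


def expand_lexicon(seed: Set[str], topics: List[Topic], n: int = 2) -> Set[str]:
    """Hypothetical expansion rule: a word joins the lexicon when it shares
    a topic's top-n list with at least one seed sentiment word."""
    expanded = set(seed)
    for topic in topics:
        words = top_words(topic, n)
        if seed.intersection(words):
            expanded.update(words)
    return expanded


def sentiment_weight(tokens: List[str], topics: List[Topic], lexicon: Set[str]) -> float:
    """Sum, over tokens found in the lexicon, the highest probability
    that any of the given topics assigns to the token."""
    return sum(max(t.get(w, 0.0) for t in topics) for w in tokens if w in lexicon)


def classify(tokens: List[str], male_topics: List[Topic],
             female_topics: List[Topic], lexicon: Set[str]) -> str:
    """Hypothetical decision rule: the gender whose sentiment topics give
    the larger summed weight wins; ties fall to 'female'."""
    m = sentiment_weight(tokens, male_topics, lexicon)
    f = sentiment_weight(tokens, female_topics, lexicon)
    return "male" if m > f else "female"


# Toy topics, standing in for LDA output on male and female blog posts.
male_topics = [{"angry": 0.30, "furious": 0.25, "game": 0.20, "happy": 0.05}]
female_topics = [{"happy": 0.35, "lovely": 0.30, "family": 0.20, "angry": 0.05}]

seed = {"angry", "happy"}  # stands in for a WordNet-Affect seed list
lexicon = expand_lexicon(seed, male_topics + female_topics, n=2)

print(sorted(lexicon))
print(classify(["so", "lovely", "and", "happy"], male_topics, female_topics, lexicon))
print(classify(["furious", "game"], male_topics, female_topics, lexicon))
```

In this toy setup the seed words pull in "furious" and "lovely" because each shares a top-2 list with a seed word, and a post is assigned to whichever gender's topics give its sentiment words more weight. In a real pipeline the topics would come from an LDA implementation trained on labeled blog posts.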

Authors: Yang Liang (杨亮), Lin Hongfei (林鸿飞), Wang Hao (王昊)

Subjects: Information and knowledge dissemination; science and scientific research

Keywords: Natural Language Processing; Blog Gender Classification; Sentiment Topic; LDA Model

Yang Liang, Lin Hongfei, Wang Hao. Blog Gender Classification Model Based on Sentiment Topic [EB/OL]. (2012-12-28) [2025-08-19]. http://www.paper.edu.cn/releasepaper/content/201212-1176.
