Research and Implementation of Blog Quality Retrieval
Study on In-depth Blog Retrieval
Traditional blog retrieval considers only topical relevance, which no longer meets the needs of a growing number of blog users. Users may be more interested in following bloggers whose posts offer in-depth discussion and analysis of a topic, so that they can focus their time and effort on those posts. This paper addresses the problem of finding blogs that are both relevant to a user's query and in-depth in content. We analyze the posts in each blog using the L-Qtf coefficient, a kind of pivoted normalization weighting coefficient. Different types of L-Qtf-based coefficients differ in how well they capture content depth, and we report and analyze comparative experiments across them. We propose an improved framework for an in-depth facet blog distillation system that returns blogs relevant to the query, ranked by the depth of their content. We also introduce the BAF coefficient to handle ambiguity in in-depth blog retrieval, which effectively improves retrieval performance. All experiments use the BLOG08 dataset, and the results show that the improved system significantly outperforms the prior system's results in the TREC 2009 blog track.
Guan Jingyi, Xu Weiran
Computing technology, computer technology
information retrieval; in-depth; blog retrieval (blog distillation); pivoted normalization weighting (PNW) coefficient; L-Qtf; BAF
Guan Jingyi, Xu Weiran. Study on In-depth Blog Retrieval [EB/OL]. (2011-10-19) [2025-07-21]. http://www.paper.edu.cn/releasepaper/content/201110-163.