中国科学院软件研究所机构知识库
Advanced  
ISCAS OpenIR  > 软件所图书馆  > 期刊论文
Title:
基于翻译模型的查询会话检测方法研究
Alternative Title: A Translation Model Based Method for Query Session Detection
Author: 张振中 ; 孙乐 ; 韩先培
Keyword: 查询会话检测 ; 词语不匹配问题 ; 查询日志
Source: 中文信息学报
Issued Date: 2015
Volume: 29, Issue:4, Pages:95-102
Indexed Type: CSCD
Department: 张振中, 中国科学院软件研究所基础软件中心, 北京 100190, 中国;孙乐, 中国科学院软件研究所基础软件中心, 北京 100190, 中国;韩先培, 中国科学院软件研究所基础软件中心, 北京 100190, 中国;
Abstract: 查询会话检测的目的是确定用户为了满足某个特定需求而连续提交的相关查询。查询会话检测对于查询日志分析以及用户行为分析来说是非常有用的。传统的查询会 话检测方法大都基于查询词的比较,无法解决词语不匹配问题(vocabulary-mismatch problem)---有些主题相关的查询之间并没有相同的词语。为了解决词语不匹配问题,我们在该文提出了一种基于翻译模型的查询会话检测方法,该方法 将词与词之间的关系刻画为词与词之间的翻译概率,这样即使词与词之间没有相同的词语,我们也可以捕捉到它们之间的语义关系。同时,我们也提出了两种从查询 日志中估计词翻译概率的方法,第一种方法基于查询的时间间隔,第二种方法基于查询的点击URLs。实验结果证明了该方法的有效性。
English Abstract: Query session detection is critical for query log analysis and user behavior characterization.It aims at identifying the consecutive queries submitted by a user for the same information need.Traditional query session detection methods are based on lexical comparisons,which often suffer from the vocabulary-mismatch problem(i.e,the topically related queries may not share any common words).To resolve the issue,this paper proposes a translation model based method for query session detection,which can model the relationship between words as word translation probability.In this way our method can capture the relatedness between queries even they do not share any common words.Furthermore,we also propose two approaches for generating training data from web query log for translation probability estimation.The first approach is based on time gap between queries and the second is based on the clicked URLs of queries.Experimental results show that our method can significantly outperform the baselines.
Language: 中文
Citation statistics:
Content Type: 期刊论文
URI: http://ir.iscas.ac.cn/handle/311060/17402
Appears in Collections:软件所图书馆_期刊论文

Files in This Item:
File Name/ File Size Content Type Version Access License
基于翻译模型的查询会话检测方法研究.pdf(952KB)----限制开放 联系获取全文

Recommended Citation:
张振中,孙乐,韩先培. 基于翻译模型的查询会话检测方法研究[J]. 中文信息学报,2015-01-01,29(4):95-102.
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[张振中]'s Articles
[孙乐]'s Articles
[韩先培]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[张振中]‘s Articles
[孙乐]‘s Articles
[韩先培]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2019  中国科学院软件研究所 - Feedback
Powered by CSpace