


ID: 356459.0, MPI für Informatik / Databases and Information Systems Group
Efficient Text Proximity Search
Authors: Schenkel, Ralf; Broschart, Andreas; Hwang, Seungwon; Theobald, Martin; Weikum, Gerhard
Language: English
Publisher: Springer
Place of Publication: Berlin, Germany
Date of Publication (YYYY-MM-DD): 2007
Title of Proceedings: String Processing and Information Retrieval : 14th International Symposium, SPIRE 2007
Start Page: 287
End Page: 299
Title of Series: Lecture Notes in Computer Science
Place of Conference/Meeting: Santiago, Chile
Start Date of Conference/Meeting (YYYY-MM-DD): 2007-10-29
End Date of Conference/Meeting (YYYY-MM-DD): 2007-10-31
Audience: Experts Only
Intended Educational Use: No
Abstract / Description: In addition to purely occurrence-based relevance models, term proximity has
been frequently used to enhance retrieval quality of keyword-oriented retrieval
systems. While there have been approaches on effective scoring functions that
incorporate proximity, there has not been much work on algorithms or access
methods for their efficient evaluation. This paper presents an efficient
evaluation framework including a proximity scoring function integrated within a
top-k query engine for text retrieval. We propose precomputed and materialized
index structures that boost performance. The increased retrieval effectiveness
and efficiency of our framework are demonstrated through extensive experiments
on a very large text benchmark collection. In combination with static index
pruning for the proximity lists, our algorithm achieves an improvement of two
orders of magnitude compared to a term-based top-k evaluation, with a
significantly improved result quality.
Last Change of the Resource (YYYY-MM-DD): 2008-03-20
External Publication Status: published
Document Type: Conference-Paper
Communicated by: Gerhard Weikum
Affiliations: MPI für Informatik/Databases and Information Systems Group
Identifiers: LOCALID:C12573CC004A8E26-F778C4B7609BEE48C1257301002F4B0C-...
DOI: 10.1007/978-3-540-75530-2_26
ISBN: 978-3-540-75529-6
Full Text:
schenkelBHTW-SPIRE07.pdf  [4,00 Kb] [Comment:file from upload service]  