Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all domains
Limit my searches in the following domains
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(14)
Hybrid Model
Information Extraction
Information Retrieval
Information Retrieval Model
Language Model
Object Oriented
Paradigm Shift
Relevance Ranking
Retrieval Model
Search Engine
Site Quality
Web Databases
Web Pages
Web Search Engine
Related Publications
(11)
Entity Search Engine: Towards Agile Best-Effort Information Integration over the Web
Structured Querying of Web Text Data: A Technical Challenge
Dynamic personalized pagerank in entity-relation graphs
Object-level ranking: bringing order to web objects
Breaking Through the Syntax Barrier: Searching with Entities and Relations
Subscribe
Academic
Publications
Web Object Retrieval
Edit
Web Object Retrieval
(
Citations: 44
)
BibTex
|
RIS
|
RefWorks
Download
Zaiqing Nie
,
Yunxiao Ma
,
Shuming Shi
,
Ji-Rong Wen
,
Wei-Ying Ma
The primary function of current
Web search
engines is essentially
relevance ranking
at the document level. However, myriad structured information about real-world objects is embedded in static
Web pages
and online Web databases. Document-level
information retrieval
can unfortunately lead to highly inaccurate
relevance ranking
in answering object-oriented queries. In this paper, we propose a
paradigm shift
to enable searching at the object level. In traditional
information retrieval
models, documents are taken as the retrieval units and the content of a document is considered reliable. However, this reliability assumption is no longer valid in the object retrieval context when multiple copies of information about the same object typically exist. These copies may be inconsistent because of diversity of Web site qualities and the limited performance of current
information extraction
techniques. If we simply combine the noisy and inaccurate attribute information extracted from different sources, we may not be able to achieve satisfactory retrieval performance. In this paper, we propose several language models for Web object retrieval, namely an unstructured object retrieval model, a structured object retrieval model, and a
hybrid model
with both structured and unstructured retrieval features. We test these models on a paper
search engine
and compare their performances. We conclude that the
hybrid model
is the superior by taking into account the extraction errors at varying levels.
Conference:
World Wide Web Conference Series - WWW
, pp. 81-90, 2007
DOI:
10.1145/1242572.1242584
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
portal.acm.org
)
(
portal.acm.org
)
(
doi.acm.org
)
(
www.informatik.uni-trier.de
)
(
research.microsoft.com
)
(
research.microsoft.com
)
More »
Citation Context
(34)
...For example, both
Nie et al. (2007)
and Balog et al. (2006) propose extended language models to address the expert finding problem...
Jie Tang
,
et al.
Topic level expertise search over heterogeneous networks
...Semantic desktop search, Deep-Web search, vertical search, Dataspaces, EntityRank, ExDBMS, Libra, [10, 18, 13,
35
, 12, 34] attempt to query and index collections of Web and personal data but the extraction techniques are less general...
Michael Gubanov
,
et al.
READFAST: Browsing large documents through unified famous objects (UFO...
...Ranking the results of queries that yield more answers than a human would want to see has been intensively studied for entity-centric search (e.g., finding soccer players who played for FC Barcelona) [31, 38, 62, 75, 81,
99
, 102, 116, 132]...
Gerhard Weikum
,
et al.
From information to knowledge: harvesting entities and relationships f...
...We are now witnessing an emerging research trend on using entities and relationships to facilitate various search and mining tasks [7, 8, 25, 13, 12, 4, 5, 6,
20
, 27, 9, 30]...
Tao Cheng
,
et al.
Beyond pages: supporting efficient, scalable entity search with dual-i...
...With an observed paradigm shift in search from whole web pages to smaller subunits [
7
], the granularity of visual structure analysis can be adapted to allow comparison at the web object level; (3 ) Exploration...
Paul Bohunsky
,
et al.
Visual structure-based web page clustering and retrieval
References
(43)
Effective retrieval with distributed collections
(
Citations: 141
)
Jinxi Xu
,
James P. Callan
Conference:
Research and Development in Information Retrieval - SIGIR
, pp. 112-120, 1998
Assessment Methods for Information Quality Criteria
(
Citations: 57
)
Felix Naumann
,
Claudia Rolker
Conference:
MIT Conference on Information Quality - IQ
, pp. 148-162, 2000
Retrieving web pages using content
(
Citations: 18
)
T. Westerveld
,
W. Kraaij
,
D. Hiemstra
Published in 2001.
Data extraction and label assignment for web databases
(
Citations: 138
)
Jiying Wang
,
Frederick H. Lochovsky
Conference:
World Wide Web Conference Series - WWW
, pp. 187-196, 2003
Length normalization in XML retrieval
(
Citations: 42
)
Jaap Kamps
,
Maarten de Rijke
,
Börkur Sigurbjörnsson
Conference:
Research and Development in Information Retrieval - SIGIR
, pp. 80-87, 2004
Order by:
Citations
(44)
Topic level expertise search over heterogeneous networks
Jie Tang
,
Jing Zhang
,
Ruoming Jin
,
Zi Yang
,
Keke Cai
,
Li Zhang
,
Zhong Su
Journal:
Machine Learning - ML
, vol. 82, no. 2, pp. 211-237, 2011
READFAST: Browsing large documents through unified famous objects (UFO)
Michael Gubanov
,
Anna Pyayt
,
Linda Shapiro
Conference:
Information Reuse and Integration - IRI
, 2011
From information to knowledge: harvesting entities and relationships from web sources
(
Citations: 4
)
Gerhard Weikum
,
Martin Theobald
Conference:
Symposium on Principles of Database Systems - PODS
, pp. 65-76, 2010
Beyond pages: supporting efficient, scalable entity search with dual-inversion index
(
Citations: 1
)
Tao Cheng
,
Kevin Chen-Chuan Chang
Conference:
Extending Database Technology - EDBT
, pp. 15-26, 2010
Visual structure-based web page clustering and retrieval
(
Citations: 1
)
Paul Bohunsky
,
Wolfgang Gatterbauer
Conference:
World Wide Web Conference Series - WWW
, pp. 1067-1068, 2010