research-article

Corroborating information from disagreeing views

Authors:
Alban Galland

INRIA Saclay--Ile-de-France, Saclay, France

INRIA Saclay--Ile-de-France, Saclay, France
View Profile

,
Serge Abiteboul

INRIA Saclay-Ile-de-France, Saclay, France

INRIA Saclay-Ile-de-France, Saclay, France
View Profile

,
Amélie Marian

Rutgers University, New Brunswick, USA

Rutgers University, New Brunswick, USA
View Profile

,
Pierre Senellart

Institut Télécom; Télécom ParisTech, Paris, France

Institut Télécom; Télécom ParisTech, Paris, France
View Profile

WSDM '10: Proceedings of the third ACM international conference on Web search and data miningFebruary 2010Pages 131–140https://doi.org/10.1145/1718487.1718504

Published:04 February 2010Publication History

WSDM '10: Proceedings of the third ACM international conference on Web search and data mining

Pages 131–140

ABSTRACT

We consider a set of views stating possibly conflicting facts. Negative facts in the views may come, e.g., from functional dependencies in the underlying database schema. We want to predict the truth values of the facts. Beyond simple methods such as voting (typically rather accurate), we explore techniques based on "corroboration", i.e., taking into account trust in the views. We introduce three fixpoint algorithms corresponding to different levels of complexity of an underlying probabilistic model. They all estimate both truth values of facts and trust in the views. We present experimental studies on synthetic and real-world data. This analysis illustrates how and in which context these methods improve corroboration results over baseline methods. We believe that corroboration can serve in a wide range of applications such as source selection in the semantic Web, data quality assessment or semantic annotation cleaning in social networks. This work sets the bases for a wide range of techniques for solving these more complex problems.

References

S. Abiteboul, M. Preda, and G. Cobena. Adaptive on-line page importance computation. In Proc. WWW, Budapest, Hungary, May 2003. Google ScholarDigital Library
M. Arenas, L. Bertossi, and J. Chomicki. Consistent query answers in inconsistent databases. In Proc. PODS, Philadelphia, Pennsylvania, USA, May 1999. Google ScholarDigital Library
E. Brill, S. Dumais, and M. Banko. An analysis of the AskMSR question-answering system. In Proc. EMNLP, July 2002. Google ScholarDigital Library
S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107--117, 1998. Google ScholarDigital Library
C.-H. Chang, M. Kayed, M.R. Girgis, and K.F. Shaalan. A survey of Web information extraction systems. IEEE Transactions on Knowledge and Data Engineering, 18(10):1411--1428, Oct. 2006. Google ScholarDigital Library
A.P. Dempster, N.M. Laird, and D.B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, 39(1):1--38, 1977.Google Scholar
X. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: The role of source dependence. In Proc. VLDB, Lyon, France, 2009.Google ScholarDigital Library
X. Dong, L. Berti-Equille, and D. Srivastava. Truth discovery and copying detection in a dynamic world. In Proc. VLDB, Lyon, France, 2009. Google ScholarDigital Library
D. Downey, O. Etzioni, and S. Soderland. A probabilistic model of redundancy in information extraction. In Proc. IJCAI, Edinburgh, United Kingdom, July 2005. Google ScholarDigital Library
A. Fuxman, E. Fazli, and R.J. Miller. Conquer: efficient management of inconsistent databases. In Proc. SIGMOD, Baltimore, Maryland, USA, June 2005. Google ScholarDigital Library
A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroboration de vues discordantes fondées sur la confiance. In Proc. BDA, Namur, Belgium, Oct. 2009. Conference without formal proceedings.Google Scholar
S. Golder and B.A. Huberman. Usage patterns of collaborative tagging systems. Journal of Information Science, 32(2):198--208, April 2006. Google ScholarDigital Library
O. Häggström. Finite Markov chains and algorithmic applications, volume 52 of London Mathematical Society Student Texts. Cambridge University Press, Cambridge, United Kingdom, 2002.Google Scholar
A. Jøsang, S. Marsh, and S. Pope. Exploring different types of trust propagation. In Proc. Trust Management, Pisa, Italy, May 2006. Google ScholarDigital Library
C.C.T. Kwok, O. Etzioni, and D.S. Weld. Scaling question answering to the Web. In Proc. WWW, Hong Kong, China, May 2001. Google ScholarDigital Library
C.D. Manning, P. Raghavan, and H. Schutze. Introduction to Information Retrieval. Cambridge University Press, Cambridge, United Kingdom, 2008. Google ScholarDigital Library
G.A. Mihaila, L. Raschid, and M.-E. Vidal. Using quality of data metadata for source selection and ranking. In Proc. WebDB, Dallas, Texas, USA, May 2000.Google Scholar
D. Osherson and M.Y. Vardi. Aggregating disparate estimates of chance. Games and Economic Behavior, 56(1):148--173, July 2006.Google ScholarCross Ref
N.E. Taylor and Z.G. Ives. Reconciling while tolerating disagreement in collaborative data sharing. In Proc. SIGMOD, Chicago, Illinois, USA, June 2006. Google ScholarDigital Library
M. Wu and A. Marian. Corroborating answers from multiple Web sources. In Proc. WebDB, Beijing, China, June 2007.Google Scholar
X. Yin, J. Han, and P.S. Yu. Truth discovery with multiple conflicting information providers on the Web. In Proc. KDD, San Jose, California, USA, Aug. 2007. Google ScholarDigital Library

Index Terms

Corroborating information from disagreeing views
1. Computer systems organization
  1. Architectures
    1. Other architectures
      1. Heterogeneous (hybrid) systems
2. Information systems
  1. Data management systems
  2. Information systems applications
    1. Data mining

Recommendations

Patterns for Implementing Uncertainty Propagation
EuroPLoP '18: Proceedings of the 23rd European Conference on Pattern Languages of Programs

In this paper, the design patterns Uncertain Number and Propagation Strategy are presented. They are useful for storing uncertainties of values and propagating them throughout calculations in an application. Uncertain Number represents a numerical value ...
Read More
Agents’ model of uncertainty

Multi-agent systems play an increasing role in sensor networks, software engineering, web design, e-commerce, robotics, and many others areas. Uncertainty is a fundamental property of these areas. Agent-based systems use probabilistic and other ...
Read More
Certainty, trust and evidence

We assume a group of agents, in which a process of opinion gathering takes place.We examine the values of agent's reputation, trust and certainty.Confidence value for agents depend on the values of trust and certainty.Increasing the importance of trust ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '10: Proceedings of the third ACM international conference on Web search and data mining
February 2010
468 pages
ISBN:9781605588896
DOI:10.1145/1718487
General Chairs:
Brian D. Davison
Lehigh University, USA
,
Torsten Suel
Polytechnic Institute of NYU, USA
,
Program Chairs:
Nick Craswell
Microsoft, USA
,
Bing Liu
University of Illinois, Chicago, USA
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 February 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
confidence
contradiction
corroboration
fix-point
probabilistic model
view
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate498of2,863submissions,17%
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 228
  Total Citations
  View Citations
- 784
  Total Downloads
- Downloads (Last 12 months)35
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Corroborating information from disagreeing views

WSDM '10: Proceedings of the third ACM international conference on Web search and data mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Patterns for Implementing Uncertainty Propagation

Agents’ model of uncertainty

Certainty, trust and evidence