Efficient verification of web-content searching through authenticated web crawlers

Michael T. Goodrich, Duy Nguyen, Olga Ohrimenko, Charalampos Papamanthou, Roberto Tamassia, Nikos Triandopoulos, Cristina Videira Lopes

Research output: Contribution to journal › Article › peer-review

21 Scopus citations

Abstract

We consider the problem of verifying the correctness and completeness of the result of a keyword search. We introduce the concept of an authenticated web crawler and present its design and prototype implementation. An authenticated web crawler is a trusted program that computes a specially-crafted signature over the web contents it visits. This signature enables (i) the verification of common Internet queries on web pages, such as conjunctive keyword searches, which guarantees that the output of a conjunctive keyword search is correct and complete; and (ii) the verification of the content returned by such Internet queries, which guarantees that web data is authentic and has not been maliciously altered since the computation of the signature by the crawler. In our solution, the search engine returns a cryptographic proof of the query result. Both the proof size and the verification time are proportional only to the sizes of the query description and the query result, but do not depend on the number or sizes of the web pages over which the search is performed. As we experimentally demonstrate, the prototype implementation of our system provides a low communication overhead between the search engine and the user, and fast verification of the returned results by the user.
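The roles described in the abstract (a trusted crawler that signs, an untrusted search engine that returns a proof, and a user who verifies) can be illustrated with a deliberately simplified sketch. It is not the construction from the paper: it assumes a plain Merkle hash tree over an inverted index and ships each queried term's full posting list as the proof, so its proof size grows with the posting lists rather than only with the query and the result. The names `crawl_and_sign`, `search`, `verify` and `CRAWLER_KEY` are hypothetical, and an HMAC stands in for the crawler's signature only to keep the example within the standard library; a real deployment would need a public-key signature so that users can verify without being able to sign.

```python
import hashlib
import hmac

CRAWLER_KEY = b"demo-key"  # hypothetical; stand-in for the crawler's signing key


def h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def leaf_hash(term: str, doc_ids: list[int]) -> bytes:
    # A leaf binds a term to its complete, sorted posting list.
    payload = term + ":" + ",".join(str(d) for d in sorted(doc_ids))
    return h(b"\x00" + payload.encode())


def build_levels(leaves: list[bytes]) -> list[list[bytes]]:
    # levels[0] holds the leaves, levels[-1] holds the single root.
    levels = [leaves]
    while len(levels[-1]) > 1:
        cur = levels[-1]
        if len(cur) % 2:  # duplicate the last node on odd-sized levels
            cur = cur + [cur[-1]]
        levels.append([h(b"\x01" + cur[i] + cur[i + 1])
                       for i in range(0, len(cur), 2)])
    return levels


def merkle_path(levels: list[list[bytes]], index: int) -> list[tuple[bytes, bool]]:
    # Sibling hashes from leaf to root, each flagged if it sits on the left.
    path = []
    for level in levels[:-1]:
        if len(level) % 2:
            level = level + [level[-1]]
        sibling = index ^ 1
        path.append((level[sibling], sibling < index))
        index //= 2
    return path


def root_from_path(leaf: bytes, path: list[tuple[bytes, bool]]) -> bytes:
    node = leaf
    for sibling, sibling_is_left in path:
        node = h(b"\x01" + (sibling + node if sibling_is_left else node + sibling))
    return node


def crawl_and_sign(index: dict[str, list[int]]):
    # Trusted crawler: hash the inverted index into a tree and sign the root.
    terms = sorted(index)
    levels = build_levels([leaf_hash(t, index[t]) for t in terms])
    signature = hmac.new(CRAWLER_KEY, levels[-1][0], hashlib.sha256).digest()
    return terms, levels, signature


def search(index, terms, levels, query: list[str]):
    # Untrusted search engine: answer the conjunctive query and attach a proof.
    result = sorted(set.intersection(*(set(index[t]) for t in query)))
    proof = {t: (index[t], merkle_path(levels, terms.index(t))) for t in query}
    return result, proof


def verify(query: list[str], result: list[int], proof, signature: bytes) -> bool:
    # User: check every posting list against the signed root, then
    # recompute the intersection to confirm correctness and completeness.
    if any(t not in proof for t in query):
        return False
    roots = {root_from_path(leaf_hash(t, proof[t][0]), proof[t][1]) for t in query}
    if len(roots) != 1:
        return False
    expected = hmac.new(CRAWLER_KEY, roots.pop(), hashlib.sha256).digest()
    if not hmac.compare_digest(expected, signature):
        return False
    recomputed = sorted(set.intersection(*(set(proof[t][0]) for t in query)))
    return recomputed == list(result)
```

In this sketch a search engine that drops a matching page or alters a posting list is caught, because the recomputed root no longer matches the signed one or the recomputed intersection differs from the claimed result. Avoiding the transfer of whole posting lists, as the abstract's size bound requires, needs machinery beyond this example.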

Original language: English
Pages (from-to): 920-931
Number of pages: 12
Journal: Proceedings of the VLDB Endowment
Volume: 5
Issue number: 10
DOIs
State: Published - Jun 2012
