2022-07-22: Summary of "Web Archiving and Search Personalized"

The Web Archiving and Search Personalized system automatically captures, archives, and indexes pages for both full-text search and replay. (Source: Kiesel et al., Figure 1a) According to a study conducted by Teevan et al. in 2007, 39% of search queries represent users trying to re-find previously viewed pages [1]. One approach to supporting users in this task is automatic personal web archiving. Each page that the user visits is saved, so that it can be found later, similar to an automated version of the "bookmark as archive" feature in Mabe et al.’s Memento-aware browser prototype [2]. However, creating a system that can save web pages as they are viewed, index them for full-text search, and replay them later is an ambitious goal. Johannes Kiesel ( @KieselJohannes ), Arjen P. de Vries ( @arjenpdevries ), Matthias Hagen ( @matthias_hagen ), Benno Stein ( @bennostein ), and Martin Potthast ( @martinpotthast ) created a prototype system for this purpose in their paper “Web Arc...