Protecting science: TIB builds dark archive for arXiv

Long-term protection for international research content

Science is international – and free access to the latest research findings is a key requirement for scientific progress. The arXiv.org platform plays a very important role here: it is a globally used online platform for preprints, i.e. versions of scientific papers that are made available before they have been peer-reviewed. Since 1991, arXiv has been an essential part of scientific communication, especially in the fields of physics, mathematics and computer science.

Although arXiv is operated by Cornell University in the USA, it is financed internationally. Together with the Helmholtz Association of German Research Centres (HGF) and the Max Planck Society (MPG), the TIB – Leibniz Information Centre for Science and Technology provides the German share of funding for the service. 

TIB takes responsibility for arXiv data

The TIB has now set up a so-called dark archive of the arXiv content so that the backed-up data can be made accessible if the data stored in the USA is lost. The archive functions as a silent reserve: a complete copy of the content is stored separately at the TIB, but is not publicly accessible. This means that the data, almost 10 terabytes in total, is protected against potential outages and can be activated in an emergency.

The TIB is currently working on processes to keep the archive up to date: new submissions and updated versions must be backed up regularly in order to preserve the state of research as completely as possible.

“Building a Dark Archive is an expression of our longstanding commitment to a reliable, international academic provision, and to our role as a partner of arXiv. Even though the Dark Archive today only works in the background, it is a key element in safeguarding digital research content in the long term, because in case of a crisis, we could open the archive,” explains Dr Irina Sens, Deputy Director of the TIB.

More information on the development of the dark archive for arXiv content can be found on the TIB-Blog.
