Policies and technologies for digital preservation in web archiving
DOI:
https://doi.org/10.26512/rici.v11.n1.2018.8473
Keywords:
digital preservation, preservation policy, web archiving
Abstract
This paper analyzes digital preservation from a web archiving perspective. It addresses the technologies involved in the archiving process, the policies for selecting, preserving and providing access to web content, and the international institutions that work on preserving the web. The methodology relies on bibliographic and documentary research on international web archiving initiatives; it aims to stimulate discussion in Brazil and to support applied studies. The paper analyzes scientific publications from journals indexed in Scopus over a five-year period (2012-2016) that deal with web archiving, web content selection policies, and the technologies applied to harvesting, storing and providing access to archived websites. It also provides an overview of the technologies used by the community of web archiving initiatives, based on data available on the website of the International Internet Preservation Consortium. It concludes that countries that do not yet have their own initiatives, such as Brazil, can not only preserve their digital memory but also contribute to the international web archiving community by establishing selection policies with specific approaches (institutional, thematic, domain, etc.) and by adopting open source web archiving technologies.
License
Copyright (c) 2021 Moises Rockembach, Caterina Marta Groposo Pavão
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright Notice
Authors who publish in this journal agree to the following terms:
- Authors retain copyright and grant the journal the right of first publication, with the work simultaneously licensed under the Creative Commons Attribution License 4.0, which allows the work to be shared with acknowledgment of its authorship and initial publication in this journal.
- Authors may enter into separate, additional contracts for the non-exclusive distribution of the version of the paper published in this journal (e.g., depositing it in an institutional repository or publishing it as a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to distribute their work online (e.g., in institutional repositories or on their website) at any point before or during the editorial process, as this can lead to productive exchanges and increase the impact and citation of the published work.