International Internet Preservation Consortium

(Redirected from Web Curator Tool)

The International Internet Preservation Consortium is an international organization of libraries and other organizations established to coordinate efforts to preserve internet content for the future.[2] It was founded in July 2003 by 12 participating institutions,[1] and had grown to 35 members by January 2010.[3] As of January 2022, there are 52 members.

International Internet Preservation Consortium
AbbreviationIIPC
FormationJuly 2003; 21 years ago (2003-07)
PurposeAcquire, preserve and make accessible knowledge and information from the Internet for future generations everywhere, promoting global exchange and international relations.[1]
Websitehttp://netpreserve.org/

Membership is open to archives, museums, libraries (including national libraries), and cultural heritage institutions.[1][4]


Members

edit

National libraries

edit

Participating national libraries and archives include:[5]

Participating organisations

edit

Other participating organizations include:[5]

Past members

edit

WebCite used to be, but is no longer, a member of the IIPC.[6] In a 2012 message, its founder Gunther Eysenbach commented that "WebCite has no funding, and IIPC charges 4000 Euro/yr in membership fees."[7]

Projects

edit

The IIPC sponsors and collaborates on a number of different projects with its member organizations.

Current projects

edit
  • Support for transitioning to pywb (Python Wayback).[8]
  • Collaborative Collections: IIPC members are collaborating to build public web archive collections based on transnational themes or events of mutual interest. Topics of existing collections include: European Refugee Crisis, Intergovernmental Organizations, Olympics, World War I Commemoration, Climate Change, Artificial Intelligence, and Novel Coronavirus (COVID-19).[9]
  • Memento: aggregate metadata of the IIPC archives and provide access to Memento.[10]

IIPC also maintains an electronic mailing list open to anyone interested in issues associated with web harvesting, archiving, and quality maintenance issues.[11]

Past projects

edit
  • Developing Bloom Filters for Web Archives’ Holdings.[12]
  • Improving the Dark and Stormy Archives Framework by Summarizing the Collections of the National Library of Australia[13]
  • LinkGate: Core Functionality and Future Use Cases.[14]
  • Asking questions with web archives – introductory notebooks for historians: The project output is a set of 16 Jupyter notebooks that demonstrate how specific historical research questions can be explored by analysing data from web archives.[15][16][17]
  • IIPC sponsored a project on "cross-archival search strategies" which included the creation of an archive focused on the 2010 Winter Olympics.[18]
  • Starting in 2006, the National Library of New Zealand and the British Library developed the Web Curator Tool, an open-source workflow management system for selective web archiving. Since 2017 the Royal Library of the Netherlands has collaboratively developed the tool with the National Library of New Zealand. Version 3.2.1[19] was released in August 2024, and is available at GitHub.[20] The Web Curator Tool is built upon Java and utilizes Internet Archive’s technology, the Heritrix web archiving crawler, and replay tools such as OpenWayback[21] and Pywb.[22]
  • IIPC Web Archiving Doctoral Support Award: grant to provide three years of funding for a student to earn a PhD in Interdisciplinary Information Science at The University of North Texas College of Information.[23]
  • IIPC Member Staff Exchange: onsite training by experts for participating IIPC members to use Heritrix 3 web crawler.[24]
  • Working group on Statistics and Quality Indicators for Web Archiving: development of guidelines on the management and evaluation of Web archiving activities and products.[25]

References

edit
  1. ^ a b c "Mission & Goals | IIPC". www.netpreserve.org. International Internet Preservation Consortium. Archived from the original on 2017-06-06. Retrieved 2015-09-12.
  2. ^ "International Internet Preservation Consortium" (Press release). International Internet Preservation Consortium. May 5, 2004. Archived from the original on May 1, 2012.
  3. ^ "Web Archives Registry Launched". News & Events. Library of Congress. January 29, 2010. Archived from the original on April 8, 2011. Retrieved 2011-04-17.
  4. ^ Hiiragi, Wasuke; Shigeo Sugimoto; Tetsuo Sakaguchi. "Web archiving in the world - International Internet Preservation Consortium (IIPC) and their activities". The Journal of Information Science and Technology Association. 58 (8). Japan.
  5. ^ a b "Members". International Internet Preservation Consortium. 2020.
  6. ^ "WebCite Consortium FAQ". webcitation.org. WebCite. Archived from the original on 2008-08-28.
  7. ^ "Twitter post". 2012-06-11. Archived from the original on 2014-01-07. Retrieved 2013-03-10.
  8. ^ "Support for transitioning to pywb". International Internet Preservation Consortium. Retrieved 24 January 2021.
  9. ^ "Collaborative Collections". International Internet Preservation Consortium. Retrieved 24 January 2021.
  10. ^ "Memento". International Internet Preservation Consortium. Retrieved 17 March 2014.
  11. ^ "Web Curators Mailing List". International Internet Preservation Consortium. Archived from the original on 2014-01-25. Retrieved 2017-10-17.
  12. ^ "Developing Bloom Filters for Web Archives' Holdings". International Internet Preservation Consortium. Retrieved 24 January 2021.
  13. ^ "Improving the Dark and Stormy Archives Framework by Summarizing the Collections of the National Library of Australia". International Internet Preservation Consortium. Retrieved 24 January 2021.
  14. ^ "LinkGate: Core Functionality and Future Use Cases". International Internet Preservation Consortium. Retrieved 24 January 2021.
  15. ^ "Asking questions with web archives – introductory notebooks for historians". International Internet Preservation Consortium. Retrieved 24 January 2021.
  16. ^ "Web Archives". GLAM Workbench. Retrieved 24 January 2021.
  17. ^ "IIPC RSS webinar: Tim Sherratt: Jupyter notebooks for web archives". International Internet Preservation Consortium. Archived from the original on 2021-12-20. Retrieved 24 January 2021.
  18. ^ "2010 Winter Olympics". California Digital Library. 2010. Archived from the original on 2011-09-02.
  19. ^ "The Web Curator Tool Release History". ReadTheDocs. Retrieved 2024-10-30.
  20. ^ "Web Curator Tool". GitHub. Retrieved 2024-10-30.
  21. ^ "OpenWayback". International Internet Preservation Consortium. Retrieved 29 October 2024.
  22. ^ "pywb". GitHub. Retrieved 29 October 2024.
  23. ^ "PhD Sponsorship". International Internet Preservation Consortium. Archived from the original on 17 October 2014. Retrieved 17 March 2014.
  24. ^ "Staff Exchange". International Internet Preservation Consortium. Archived from the original on 7 November 2014. Retrieved 17 March 2014.
  25. ^ "Statistics and Quality Indicators for Web Archiving". International Internet Preservation Consortium. Archived from the original on 7 November 2014. Retrieved 17 March 2014.
edit