Wikipedia:Meetup/DC/SIL Wikidata Workshop 2020

Smithsonian Libraries Wikidata Workshop 2020
Event details
Date:10 June 2020

The Smithsonian Libraries is hosting a private online event for staff and collaborators to work on creating a data model for the Smithsonian Institution's Wikibase structured data project (under construction), based on Wikidata.

Date

edit

10 June 2020

Time

edit

10:00 EDT (Eastern (US & Canada)) | 16:00 CET (Berlin)

Where

edit

Remote

Register

edit

Participation by invitation only

Agenda

edit
  • Data modeling in Wikibase (understanding items and properties, and how to map your structured data to items/properties)
  • Understanding different data types
  • How to do bulk uploading of data (existing tools and challenges)

Background

edit

The above topics are to help participants working through the data modeling exercises so that the design of Wikibase will meet the Smithsonian Institution's collective needs. The Smithsonian Libraries' Wikidata team has also added a few topics that the team are working through. These may or may not be addressed at the workshop.

Wikibase's data model consists of items and properties. It follows the triple statement structure that items are linked via property. The initial need of the Smithsonian Libraries' Wikibase pilot is 1) to understand the Wikibase data model and 2) to customize Smithsonian-related authority data. We will identity needs for personal, family, geographic and corporate body names, with their necessary mapping from respective in-house databases.

The Smithsonian Libraries has begun a list of over 200 properties potentially useful to the pilot. The team has also identified names from potential collections that are candidates for Wikibase item generation.

Through this workshop, we will learn how to prepare the various types of data that SIL needs for bulk uploading into our Wikibase.

All of the referenced links are found in the (privately shared) Workshop Google document for ease of access.


Facilitators

edit


Participants

edit

Please note: prior to the workshop, every participant must have:

  • a Wikipedia/Wikidata account with username and password (ask Diane for help if you have questions on how to do this)
  • a general understanding of RDF triple statements in the Wikidata environment. Links in the Wikidata training section of the Meetup page provide basic understanding of Wikidata (structure, creation and editing of items, etc., & linked data)
  Discovery Services (DISC):  
      Heidy Berthoud: Head of Resource Description 
      Amy Watson: cataloging technician
      Mario Rups: librarian  

  Digital Programs and Initiatives (DPI):  
       Suzanne Pilsk: Head, Metadata Services 
       Bess Missell, Librarian, Metadata Services 
       Sue Graves, Metadata Technician, Metadata Services
       Richard Naples: Technical Information Specialist, Metadata Services
 
  Freer Gallery of Art and Arthur M. Sackler Gallery Library (FSG):   
       Yue Shu: librarian for Chinese language
       Kathryn Phillips: librarian for South Asian, Southeast Asian and Middle Eastern art and architecture
       Mike Smith: librarian for Korean art, American art, Conservation and Scientific Research, and serials
       Reiko Yoshimura: head librarian, Freer | Sackler and librarian for Japanese art

  Smithsonian American Art & Portrait Gallery Library (AAPG):
      Anne Evenhaugen: head librarian
   
  National Portrait Gallery (NPG): 
      Sue Garton, Collections Information Office
    
  Office of the Chief Info Officer: (OCIO):
     Nancy Kennedy (SIRIS/SIAC: Smithsonian Institution Research Information Service/Archives Center    

  Office of the Undersecretary for Museums and Culture/American Women’s History Initiative
     Effie Kapsalis Senior Digital Program Officer
  
  Smithsonian Archives (SIA)    
     Tammy Peters, Interim Director
    
  National Museum of Natural History (NMNH)
     Rebecca Snyder, Acting Informatics Branch Chief
  
  Smithsonian Archives of American Art (AAA)
     Tobias (Toby) Reiter: Web Developer
     Jess Purkis: Digital Initiatives Archivist
edit


SourceMD

edit

SourceMD is the "source metadata" tool for copying citation information to Wikidata.

Data models

edit

In general, a data model draws the representation of data, "the things", and the relationship between them. The exercises of data modeling are efforts to capture the semantics of an organization's business needs. For instance, in SI environment, the data modeling is conducted for the purpose of communicating both with the community needs and providing architecture for databases and system design. Semantic is important for systems to be able to communicate with one another. Ontologies, thesauri, glossaries serve as means to carry out their respective activities.

In Wikidata, a "data model" is a recommendation for what properties to apply to Wikidata items. Event participants can engage in Wikidata without consulting the below. However, if anyone wants inspiration when editing Wikidata items for people, publications, and institutions, etc., check the below:

Some Wikidata projects of interest to libraries:

edit