Wikidata:WikiProject Performing arts/Use cases/Swiss/Lot 2023-1

Import of structured data from performing-arts.ch (Q104173640) into Wikidata
Institution: SAPA Foundation, Swiss Archive of the Performing Arts (Q50920401) (Contact: Baptiste Coulon, SAPA bdc)
Commissioned by: Wikimedia CH (Q15279140) (Contact: Sandra Becker sandra.beckerwikimedia.ch)
Contractors: Nicolas Vigneron (VIGNERON en résidence) and Léa Lacroix (Auregann)
Timeframe: November 2023 - December 2028
Project summary
The SAPA Foundation has begun opening up its data in the field of the performing arts and would like to import data about productions into Wikidata. The first batch (2023-1) consists of a dataset of 14,000 productions, taken from the SPARQL endpoint of performing-arts.ch (Q104173640). Several further batches take place from 2025 to 2028, with the goal of uploading the entire dataset of 54,000 productions. As part of this project, we will analyse the existing data, prepare its modelling in Wikidata, and reconcile and import the data with OpenRefine, while staying in contact with the community through discussions and monitoring on this documentation page and its talk page.
The project is commissioned by Wikimedia CH and takes place in several rounds, from 2023 to 2028.
- Data analysis, export and import, OpenRefine: Nicolas Vigneron (VIGNERON en résidence)
- Coordination, contact with the community and the data provider, documentation: Léa Lacroix (Auregann)
- Contact at SAPA Foundation, Swiss Archive of the Performing Arts (Q50920401): Baptiste Coulon (SAPA bdc)
- Contact at Wikimedia CH (Q15279140): Sandra Becker
Each import round takes place in several phases:
- Analyze, clean and refine the data to get it ready for import into Wikidata.
- Analyze and improve the existing content on Wikidata. Create new entries if needed to enrich the data.
- Define the data model that will be used for the import into Wikidata and validate it together with the community.
- Communicate with the Foundation to transmit questions, issues with the data and requests for clarification.
- Provide a test sample for validation by the SAPA Foundation.
- Import the previously cleaned and refined data into Wikidata.
- Prepare visualizations to give an overview of the imported content and allow monitoring and maintenance.
- Update the documentation on this page.
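As an illustration of the first phase, the export from a SPARQL endpoint into a file that OpenRefine can open could be sketched as follows. This is a minimal sketch, not the workflow actually used in the project: the class IRI, the sample result and all function names are assumptions made for the example, and the real performing-arts.ch data model may differ. Only the shape of the response (the standard SPARQL 1.1 JSON results format) is taken as given. No network call is made; the sample stands in for what an endpoint would return.

```python
import csv
import io
from typing import Dict, List

# Hypothetical class IRI, used only for illustration; the actual ontology
# behind performing-arts.ch may use a different term.
PRODUCTION_CLASS = "http://example.org/ontology/Production"


def build_page_query(limit: int, offset: int) -> str:
    """Build a paged SPARQL SELECT query listing productions and their labels."""
    if limit <= 0 or offset < 0:
        raise ValueError("limit must be positive and offset non-negative")
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT ?production ?label WHERE {\n"
        f"  ?production a <{PRODUCTION_CLASS}> ;\n"
        "              rdfs:label ?label .\n"
        "}\n"
        "ORDER BY ?production\n"
        f"LIMIT {limit} OFFSET {offset}"
    )


def flatten_bindings(result: Dict) -> List[Dict[str, str]]:
    """Turn a SPARQL 1.1 JSON result into plain rows (variable name -> value)."""
    return [
        {var: cell["value"] for var, cell in binding.items()}
        for binding in result["results"]["bindings"]
    ]


def rows_to_csv(rows: List[Dict[str, str]], fieldnames: List[str]) -> str:
    """Serialize rows as CSV text, a format OpenRefine can import."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample in the SPARQL 1.1 JSON results format (not real data).
SAMPLE_RESULT = {
    "head": {"vars": ["production", "label"]},
    "results": {
        "bindings": [
            {
                "production": {
                    "type": "uri",
                    "value": "http://example.org/production/1",
                },
                "label": {
                    "type": "literal",
                    "xml:lang": "de",
                    "value": "Beispielproduktion",
                },
            }
        ]
    },
}

if __name__ == "__main__":
    print(build_page_query(limit=1000, offset=0))
    print(rows_to_csv(flatten_bindings(SAMPLE_RESULT), ["production", "label"]))
```

Paging with LIMIT and OFFSET keeps each request small, which may help when a batch contains thousands of productions. The resulting CSV can then be loaded into OpenRefine for cleaning and reconciliation against Wikidata.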
See also
Publications
- Paper about the project (in French) (summary in English, French, Italian and German), 2024
Discussions
Questions, suggestions, issues? Feel free to write on the talk page!