Skip to content

[Feature] Add Openverse as a Data Source #184

@Babi-B

Description

@Babi-B

Problem

The project currently has GitHub and GCS as automated data sources, but not Openverse. Openverse provides a large collection of openly licensed media, which will greatly enhance the breadth and depth of this data observatory

Description

Openverse aggregates data from several other openly licensed repositories like Flickr. It provides:

Alternatives

  • Work on another source

Additional context

  • Still understanding the project and solving this issue with one simple PR at a time
  • Openverse is compatible with the project structure for tracking CC Legal tools usage

Implementation

  • I will be implementing this feature
  • Focus on a single non-monolithic script scripts/1-fetch/openverse_fetch.py
  • Design script to run from the repository via pipenv
  • include --enable-save and --enable-git for consistent behavior with other scripts

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions