Skip to content

Add Wikipedia as a new data source #159

@oree-xx

Description

@oree-xx

Description

Currently, Quantifying uses Google Custom Search and GitHub APIs. I would like to add Wikipedia as a new source for fetching data. So we would have wikipedia_fetch.py under scripts directory.

Implementation

  • Integrate the Wikipedia API as a source for it to retrieve the count of Wikipedia articles containing references to specific Creative Commons licenses or keywords.
  • Implement fetching functions similar to existing github_fetch.py so the data flow remains consistent.
  • Update documentation to reflect Wikipedia as a supported data source.

API documentation: https://www.mediawiki.org/wiki/API:Siteinfo#Rightsinfo

Available statistics:

  • Number of articles
  • Number of pages
  • Number of edits
  • Number of users
  • Number of images

I would update the implementation as I continue working on it.

Metadata

Metadata

Assignees

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions