A content gap (gender) is associated with a set of categories (female, male, non-binary,...). The content gap metrics timeseries are available at 4 aggregation levels.
- by category / content gap / wiki_db
- by category / content gap
- content gap / wiki_db
- content gap
The current implementation of the content gaps integration in AQS is for the most granular level (by category / content gap / wiki_db), and the transforms the (content gap / wiki_db) level into the most granular by using an all_categories category.
All these datasets are available for use, and the aqs api does not use all of them.
- In particular, the aggregation level by category / content gap (i.e. aggregated across all wikis) has been used in notebooks/reports, but is not available with the current api proposal. Can and should we include this (e.g. by using a wiki called "all_wikis")?
- How does the choice of data to use impact the work required on the wikistats side?