Improve documentation of the metrics available in the knowledge gap index
Closed, Resolved (Public)

Description

The documentation for the knowledge gaps metrics is quite extensive and covers both the research component and the actual development component. Cleanup is needed so that, when we point to the documentation for the metrics that are actually in the index, there is a clear table listing each available metric, how it is computed, and what it represents.

Current documentation: https://meta.wikimedia.org/wiki/Research:Knowledge_Gaps_Index/Measurement

Event Timeline

The measurement page and its subpages have been updated! @fkaelin could you update the Datasets page with the latest links and info?

@fkaelin since the data is now relatively stable, could you kindly update the Datasets page? Thank you!

@fkaelin assigning this to you so that you can update the data documentation before we work on the advertising/comms plan. Thank you!

  • the content gap metrics datasets are now ingested into datahub
  • they are also available via superset as virtual datasets, and I created a dashboard to experiment

@Miriam, let's discuss this week to see what else we want to make part of this phab.

Next steps:

For completeness: the hive tables, which are equivalent to the published datasets but only available internally, are also documented; see datahub (SSO login required)