We are looking at improving the taxonomy of topics we use to classify articles so it better reflects topics that are relevant to the community.
- At the moment, the taxonomy includes an expanded version of the second level of the WikiProject directory taxonomy.
- We would like to stick to WikiProjects as the reference unit because:
  - They are widely adopted by the community as a way to organize labour.
  - They provide high-quality label data to train topic classifiers.
- The idea is to expand the existing topic taxonomy to include more WikiProjects whose topics the community considers relevant or impactful.
- The above implies setting up community consultations to define what those topical categories are.
- Once (part of) this set of topical categories is finalized, we can retrain our topic models to classify articles according to a more impactful set of topics.
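
To make the plan above concrete, here is a minimal sketch of how a topic taxonomy keyed on WikiProjects could be expanded and then used to derive article labels for retraining. Everything in it is an assumption made for illustration: the topic names, the WikiProject names, and the helpers `expand_taxonomy` and `topics_for_article` are hypothetical and do not describe the actual taxonomy or training pipeline.

```python
# Hypothetical taxonomy: topic label -> WikiProjects mapped to that topic.
# All topic and WikiProject names here are illustrative placeholders.
TAXONOMY = {
    "Culture.Music": ["WikiProject Music", "WikiProject Albums"],
    "STEM.Biology": ["WikiProject Biology", "WikiProject Birds"],
}


def expand_taxonomy(taxonomy, additions):
    """Return a copy of the taxonomy with extra WikiProjects merged in."""
    expanded = {topic: list(projects) for topic, projects in taxonomy.items()}
    for topic, projects in additions.items():
        existing = expanded.setdefault(topic, [])
        for project in projects:
            if project not in existing:
                existing.append(project)
    return expanded


def topics_for_article(article_projects, taxonomy):
    """Derive sorted topic labels from the WikiProjects that tag an article."""
    tagged = set(article_projects)
    return sorted(
        topic for topic, projects in taxonomy.items() if tagged & set(projects)
    )
```

For example, if a consultation proposed a new topic, `expand_taxonomy(TAXONOMY, {"STEM.Climate": ["WikiProject Climate change"]})` would return an expanded copy, and `topics_for_article` applied to each article's WikiProject tags would then yield the labels a retrained model could learn from.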