User Details
- User Since
- Aug 8 2017, 10:56 AM (374 w, 22 h)
- Availability
- Available
- LDAP User
- Unknown
- MediaWiki User
- Diego (WMF) [ Global Accounts ]
Mon, Oct 7
Fri, Oct 4
Wed, Oct 2
Confirm if the hypothesis was supported or contradicted
Fri, Sep 13
Progress update
- I’m working on building a set of keywords related to peacock behavior and promotional tone. To do this, I’m using a TF-IDF approach, a well-known method to identify terms (keywords) that characterize a set of documents.
- This and next week are short for me (taking several days off), so it might take a bit more time to finalize this.
- I also communicated to my manager that we might try to build a product based on the fine-tuned model. If we decide to move forward, we would need to coordinate with her and the other teams involved on how to proceed.
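The TF-IDF idea mentioned above can be sketched in pure Python. This is a toy illustration only (made-up corpus, simplified scoring), not the actual pipeline or data:

```python
# Minimal pure-Python TF-IDF sketch for surfacing "peacock" keywords.
# Corpus and scoring are illustrative; real work would use a proper
# vectorizer over the actual labeled articles.
import math
from collections import Counter

peacock_docs = [
    "legendary world renowned visionary leader",
    "award winning acclaimed innovative visionary pioneer",
    "best most influential renowned expert",
]
background_docs = [
    "the city is located in the northern region",
    "the species was first described in 1758",
]

def tfidf_scores(target_docs, all_docs):
    """Score terms in target_docs by term frequency * inverse document frequency."""
    n_docs = len(all_docs)
    df = Counter()
    for doc in all_docs:
        df.update(set(doc.split()))
    tf = Counter()
    for doc in target_docs:
        tf.update(doc.split())
    return {term: count * math.log(n_docs / df[term])
            for term, count in tf.items()}

scores = tfidf_scores(peacock_docs, peacock_docs + background_docs)
keywords = sorted(scores, key=scores.get, reverse=True)
print(keywords[:5])
```

Terms that recur in the flagged documents but are rare in the background corpus float to the top, which is the behavior we want from a promotional-tone keyword list.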
Sep 7 2024
Progress update
- Experiments:
- As planned, I studied whether the model fine-tuned to detect peacock behavior can also detect other promotion-related content issues, described in this data set.
- I ran the model on 4 other datasets: {{fanpov}}, {{advert}}, {{autobiography}}, {{weasel}}
- The results (see below) show behavior similar to the peacock detection task. The model shows good precision and low recall (lower for templates other than peacock). This suggests that there is information about promotional tone that the model can detect, and depending on the setup the model could prioritize precision or recall.
- Coordination:
- We had a meeting with Peter Pelberg, Nicola Ayub, and Megan Neisler to discuss next steps.
- First, we decided that the model needs to be tested against a simple baseline, which can be just a string-matching approach looking for common peacock keywords. I'll be working on this during the next week(s) (note I'll be OoO a few days during the next two weeks).
- Peter is going to decide whether we want to go deeper on this specific task and analyze other factors related to turning this model into a product (serving time, UX, etc.), or work on other tasks that involve ML and user experiences.
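The string-matching baseline could be as simple as the following sketch. The keyword list and labeled examples are made up for illustration:

```python
# Hedged sketch of a keyword string-matching baseline for peacock
# detection, evaluated with precision/recall on toy data.
PEACOCK_KEYWORDS = {"legendary", "renowned", "acclaimed", "world-class"}

def keyword_baseline(text):
    """Predict 'peacock' if any keyword appears in the text."""
    words = set(text.lower().split())
    return bool(words & PEACOCK_KEYWORDS)

# Toy labeled examples: (text, has_peacock_template)
examples = [
    ("a legendary and renowned artist", True),
    ("an acclaimed world-class performer", True),
    ("a painter born in 1901", False),
    ("a renowned physicist", True),
    ("a village in northern Spain", False),
]

tp = sum(keyword_baseline(t) and y for t, y in examples)
fp = sum(keyword_baseline(t) and not y for t, y in examples)
fn = sum(not keyword_baseline(t) and y for t, y in examples)
precision = tp / (tp + fp)
recall = tp / (tp + fn)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Comparing the fine-tuned model against this kind of baseline shows how much of its signal goes beyond surface keyword matching.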
Sep 4 2024
@achou, just out of curiosity, is the "predict time" the total end-to-end period, or total = preprocess + predict?
Aug 30 2024
Progress update
Aug 23 2024
Progress update
This looks great @jsn.sherman. Do you know if there is an overlap in the revisions that return an error for each model?
I'm just wondering if the ML model fails on very long diffs (given that it needs to process the text itself).
Aug 19 2024
Aug 14 2024
Aug 8 2024
@Samwalton9-WMF, just keep in mind that the scores from RRML and RRLA are different. This means that you may need to run new user tests to (re)define the thresholds.
Hi @Samwalton9-WMF, we chose RRLA because it was more stable, but since then we have made some updates to RRML (it was not only about serving time, but also errors on some revisions) aimed at making it more stable.
So, if there is interest in switching to RRML (for the 47 languages with coverage), my recommendation would be to run some stress tests on that service, measure the % of errors, and check whether Automoderator can tolerate them.
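A stress test like the one suggested could measure the error rate as sketched below. `query_model` here is a stand-in stub that fails randomly; a real test would call the RRML service instead:

```python
# Sketch of a stress test measuring a scoring service's error rate.
# query_model is a stub (random ~5% failures); a real run would hit
# the RRML endpoint and catch HTTP/timeout errors instead.
import random

random.seed(0)

def query_model(rev_id):
    """Stub for the model API call; fails ~5% of the time."""
    if random.random() < 0.05:
        raise RuntimeError("service error")
    return {"rev_id": rev_id, "score": random.random()}

def error_rate(rev_ids):
    errors = 0
    for rev_id in rev_ids:
        try:
            query_model(rev_id)
        except RuntimeError:
            errors += 1
    return errors / len(rev_ids)

rate = error_rate(range(1000))
print(f"error rate: {rate:.1%}")
```

The measured rate can then be compared against whatever error budget Automoderator can tolerate.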
Aug 2 2024
Progress update
Jul 26 2024
Progress update
- I've been coordinating with the ML-team to show code examples that make their (experimental) infrastructure fail. They will use this code as part of their use-case studies when testing new LLM infrastructure.
- In the meantime I've been working on writing code to fine-tune smaller Language Models, this requires:
- Data preprocessing and cleaning (done)
- Experimental design (done)
- Run experiments on stats machine (in progress)
- Met with the KR owner (Peter Pelberg) and explained the progress and next steps for this hypothesis.
Jul 23 2024
Hi! Apparently the data is missing again:
Jul 22 2024
Jul 19 2024
- Studied how to create prompts for Gemma2. Noticed the importance of using special tokens and format.
- Designed zero-shot experiment for detecting Peacock behavior.
- Wrote code for testing the Gemma2 instance hosted by the ML-team.
- The instance took more than 5 seconds per query.
- After a few requests (around 200) the instance stopped responding.
- I've reported this issue to the ML-Team; my understanding is that they will be working on fixing it during the next week (cc: Chris Albon)
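For reference, Gemma's chat format wraps the prompt in special turn tokens, which is the formatting detail noted above. A minimal sketch (the task wording is invented for illustration):

```python
# Sketch of building a Gemma-2-style prompt using its special turn
# tokens (<start_of_turn>/<end_of_turn>, per the Gemma chat format).
# The zero-shot task wording below is made up for illustration.
def build_gemma_prompt(article_text):
    instruction = (
        "Does the following Wikipedia text contain peacock "
        "(promotional) language? Answer yes or no.\n\n" + article_text
    )
    return (
        "<start_of_turn>user\n"
        f"{instruction}<end_of_turn>\n"
        "<start_of_turn>model\n"
    )

prompt = build_gemma_prompt("An acclaimed and visionary leader ...")
print(prompt)
```

Omitting these tokens, or using a generic chat template, tends to degrade instruction-tuned model output noticeably.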
Jul 18 2024
You are right @leila, we should merge them.
Jul 12 2024
Based on our previous research, we have created a dataset containing 9276 articles affected by peacock and other related policy violations on English Wikipedia. For each of them we have negative (no policy violations) and positive examples:
- autobiography: 1472
- fanpov: 350
- peacock: 2587
- weasel: 805
- advert: 4062
- Total: 9276
Jul 5 2024
I'm resolving this task; the model's deployment is tracked in T369371
Jul 4 2024
- covid-data.wmf-research-tools.eqiad1.wikimedia.cloud (this one is shut off, so maybe it just needs to be deleted?)
- wikipediaWikidata.wmf-research-tools.eqiad1.wikimedia.cloud
I've just removed these two
Thanks for this work @isarantopoulos!
Jun 28 2024
@Trokhymovych, please post here the models' performance results
To keep this task updated, models for Wikipedia are ready and can be found here:
@Trokhymovych has addressed the comments and submitted the merge request. The model binary can be found here.
I'm going to coordinate with research engineers to decide next steps.
Jun 25 2024
Just for the record, we have migrated the fact-checking API to another instance and deleted the old one.
Jun 24 2024
Jun 18 2024
Thanks @JAllemandou !
May 20 2024
@XiaoXiao-WMF can you please provide more context?
May 6 2024
@lbowmaker the proposed solution sounds ok to me. I have two questions:
May 3 2024
@lbowmaker if I understand correctly, there is no alternative for obtaining historical data for Wikidata edits? If that's the case, we can't keep the Wikidata Revert Risk model updated.
May 2 2024
Apr 29 2024
This task has been resolved, please follow the model deployment here: T363718
Apr 17 2024
This was solved. More details here T341820
Mar 1 2024
- We have improved the model's accuracy; currently I'm working on making the model faster, so it can work in real time.
Feb 29 2024
@kostajh, to the best of my knowledge @KStoller-WMF is leading this project. We had a meeting in January and I gave my input there. I think other teams that have run community testing processes can say more about this. Technically we could target a certain precision, which would involve different thresholds per wiki. Using the Knowledge Observatory data it should be easy to compute these numbers; however, maintenance could be hard, so my understanding was that the decision was to go with a single threshold for all wikis.
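Picking a per-wiki threshold that targets a fixed precision could look like the sketch below. The scores and labels are toy values; real numbers would come from the Knowledge Observatory data mentioned above:

```python
# Sketch: find the smallest score threshold whose predictions reach a
# target precision. Toy (score, was_reverted) pairs for illustration.
def threshold_for_precision(scored, target_precision):
    """Smallest threshold reaching target precision, else None."""
    for threshold in sorted({s for s, _ in scored}):
        preds = [(s >= threshold, y) for s, y in scored]
        tp = sum(p and y for p, y in preds)
        fp = sum(p and not y for p, y in preds)
        if tp and tp / (tp + fp) >= target_precision:
            return threshold
    return None

scored = [(0.2, False), (0.4, False), (0.6, True),
          (0.7, False), (0.8, True), (0.9, True)]
print(threshold_for_precision(scored, 0.75))
```

Running this per wiki gives the per-wiki thresholds discussed above; the maintenance cost comes from recomputing them whenever the score distributions drift.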
Feb 28 2024
Feb 17 2024
my two cents:
Feb 9 2024
- To improve the interaction between structured and text data, I'm experimenting with a full PyTorch approach.
Feb 5 2024
Jan 24 2024
Jan 22 2024
We are using this task as an umbrella for reporting improvements and our coordination with product teams regarding the Revert Risk models.
Given that the model proved to be good enough for the Automoderator project, and will also be integrated into the MediaWiki Recent Changes feed (T352217), I think we can resolve this task and report future updates related to revert risk in the EPIC task: T314384
Jan 20 2024
- We are preparing a new dataset (using diffs) to train the model.
- We are experimenting with language models, such as mBERT and LaBSE, to evaluate structured (claims) edits.
Jan 18 2024
@MunizaA, until we have enough training data we should treat temporary accounts as anonymous users. In practice this means overwriting temporary users' features.
So, basically
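A minimal sketch of what that overwrite could look like; all field names here are hypothetical, not the actual RR feature schema:

```python
# Hypothetical sketch: make temporary accounts look like anonymous
# users by overwriting their features before scoring. Field names
# are invented for illustration.
def normalize_user_features(features):
    """Return a copy where temporary accounts look anonymous."""
    out = dict(features)
    if out.get("user_is_temp"):
        out["user_is_anonymous"] = True
        # Drop registered-user signals temp accounts shouldn't carry.
        out["user_editcount"] = 0
        out["user_age_days"] = 0
    return out

row = {"user_is_temp": True, "user_is_anonymous": False,
       "user_editcount": 12, "user_age_days": 3}
print(normalize_user_features(row))
```

This keeps the trained model unchanged while routing temp-account edits through the code path the model already understands (anonymous users), until there is enough temp-account data to retrain.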
Jan 17 2024
Ideally, by the time we are deploying to pilot wikis, the model will understand that revisions made by temp accounts should be scored differently than if those revisions came from full accounts. I am not sure how much you'll be able to do, though, without a lot of real world data of temp account edits?
Dec 22 2023
- We have obtained 590 labels from 540 different revisions. Data is available here.
- This is the confusion matrix:
| 92 | 28 |
| 56 | 364 |
Given the following scores:
|  | Revert Risk | ORES |
| Precision | 0.93 | 0.91 |
| F1 | 0.90 | 0.91 |
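As a sanity check, the reported Revert Risk precision and F1 can be recomputed from the confusion matrix above, assuming the usual [[TN, FP], [FN, TP]] layout (an assumption, but it reproduces the reported numbers):

```python
# Recompute Revert Risk precision and F1 from the confusion matrix
# above, read in the [[TN, FP], [FN, TP]] layout.
tn, fp = 92, 28
fn, tp = 56, 364

precision = tp / (tp + fp)   # 364 / 392
recall = tp / (tp + fn)      # 364 / 420
f1 = 2 * precision * recall / (precision + recall)

print(f"precision={precision:.2f} f1={f1:.2f}")
```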
Ok! I understand.
Currently, Revert Risk uses several user features. I think the "revision count" could be used as a replacement for the "anonymous" field. However, probably the most straightforward solution would be to replace the "anonymous" column with a "temporary" column.
Dec 18 2023
Hi @kostajh, I'm not sure I'm understanding the question. Are you proposing to add the "user status" (temporary/full) as a feature in Revert Risk?
A few comments:
Dec 8 2023
- The Privacy Engineering team has reviewed the model and found no privacy-related concerns.
- The patch for adding Revert Risk to the recent changes feed has been merged. This enables the option of integrating Revert Risk into MediaWiki. Now I'm working on finding adequate thresholds for RR scores (T351897) to add the corresponding MediaWiki tags.
- Currently we have around 200 labels. WMDE is helping to increase this number.
- We are preparing a new dataset for training the RR Wikidata model.
Dec 1 2023
No updates this week.
- The Moderation Tools team is running tests and community discussions to implement the Automoderator project; we are coordinating with them to learn about potential areas of improvement for RR.
- There have been some community initiatives to evaluate the quality of the RR models. In T336934 a group of rowiki editors manually labeled a set of risky revisions. We have analyzed these results, which show reasonably good performance.
- The ML-team is working on integrating RRLA into the recent changes feed T348298. We are working on defining the best thresholds for this integration T351897.
- We have been working with Wikimedia Enterprise to clarify some doubts about the RRLA model T346095
Nov 24 2023
- I presented the Revert Risk Wikidata model and the Annotool at the WikiProject LD4 Wikidata gathering.
- We have started collecting new annotations in the second Wikidata labeling campaign. The campaign is available here. @Lydia_Pintscher is helping us find more annotators (thanks!).
Nov 23 2023
I can't think of a case where this is possible, but I'll have a look.
Anyhow, I've done some cleaning, merged the datasets, and then computed some scores:
These scores seem to be based on the prediction, not the score returned by the algorithm, so they seem a bit useless in the context of a reverter - the community will almost certainly not accept a 53% success rate. Can you advise on why you chose these and not the score-based results, which seem better?
I've done both; you can find them in the Jupyter notebook. But in summary, the precision is very similar (almost identical) to ORES rowiki-damaging.
Nov 21 2023
- Older revisions.
Concerns around our understanding of the model's limitations for older edits, given the training window. If a user is looking at taking a full snapshot of either our current corpus of Wikipedia or a past version, both include revisions from a broader window of time than the training window specifically, and may show "latent" bad revisions that either perform differently with the LiftWing model or go uncaught. I am curious what you may recommend to evaluate older content that could be vandalized without us knowing, due to a lack of revisions/content attention by editors.
I'm not completely sure I'm understanding your question. What I can say is that any model will have a certain time drift, and that includes RR and ORES. I think the model's precision would decay if we used it on very old data, but it probably tends toward a certain limit (I would assume the same is true for ORES, and that model is probably already working close to its limits). The Language Agnostic model shouldn't be difficult to run on a large old dataset; I understand that @fkaelin and @Pablo have been working on running the model on large data, so if you have a specific question to be answered, the four of us could try to design an experiment to answer it.
- Performance on different types of pages. You already addressed this in part, but what I mean by different page types isn’t necessarily subject-related (though cross-language data is helpful as well) but instead based on the metadata of the page.
How does the model typically perform on revisions in pages with low/high pageviews, low/high amounts of content, more/less edits, etc. This is less critical for our use-case, but we are imagining cases where a user may want to create their own filtering system based on their tolerance for risk and may want an approach that divides article approaches based on metadata.
Let us know if there are potential low-risk exercises we can collaborate on to subset the data.
I don't have such statistics; maybe the Knowledge Integrity Observatory has some data to answer this (@Pablo?)
- What to know that we do not know.
This is what I was trying to pull on with the question on use. If ORES had fallen out of style among some users and/or grown in use with others, why? If we can understand points of friction with use (Usability? Performance? Different approach needed?) it will help us integrate learnings as we design similar features (credibility signals, etc.).
I don't think there is a clear pattern here. I think the adoption/attrition of these tools is opportunistic, in the sense that people use them according to their needs. With no other options, people will use whatever is available. And even with more tools available, developers will use what fits better in their workflows, or even what they have seen working in the past. Unless there is a dramatic difference in accuracy between models, I don't think differences in model quality are easy for developers to assess.
Probably the attrition is related to the (lack of) success of the tools created using ML models, and not directly to the model itself (although model quality probably has an impact on a tool's success).
Great. Having some manual labels is always valuable.
I did a quick check and saw there are a few cases where the RR scores are not higher than 0.93. For example, this one:
Nov 14 2023
Hi @Strainu, this is Diego from the WMF Research team.
Nov 13 2023
We would also include @cwylo !
Nov 6 2023
Hi, this is Diego from Research. I'll take some of the questions you raised:
<3
Oct 31 2023
@MunizaA got it. Would this mean creating another project?