Enable MinT, Content and Section Translation for a 2nd group of 10 languages previously lacking machine translation
Closed, Resolved (Public)

Description

As machine translation becomes available in more languages with MinT and NLLB-200 models, we want to expose the improved translation tools more broadly to facilitate the creation of content for those communities.

This ticket selects a second set of languages, after T337290, that previously had no machine translation service available. It proposes to enable machine translation using MinT, set Content Translation as a default tool, and enable the mobile experience with Section Translation.

As part of this process, we'll generate the template parameter alignments (T221211) for these languages, enable access to the tool in test wiki, and communicate with the different communities to inform them about the changes.

These are the languages selected for this task:

  1. Crimean Tatar (crh/crh_Latn)
  2. Faroese (fo/fao_Latn)
  3. Friulian (fur/fur_Latn)
  4. Ligurian (lij/lij_Latn)
  5. Limburgish (li/lim_Latn)
  6. Lombard (lmo/lmo_Latn)
  7. Silesian (szl/szl_Latn)
  8. Shan (shn/shn_Mymr)
  9. Venetian (vec/vec_Latn)
  10. Waray (war/war_Latn)
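
The code pairs in the list above can be kept as a small lookup table. The following is a minimal sketch, not the actual cxserver or MinT configuration: it assumes that the first code in each pair is the wiki language code and the second is the model-side code (language plus script), and the names `WIKI_TO_MODEL_CODE`, `model_code`, and `script_of` are hypothetical.

```python
# Pairs copied from the list above: wiki language code -> second code in the pair.
# Treating the second code as the model-side label is an assumption.
WIKI_TO_MODEL_CODE = {
    "crh": "crh_Latn",  # Crimean Tatar
    "fo": "fao_Latn",   # Faroese
    "fur": "fur_Latn",  # Friulian
    "lij": "lij_Latn",  # Ligurian
    "li": "lim_Latn",   # Limburgish
    "lmo": "lmo_Latn",  # Lombard
    "szl": "szl_Latn",  # Silesian
    "shn": "shn_Mymr",  # Shan
    "vec": "vec_Latn",  # Venetian
    "war": "war_Latn",  # Waray
}


def model_code(wiki_code: str) -> str:
    """Return the model-side code for a wiki code; raises KeyError if unknown."""
    return WIKI_TO_MODEL_CODE[wiki_code]


def script_of(wiki_code: str) -> str:
    """Return the script suffix (the part after the underscore)."""
    return model_code(wiki_code).split("_", 1)[1]
```

Note that the two codes differ for Faroese (fo vs. fao) and Limburgish (li vs. lim), so the mapping cannot be derived by simply appending a script suffix to the wiki code.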

Steps:

  • Enable MinT for the selected languages.
  • Enable selected wikis on Test Wikipedia.
  • Enable Section Translation in the selected Wikipedias.
    • Crimean Tatar (crh/crh_Latn)
    • Faroese (fo/fao_Latn)
    • Friulian (fur/fur_Latn)
    • Ligurian (lij/lij_Latn)
    • Limburgish (li/lim_Latn)
    • Lombard (lmo/lmo_Latn)
    • Silesian (szl/szl_Latn)
    • Shan (shn/shn_Mymr)
    • Venetian (vec/vec_Latn)
    • Waray (war/war_Latn)

Next group of languages to enable:
T337834: Enable MinT, Content and Section Translation for a 3rd group of 10 languages previously lacking machine translation

Event Timeline

Change 924913 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/services/cxserver@master] Enable MinT for a 2nd group of 10 languages previously lacking machine translation

https://gerrit.wikimedia.org/r/924913

Change 924913 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Enable MinT for a 2nd group of 10 languages previously lacking machine translation

https://gerrit.wikimedia.org/r/924913

Change 925298 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update cxserver to 2023-06-01-041016-production

https://gerrit.wikimedia.org/r/925298

Change 925298 merged by jenkins-bot:

[operations/deployment-charts@master] Update cxserver to 2023-06-01-041016-production

https://gerrit.wikimedia.org/r/925298

Mentioned in SAL (#wikimedia-operations) [2023-06-01T05:39:50Z] <kart_> Updated cxserver to 2023-06-01-041016-production (T337669)

Change 925390 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/services/machinetranslation@master] Add mapping for crh (Crimean Tatar)

https://gerrit.wikimedia.org/r/925390

Change 926833 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/mediawiki-config@master] testwiki: Enable Section Translation for 10 Wikipedias

https://gerrit.wikimedia.org/r/926833

Change 925390 abandoned by KartikMistry:

[mediawiki/services/machinetranslation@master] Add mapping for crh (Crimean Tatar)

Reason:

This should be fixed in cxserver, not in MinT.

https://gerrit.wikimedia.org/r/925390

Change 927024 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/services/cxserver@master] Update language code for Crimean Tatar

https://gerrit.wikimedia.org/r/927024

Change 926833 merged by jenkins-bot:

[operations/mediawiki-config@master] testwiki: Enable Section Translation for 10 Wikipedias

https://gerrit.wikimedia.org/r/926833

Mentioned in SAL (#wikimedia-operations) [2023-06-05T07:28:23Z] <kartik@deploy1002> Started scap: Backport for [[gerrit:926833|testwiki: Enable Section Translation for 10 Wikipedias (T337669)]]

Mentioned in SAL (#wikimedia-operations) [2023-06-05T07:30:02Z] <kartik@deploy1002> kartik: Backport for [[gerrit:926833|testwiki: Enable Section Translation for 10 Wikipedias (T337669)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2023-06-05T07:38:22Z] <kartik@deploy1002> Finished scap: Backport for [[gerrit:926833|testwiki: Enable Section Translation for 10 Wikipedias (T337669)]] (duration: 09m 58s)

Change 927024 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Update language code for Crimean Tatar

https://gerrit.wikimedia.org/r/927024

Change 927144 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/extensions/ContentTranslation@master] Remove unused domain mapping for crh

https://gerrit.wikimedia.org/r/927144

Result of template parameter alignment script run:

sqlite> select count(*) from templates where source_lang='en' and target_lang='crh';
5
sqlite> select count(*) from templates where source_lang='en' and target_lang='fo';
76
sqlite> select count(*) from templates where source_lang='en' and target_lang='fur';
3
sqlite> select count(*) from templates where source_lang='en' and target_lang='li';
3
sqlite> select count(*) from templates where source_lang='en' and target_lang='lij';
8
sqlite> select count(*) from templates where source_lang='en' and target_lang='lmo';
8
sqlite> select count(*) from templates where source_lang='en' and target_lang='szl';
11
sqlite> select count(*) from templates where source_lang='en' and target_lang='vec';
6
sqlite> select count(*) from templates where source_lang='en' and target_lang='war';
9
sqlite> select count(*) from templates where source_lang='crh' and target_lang='en';
4
sqlite> select count(*) from templates where source_lang='fo' and target_lang='en';
7
sqlite> select count(*) from templates where source_lang='fur' and target_lang='en';
0
sqlite> select count(*) from templates where source_lang='li' and target_lang='en';
1
sqlite> select count(*) from templates where source_lang='lij' and target_lang='en';
6
sqlite> select count(*) from templates where source_lang='lmo' and target_lang='en';
0
sqlite> select count(*) from templates where source_lang='szl' and target_lang='en';
4
sqlite> select count(*) from templates where source_lang='vec' and target_lang='en';
4
sqlite> select count(*) from templates where source_lang='war' and target_lang='en';
0
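
The per-pair counts above could also be collected in one pass rather than query by query. This is a minimal sketch using Python's standard `sqlite3` module; it assumes only the `templates` table and its `source_lang` and `target_lang` columns seen in the queries. The `template` column, the sample rows, and the function names are hypothetical and exist only so the example runs on its own.

```python
import sqlite3

# Language codes queried in the session above.
LANGS = ["crh", "fo", "fur", "li", "lij", "lmo", "szl", "vec", "war"]


def count_alignments(conn, langs, pivot="en"):
    """Count rows in `templates` for pivot->lang and lang->pivot, per language."""
    query = (
        "SELECT COUNT(*) FROM templates "
        "WHERE source_lang = ? AND target_lang = ?"
    )
    counts = {}
    for lang in langs:
        for source, target in ((pivot, lang), (lang, pivot)):
            (n,) = conn.execute(query, (source, target)).fetchone()
            counts[(source, target)] = n
    return counts


def demo_connection():
    """Build an in-memory database with made-up rows (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE templates (source_lang TEXT, target_lang TEXT, template TEXT)"
    )
    conn.executemany(
        "INSERT INTO templates VALUES (?, ?, ?)",
        [
            ("en", "fo", "Infobox A"),
            ("en", "fo", "Infobox B"),
            ("fo", "en", "Infobox A"),
        ],
    )
    return conn
```

Against the real alignment database, `count_alignments(conn, LANGS)` would issue the same 18 queries shown above, with `conn` opened on the actual database file (its path is not given in this task).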

Change 927325 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/services/cxserver@master] Add template parameter alignments for 17 language pairs

https://gerrit.wikimedia.org/r/927325

Change 927144 merged by KartikMistry:

[mediawiki/extensions/ContentTranslation@master] Remove unused domain mapping for crh

https://gerrit.wikimedia.org/r/927144

Change 927325 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Add template parameter alignments for 17 language pairs

https://gerrit.wikimedia.org/r/927325

Change 927812 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update cxserver to 2023-06-07-044025-production

https://gerrit.wikimedia.org/r/927812

Change 927812 merged by jenkins-bot:

[operations/deployment-charts@master] Update cxserver to 2023-06-07-044025-production

https://gerrit.wikimedia.org/r/927812

Mentioned in SAL (#wikimedia-operations) [2023-06-07T05:28:33Z] <kart_> Updated cxserver to 2023-06-07-044025-production (T337290, T337669, T337834)

Hello @KartikMistry, communication with the marked wikis is complete except for Ligurian; I am still in contact with that community. Thank you!

Change 929741 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/mediawiki-config@master] Enable Content and Section Translation for a 2nd group of 9 languages previously lacking machine translation

https://gerrit.wikimedia.org/r/929741

Change 929741 merged by jenkins-bot:

[operations/mediawiki-config@master] Enable Content and Section Translation for a 2nd group of 9 languages previously lacking machine translation

https://gerrit.wikimedia.org/r/929741

Mentioned in SAL (#wikimedia-operations) [2023-06-14T07:04:36Z] <kartik@deploy1002> Started scap: Backport for [[gerrit:929741|Enable Content and Section Translation for a 2nd group of 9 languages previously lacking machine translation (T337669)]]

Mentioned in SAL (#wikimedia-operations) [2023-06-14T07:06:13Z] <kartik@deploy1002> kartik: Backport for [[gerrit:929741|Enable Content and Section Translation for a 2nd group of 9 languages previously lacking machine translation (T337669)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2023-06-14T07:18:12Z] <kartik@deploy1002> Finished scap: Backport for [[gerrit:929741|Enable Content and Section Translation for a 2nd group of 9 languages previously lacking machine translation (T337669)]] (duration: 13m 35s)

When trying to access Section translation on mobile for Crimean Tatar (https://crh.m.wikipedia.org/wiki/Special:ContentTranslation) the browser redirects me to Amharic Wikipedia instead.

Can you try again and check whether this is still happening? It is working fine for me.

This is no longer happening. Accessing the link now leads to Crimean Tatar without further redirects, as expected.
Thanks!

KartikMistry changed the task status from Open to In Progress. Aug 28 2023, 12:04 PM

Change 952846 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/mediawiki-config@master] Enable Content and Section translation in Ligurian Wikipedia

https://gerrit.wikimedia.org/r/952846

Change 952846 merged by jenkins-bot:

[operations/mediawiki-config@master] Enable Content and Section translation in Ligurian Wikipedia

https://gerrit.wikimedia.org/r/952846

Mentioned in SAL (#wikimedia-operations) [2023-08-29T07:03:43Z] <kartik@deploy1002> Started scap: Backport for [[gerrit:952846|Enable Content and Section translation in Ligurian Wikipedia (T337669)]]

Mentioned in SAL (#wikimedia-operations) [2023-08-29T07:07:38Z] <kartik@deploy1002> kartik: Backport for [[gerrit:952846|Enable Content and Section translation in Ligurian Wikipedia (T337669)]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option)

Mentioned in SAL (#wikimedia-operations) [2023-08-29T07:24:46Z] <kartik@deploy1002> Finished scap: Backport for [[gerrit:952846|Enable Content and Section translation in Ligurian Wikipedia (T337669)]] (duration: 21m 02s)