Commons:Dumps and backups

This page is intended to be the central place for public information about Wikimedia Commons data dumps and backups.

Dumps

Media files

As of 2024, there are no publicly available dumps of media files for download since about 2013. There is a request and an open Phabricator ticket to resume them again.

Partial scraping

Wikimedia Commons wiki content

All Wikimedia Commons wiki pages, including all of their past revisions in their History (excluding deleted ones), are included in XML dumps, which are generated on a regular basis, and publicly available for download at https://dumps.wikimedia.org/commonswiki/ (info).

Backups

Media files

All media files (including their past versions) in Wikimedia Commons are backed up in dedicated servers in both Wikimedia Foundation application data centers: Eqiad (Ashburn, Virginia, USA) and Codfw (Carrollton, Texas, USA). These backups are not accessible for the public (including registered Wikimedia users), only to Wikimedia Foundation staff, since they include deleted files and other data that can't be publicly accessible. Backups at each data center are fully independent from each other for redundancy reasons.

Wikimedia Commons wiki content

Wiki content of all Wikimedia wikis (including Commons), which is stored in MariaDB databases, is also backed up in both Wikimedia Foundation application data centers. Those backups also include full version history for all pages.

History

Before 2014, when a second facility for redundancy came online, all Wikimedia sites operated from a single application data center (there were, as there are now, more data centers for caching and optimal content distribution, but without any permanent data storage). While having geographical redundancy and XML text dumps in place, no true text-only database backups were implemented until 2020-2021, after several years of work. Backups for media files weren't in place until 2021-2022. Offline backups (for example, on tape) were featured as "coming next" in a Wikimedia Foundation presentation (slide 48), but there is no explicit mention of them in Wikitech nor Phabricator.

Commons:Dumps and backups

Contents

Dumps

Media files

Partial scraping

Wikimedia Commons wiki content

Backups

Media files

Wikimedia Commons wiki content

History

Sources

See also