
Springle (Sean Pringle)
Disabled

Projects

User Details

User Since
Oct 7 2014, 2:34 AM (529 w, 3 d)
Roles
Disabled
LDAP User
Springle
MediaWiki User
Unknown

Recent Activity

Aug 8 2018

jcrespo awarded T59186: Drop blob_tracking and blob_orphans everywhere a Love token.
Aug 8 2018, 6:58 AM · Patch-For-Review, DBA, MediaWiki-libs-Rdbms

May 21 2018

Springle added a comment to T179059: Consider skipping or modifying recombine step for page content dumps for wikidata.

Messing with bzip2, pbzip2, lbzip2 on snapshot001, looking at timing only (not archive size or non-default config).

May 21 2018, 3:00 AM · Patch-For-Review, Dumps-Generation

Apr 13 2018

Dzahn awarded T191478: Requesting access to shell (snapshot, dumpsdata) for springle a Like token.
Apr 13 2018, 11:57 PM · Patch-For-Review, SRE, SRE-Access-Requests

Apr 10 2018

Springle added a comment to T191478: Requesting access to shell (snapshot, dumpsdata) for springle.
ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAACAQCZwhGWhhv 9QdjhhShbLdSZSV349oFxPH73CfvI0jRsQFXsQIlPQaSeKcFqw kjhUoxvfgCw3YWoExHTT6jxHUxrOswI6ZVPeicHNBQ4kiRRY4uKE0xpqbdnkbLRSNWyru8zG1aB/uxpkhsQhwnUZ9fpGtDkXzX1In8NZ7X9jMQB6yrHFxqK/549WELGnpscL79lX7uKM2Ri/ v61th7kuDyn6VjsIMSLdt46dKoW9WgQ2UgkjEh67HOZd1FYt4V OaQcNr2JtHj7nSI6YsXx9TQnBrQVqWQXk63AFNxw4uD7xFVByc4FIqefIYjHqHANRWpRmaNOcj6LaBTqXZUBSmtYRiLkXUhqhr1Tf1NiE75UjGKhknucpywXTYI02HaTdEcdxfN4C9guI ojxwUKrIMEk9Wz3qcYzyN0QZmCL/6EcRxjEUzYDpEt0tMBRsRqE5Qp0TLPuDsK5trY1rtdzy/HckqmSik9N1p2WQ941SWs2EEiFji1jiCM4N8gwy1r6mf9xo5LWRVY/LtNYbCf/2EfW3mjreP9MaOGI vedcS8I4sd6O3VP8WPpXZtoBU1 EKLhEHvfp/E9qYYr6iWIltCFySi67fWlv83cUNezJ6uMrDR  g8ANkFJKEWSHJzdVyrtf2fiwNNyIPrkEawHAcKHsZsVGdzkP9Xr8eBb7Q== sean@laptop
Apr 10 2018, 12:50 PM · Patch-For-Review, SRE, SRE-Access-Requests

Apr 5 2018

Springle added a comment to T191478: Requesting access to shell (snapshot, dumpsdata) for springle.

Updated to a non-wmf email in phabricator profile settings.

Apr 5 2018, 2:16 AM · Patch-For-Review, SRE, SRE-Access-Requests

Apr 4 2018

Springle created T191478: Requesting access to shell (snapshot, dumpsdata) for springle.
Apr 4 2018, 11:21 PM · Patch-For-Review, SRE, SRE-Access-Requests

Nov 2 2016

jcrespo awarded T97641: WebVideoTranscodeJob is keeping database connections open for several minutes on s4 master a Cookie token.
Nov 2 2016, 5:23 PM · TimedMediaHandler-Transcode

Aug 28 2015

Springle updated subscribers of T62539: [Task] Convert wb_terms term_row_id from INT to BIGINT on wikidatawiki.
Aug 28 2015, 4:32 AM · Schema-change-in-production, DBA, Wikidata, Schema-change, Wikidata.org
Springle updated subscribers of T79922: Set up backup strategy for es clusters.
Aug 28 2015, 4:30 AM · Data-Persistence-Backup, Patch-For-Review, Goal, SRE
Springle updated subscribers of T86482: puppet stopped mysqld using orphan pid file from puppet agent.
Aug 28 2015, 4:28 AM · SRE, DBA
Springle updated subscribers of T88770: sleeper database connection surges during outage.
Aug 28 2015, 4:28 AM · Sustainability (Incident Followup), SRE, DBA, Incident-20150205-SiteOutage
Springle updated subscribers of T89986: Update GeoData schema.
Aug 28 2015, 4:27 AM · DBA, GeoData, Schema-change
Springle closed T96383: install/setup/deploy db2043-db2070 as Resolved.
Aug 28 2015, 4:27 AM · SRE, DBA
Springle updated subscribers of T96499: dbtree loads third party resources (from google.com/jsapi).
Aug 28 2015, 4:26 AM · Privacy Engineering, Privacy, HTTPS, SRE, Patch-For-Review, DBA, WMF-Legal
Springle updated subscribers of T92841: Database replicas: replicate user.user_touched.
Aug 28 2015, 4:25 AM · DBA, Cloud-Services
Springle closed T105879: db1022 duplicate key errors as Resolved.
Aug 28 2015, 4:25 AM · SRE, Patch-For-Review, DBA
Springle updated subscribers of T70942: Database upgrade MariaDB 10: Engine / Option mismatch on table `user_properties`.
Aug 28 2015, 4:23 AM · DBA, Cloud-Services, Cloud-VPS
Springle closed T71110: Database upgrade MariaDB 10: 600 seconds timeout as Resolved.
Aug 28 2015, 4:23 AM · Cloud-Services, Cloud-VPS
Springle updated subscribers of T71127: Discrepancies with logging table on different wikis.
Aug 28 2015, 4:22 AM · Data-Services, DBA
Springle updated subscribers of T71182: Database upgrade MariaDB 10: Metadata access in INFORMATION_SCHEMA causes complete blocks.
Aug 28 2015, 4:21 AM · DBA, Cloud-Services, Cloud-VPS
Springle updated subscribers of T85266: Look into Maria 10 parallel-replication.
Aug 28 2015, 4:21 AM · DBA, Sustainability
Springle updated subscribers of T71463: Create a table in labs with replication lag data.
Aug 28 2015, 4:20 AM · Patch-For-Review, DBA, Analytics-General-or-Unknown

Aug 24 2015

Springle added a comment to T108850: Set up auto-purging after 90 days {tick}.

Regarding implementing this on replicas: nothing special is needed. Change on the master will propagate (but rate-limit everything, to avoid replag).

Aug 24 2015, 10:01 AM · Analytics-Radar, User-Elukey, Patch-For-Review, DBA
Springle added a comment to T108850: Set up auto-purging after 90 days {tick}.

A white list is fine. No real difference from DBA perspective, so 1.

Aug 24 2015, 9:59 AM · Analytics-Radar, User-Elukey, Patch-For-Review, DBA
Springle added a comment to T108856: Set up bucketization of editCount fields {tick}.

@mforns, yes, feel free.

Aug 24 2015, 9:57 AM · Analytics-Radar, DBA

Aug 13 2015

Jcornelius awarded T69223: Schema change for page content language a Like token.
Aug 13 2015, 3:33 PM · Community-Wishlist-Survey-2016, TechCom-RFC (TechCom-RFC-Closed), Schema-change-in-production, Wikimedia-Site-requests, DBA, All-and-every-Wikisource

Aug 7 2015

Springle added a comment to T108255: Enable MariaDB/MySQL's Strict Mode.

It's correct in that MediaWiki still supports MySQL 5.0.2 [1] with a default SQL_MODE setting [2]. Previous discussions about enforcing strict mode haven't gone far, mainly due to questions about MW extensions.

Aug 7 2015, 2:36 AM · SRE-Sprint-Week-Sustainability-March2023, Epic, Beta-Cluster-Infrastructure, DBA, MediaWiki-libs-Rdbms

Jul 30 2015

Springle added a comment to T107282: Reduce memory commitment on database hosts with many objects, specially s3, dbstore/research and labs.

Just to clarify, I don't think we've seen actual OOM killer on s[1-7], right? The only front-line production concern is swapping on s3 db1035?

Jul 30 2015, 2:15 AM · Patch-For-Review, SRE, DBA
Springle added a comment to T107265: MIMEsearchPage::reallyDoQuery failing on the logs due to taking too long to query.

Also, all bot driven, eg: tide543.microsoft.com

Jul 30 2015, 1:34 AM · User-notice-archive, MW-1.28-release (WMF-deploy-2016-10-11_(1.28.0-wmf.22)), MW-1.28-release-notes, MW-1.27-release (WMF-deploy-2015-12-15_(1.27.0-wmf.9)), MW-1.27-release-notes, SRE, WMF-deploy-2015-08-04_(1.26wmf17), WMF-deploy-2015-07-28_(1.26wmf16), MW-1.26-release, Developer-notice, Patch-For-Review, Commons, MediaWiki-Special-pages
Springle added a comment to T107265: MIMEsearchPage::reallyDoQuery failing on the logs due to taking too long to query.

Continuing on db1042 right now and queries regularly hitting 5min limit. The LIMIT 10405000,501 is most of the problem here; shouldn't be possible to generate such a search.

Jul 30 2015, 1:33 AM · User-notice-archive, MW-1.28-release (WMF-deploy-2016-10-11_(1.28.0-wmf.22)), MW-1.28-release-notes, MW-1.27-release (WMF-deploy-2015-12-15_(1.27.0-wmf.9)), MW-1.27-release-notes, SRE, WMF-deploy-2015-08-04_(1.26wmf17), WMF-deploy-2015-07-28_(1.26wmf16), MW-1.26-release, Developer-notice, Patch-For-Review, Commons, MediaWiki-Special-pages

Jul 28 2015

Springle added a comment to T104476: Provision a labsdb useraccount that can be used to run replica-addusers.pl.

At Yuvi's request on IRC I added 'labsdbadmin'@'10.64.37.7' with the same permissions/password as @jcrespo added for 'labsdbadmin'@'10.64.37.6', since the catastrophic failure of the latter.

Jul 28 2015, 2:21 AM · DBA, Cloud-Services

Jul 27 2015

Springle added a comment to T105843: new external storage cluster(s).

+1 to the provisioning.

Jul 27 2015, 1:14 AM · SRE, hardware-requests, DBA

Jul 24 2015

Springle added a comment to T106647: mariadb multi-source replication glitch with site_identifiers.

@jcrespo, no, I did not out-of-band change or use skip counter. I found the machine exactly as you described on IRC, and only did research by dumping logs to get the query examples shown in ticket description.

Jul 24 2015, 2:04 AM · SRE, DBA

Jul 23 2015

Springle created T106647: mariadb multi-source replication glitch with site_identifiers.
Jul 23 2015, 6:06 AM · SRE, DBA

Jul 20 2015

Springle updated the task description for T106312: m1-master switch from db1001 to db1016.
Jul 20 2015, 2:49 AM · Patch-For-Review, SRE, DBA
Springle created T106312: m1-master switch from db1001 to db1016.
Jul 20 2015, 2:49 AM · Patch-For-Review, SRE, DBA
Springle added a comment to T105135: Implement mariadb 10.0 masters.

All s1-7 slaves are now 10.0.

Jul 20 2015, 2:38 AM · Patch-For-Review, SRE, DBA
Springle added a project to T105135: Implement mariadb 10.0 masters: acl*sre-team.
Jul 20 2015, 2:30 AM · Patch-For-Review, SRE, DBA
Springle added a comment to T105879: db1022 duplicate key errors.

@jcrespo, ouch, so maybe something is broken in our physical backup process (was it xtrabackup?) that required the SQL_SLAVE_SKIP_COUNTER after reinstall on 2015-06-29? Scary...

Jul 20 2015, 2:23 AM · SRE, Patch-For-Review, DBA

Jul 15 2015

Springle added a comment to T105879: db1022 duplicate key errors.

For now, I will depool db1022.

Jul 15 2015, 1:06 PM · SRE, Patch-For-Review, DBA
Springle created T105879: db1022 duplicate key errors.
Jul 15 2015, 1:06 PM · SRE, Patch-For-Review, DBA
Springle added a comment to T105713: missing database on replica server.

So, I need to get some more information from Jaime about what occurred on the weekend (he is unwell and on leave since then), but looking over the logs I see:

Jul 15 2015, 12:33 AM · Cloud-Services, Toolforge
Springle created T105843: new external storage cluster(s).
Jul 15 2015, 12:19 AM · SRE, hardware-requests, DBA

Jul 13 2015

Springle added a comment to T103110: db1050 raid degraded.

Also, yes, time to plan another batch.

Jul 13 2015, 12:43 AM · SRE, ops-eqiad
Springle added a comment to T103110: db1050 raid degraded.

Sounds good.

Jul 13 2015, 12:42 AM · SRE, ops-eqiad

Jul 10 2015

Springle created T105440: include primary key field as secondary index suffix.
Jul 10 2015, 5:01 AM · DBA

Jul 8 2015

Springle created T105135: Implement mariadb 10.0 masters.
Jul 8 2015, 10:49 AM · Patch-For-Review, SRE, DBA

Jul 6 2015

Springle lowered the priority of T59176: ApiQueryExtLinksUsage::run query has crazy limit from High to Medium.
Jul 6 2015, 4:51 AM · MW-1.32-notes (WMF-deploy-2018-05-22 (1.32.0-wmf.5)), MW-1.29-release-notes, Schema-change, DBA, MediaWiki-Action-API, Performance Issue, MediaWiki-libs-Rdbms
Springle added a comment to T104573: codfw frontends cannot connect to mysql at db2029.

A bunch of "unauthenticated user" in processlist still makes me suspect the thread pool, since that symptom has been seen on prod slaves with thread_pool_size=16 (but not the immediate all-connections-fail, which is indeed odd).

Jul 6 2015, 4:50 AM · SRE, DBA
Springle added a comment to T104699: Firewall configurations for database hosts.

Wasn't sure if you wanted to change that process :)

Jul 6 2015, 4:45 AM · DBA, SRE, Patch-For-Review

Jul 4 2015

Springle added a comment to T104699: Firewall configurations for database hosts.

Doing this to most production DBs seems straightforward. Pain points, due just to complexity, will be on M[1-4].

Jul 4 2015, 1:25 AM · DBA, SRE, Patch-For-Review
Springle created T104748: db1046 innodb signal 6 abort and restart.
Jul 4 2015, 1:11 AM · MediaWiki-extensions-EventLogging, DBA

Jul 3 2015

Springle updated subscribers of T104670: recentchanges deadlocks for dewiki (db1058).
Jul 3 2015, 1:17 AM · WMF-deploy-2015-07-14_(1.26wmf14), MW-1.26-release, Patch-For-Review, WMF-JobQueue, MediaWiki-Core-JobQueue, DBA
Springle created T104670: recentchanges deadlocks for dewiki (db1058).
Jul 3 2015, 1:15 AM · WMF-deploy-2015-07-14_(1.26wmf14), MW-1.26-release, Patch-For-Review, WMF-JobQueue, MediaWiki-Core-JobQueue, DBA
Springle added a comment to T104573: codfw frontends cannot connect to mysql at db2029.

(4) == EINTR on connect. Presumably the max_connections you observed, which in turn possibly something to do with:

Jul 3 2015, 12:32 AM · SRE, DBA

Jul 2 2015

Springle added a comment to T104471: Tables created on the s7 master have not been replicated to dbstore2001 and dbstore2002. Replication issue?.

Tried a restart of dbstore2002, but s7 replication behavior was unchanged: Yes/Yes for replication threads, master exec position advancing, yet no changes appearing.

Jul 2 2015, 12:13 AM · Patch-For-Review, DBA

Jun 19 2015

Springle created T103110: db1050 raid degraded.
Jun 19 2015, 3:35 PM · SRE, ops-eqiad

Jun 18 2015

Springle closed T95927: centralauth database on dbstore1002 is out of date, replication stuck? as Resolved.

Oops, sorry, this was fixed a while back.

Jun 18 2015, 5:30 AM · SRE, SUL-Finalization
Springle closed T102871: Create flow_workflow_update_timestamp index as Resolved.

Schema change done on x1-master flowdb.

Jun 18 2015, 5:26 AM · WMF-deploy-2015-06-23_(1.26wmf11), Patch-For-Review, Collaboration-Team-Triage, StructuredDiscussions, Schema-change
Springle closed T102871: Create flow_workflow_update_timestamp index, a subtask of T51188: [DO NOT USE] Schema changes for Wikimedia wikis (tracking) [superseded by #Blocked-on-schema-change], as Resolved.
Jun 18 2015, 5:26 AM · DBA, Tracking-Neverending, Schema-change
Springle claimed T102871: Create flow_workflow_update_timestamp index.
Jun 18 2015, 5:25 AM · WMF-deploy-2015-06-23_(1.26wmf11), Patch-For-Review, Collaboration-Team-Triage, StructuredDiscussions, Schema-change

Jun 17 2015

Springle triaged T102740: db1023 raid degraded as High priority.
Jun 17 2015, 5:43 AM · SRE, ops-eqiad
Springle created T102740: db1023 raid degraded.
Jun 17 2015, 5:36 AM · SRE, ops-eqiad

Jun 15 2015

Springle created T102532: revision page_user_timestamp index problematic on large wikis.
Jun 15 2015, 6:26 PM · DBA, Performance Issue, MediaWiki-libs-Rdbms

Jun 11 2015

Springle added a comment to T101937: setup logging for #wikimedia-databases.

We'll use something other than !log, to maintain -operations supremacy. This is more about quickly posting notes on database hosts -- and tendril is just an obvious place, for us -- since db maintenance tasks tend to be slow, have multiple stages, and are likely to be handed over to the next shift.

Jun 11 2015, 3:05 AM · Patch-For-Review, DBA

Jun 10 2015

Springle added a comment to T101937: setup logging for #wikimedia-databases.

So, to be clear, this isn't intended to:

Jun 10 2015, 8:03 AM · Patch-For-Review, DBA
Springle created T101937: setup logging for #wikimedia-databases.
Jun 10 2015, 7:05 AM · Patch-For-Review, DBA
Springle added a comment to T101915: Move private wikis to a dedicated cluster.

This would need a bit of downtime to migrate data, but in the order of hours, not days. Not especially difficult, and would slightly simplify the sanitarium/labs setup (though most of the complexity there is not the private wikis, which are easy to blanket-filter). +1

Jun 10 2015, 12:22 AM · DBA, Security, WMF-General-or-Unknown

Jun 8 2015

Springle added a comment to T101567: Rebuild s6 and s7 on labsdb1002.

If the box is taken down, keep the list in the loop.

Jun 8 2015, 6:30 AM · Patch-For-Review, DBA, Cloud-Services

Jun 7 2015

Springle added a comment to T101567: Rebuild s6 and s7 on labsdb1002.

I did indeed do a similar fix to get s6 going, and then sinned slightly by setting slave_exec_mode=idempotent to keep it alive for the weekend.

Jun 7 2015, 7:42 AM · Patch-For-Review, DBA, Cloud-Services

Jun 6 2015

Springle added a comment to T101516: Upgrade db1022, which has an older kernel.
  • Move vslow/dump to db1037 with load 0
  • Reinstall db1022 as trusty (or try out Jessie? we should do this eventually, and wmf-mariadb10 package is fine on Jessie)
  • Check which s6 slaves have small ibdata global tablespaces. If they're all still large (from pre file-per-table days), also consider a fresh s6 dump/reload on db1022 from which we can eventually reclone the others.
Jun 6 2015, 3:47 AM · SRE, DBA

Jun 5 2015

hashar awarded T98489: investigate HHVM mysqlExtension::ConnectTimeout a Yellow Medal token.
Jun 5 2015, 9:08 AM · SRE, Wikimedia-production-error, HHVM
jcrespo awarded T98489: investigate HHVM mysqlExtension::ConnectTimeout a Yellow Medal token.
Jun 5 2015, 7:43 AM · SRE, Wikimedia-production-error, HHVM
Springle added a comment to T98490: Add (list, title) index for Gather lists.

This can go out with T101460 at any time.

Jun 5 2015, 1:24 AM · Schema-change, Patch-For-Review, Gather
Springle added a comment to T101460: Add gather_list.gl_item_count.

The schema change only affects a few wikis[1] and small tables, so imo it can go out at any time.

Jun 5 2015, 1:17 AM · Mobile-Web-Sprint-49-Wayne's-World, Gather, Schema-change

Jun 4 2015

Springle added a comment to T98489: investigate HHVM mysqlExtension::ConnectTimeout.

While a proper config ini fix would be both welcome and necessary... have discussed with @Joe the possibility of just patching hhvm's hardcoded mysqlExtension::ConnectTimeout to 3000ms in our next build.

Jun 4 2015, 1:18 PM · SRE, Wikimedia-production-error, HHVM
Springle closed T101347: need 'spaces' database for phabricator as Resolved.

Upgrades should use the phadmin user. Pointed @mmodell in this direction via IRC.

Jun 4 2015, 1:25 AM · SRE, DBA, Phabricator

Jun 3 2015

Springle added a comment to T101009: Enabling automatic buffer pool dumping on start/stop (puppet) for all servers.

Nice :-) I see it can be easily aborted too, which was my only (vague) concern.

Jun 3 2015, 12:10 AM · SRE, Patch-For-Review, DBA

Jun 1 2015

Springle added a comment to T101009: Enabling automatic buffer pool dumping on start/stop (puppet) for all servers.

Sounds good? What are the cons?

Jun 1 2015, 6:17 PM · SRE, Patch-For-Review, DBA

May 28 2015

Springle added a comment to T99074: Limn language dashboard: eswiki graph is wrong/stuck.

Indeed, I reported S7 issues to analytics@ a couple weeks ago (they started after the full /tmp problems in late April), and have been jumping over some hurdles to rebuild S7 on analytics-store while the box is under load. Context:

May 28 2015, 4:21 AM · Analytics, ContentTranslation

May 27 2015

Springle added a comment to T99913: Fix unindexed queries doing full table scans during the reindexing maintenance script.

There isn't any 60s cut-off being applied server-side. There is only a 300s limit for wikiuser (but not wikiadmin).

May 27 2015, 7:44 AM · Discovery-ARCHIVED, CirrusSearch

May 25 2015

Springle triaged T100301: pc100[123] maintenance and upgrade as Medium priority.
May 25 2015, 1:25 PM · SRE
Springle created T100301: pc100[123] maintenance and upgrade.
May 25 2015, 1:25 PM · SRE

May 22 2015

Springle reassigned T94427: Perform schema change to echo_target_page changing from a 1 to 1 mapping between pages and user/notification to a 1 to many. from Springle to jcrespo.
May 22 2015, 10:34 AM · OKR-Work, Collaboration-Team-Archive-2015-2016, Collaboration-Team-Sprint-T-2015-04-08, Notifications, Schema-change
Springle raised the priority of T94427: Perform schema change to echo_target_page changing from a 1 to 1 mapping between pages and user/notification to a 1 to many. from Medium to High.
May 22 2015, 10:30 AM · OKR-Work, Collaboration-Team-Archive-2015-2016, Collaboration-Team-Sprint-T-2015-04-08, Notifications, Schema-change

May 18 2015

Springle raised the priority of T98489: investigate HHVM mysqlExtension::ConnectTimeout from Medium to High.
May 18 2015, 4:25 AM · SRE, Wikimedia-production-error, HHVM
Springle added a comment to T98339: m3 set max_allowed_packet to 33554432 or greater.

If we change this, check mysqldump command for m3 backups on dbstore1001.

May 18 2015, 4:20 AM · SRE, Patch-For-Review, Phabricator
Springle updated subscribers of T83609: create script & docs to rename wiki databases.

This needs investigation. @Reedy and @Dzahn have had past input. The issues are:

May 18 2015, 4:16 AM · Wikimedia-Site-requests, SRE, DBA
Springle reassigned T83609: create script & docs to rename wiki databases from Springle to jcrespo.
May 18 2015, 4:10 AM · Wikimedia-Site-requests, SRE, DBA
Springle added a comment to T84178: investigate RAID BBU auto-learn on db hosts.

This is made more complicated by the switch to HP boxes in CODFW. May be related to, or influenced by, T97998.

May 18 2015, 4:09 AM · SRE, DBA, Patch-For-Review
Springle reassigned T84178: investigate RAID BBU auto-learn on db hosts from Springle to jcrespo.
May 18 2015, 4:06 AM · SRE, DBA, Patch-For-Review
Springle reassigned T98339: m3 set max_allowed_packet to 33554432 or greater from Springle to jcrespo.
May 18 2015, 4:05 AM · SRE, Patch-For-Review, Phabricator
Springle created T99486: switch to innodb tables for replication state.
May 18 2015, 3:59 AM · SRE, DBA
Springle created T99485: implement performance_schema for mysql monitoring.
May 18 2015, 3:51 AM · Prometheus-metrics-monitoring, SRE, Patch-For-Review, DBA
Springle added a comment to T99459: ips_site_page is too short to store some (full) page titles.

Smaller fields are still better for performance, so I vote for a conservative 300, unless 400 is demonstrably needed.

May 18 2015, 1:54 AM · Schema-change, DBA, Wikidata, MediaWiki-extensions-WikibaseRepository

May 14 2015

Springle added a comment to T98998: Database connection failure issues on s7 shard.

Not an overreaction. But logical backups run on dbstore1001, not on the production shard itself, and the "dumps"[1], if that's what you saw, should only affect the s7 "vslow" slave.

May 14 2015, 12:21 AM · SRE, DBA

May 13 2015

Springle updated subscribers of T92693: Get Labs openstack service dbs on a proper db server.

Note that services should connect to a CNAME, m5-master.eqiad.wmnet. Check if we need any special network/vlan rules (think not, but since it is labs-related, ask @faidon)

May 13 2015, 1:24 PM · DBA, Patch-For-Review, Cloud-Services
Springle added a comment to T98958: Reinstall db1009.eqiad from zero.
  • add an m5-master.eqiad.wmnet CNAME to dns
May 13 2015, 1:22 PM · Patch-For-Review, Cloud-Services

May 12 2015

Springle added a comment to T92693: Get Labs openstack service dbs on a proper db server.

Probably we should set up an M5 cluster dedicated to databases for labs services, and also move pdns/designate over from M1 at the same time. A single EQIAD R510 master (db1009 is available) replicating to a CODFW slave.

May 12 2015, 10:21 AM · DBA, Patch-For-Review, Cloud-Services
Springle reassigned T92693: Get Labs openstack service dbs on a proper db server from Springle to jcrespo.
May 12 2015, 9:55 AM · DBA, Patch-For-Review, Cloud-Services