2025-09-04
07:23 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti3005.esams.wmnet with reason: host reimage [production]
07:21 <phuedx@deploy1003> Started scap sync-world: Backport for [[gerrit:1184549|MetricsPlatform: Enable overrides everywhere (T402369)]] [production]
07:20 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'configure' for AS: 5400 [production]
07:02 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti3005.esams.wmnet with OS bookworm [production]
06:46 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2045.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
06:36 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host cp2045.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
05:18 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1252 (T402925)', diff saved to https://phabricator.wikimedia.org/P82524 and previous config saved to /var/cache/conftool/dbconfig/20250904-051806-ladsgroup.json [production]
05:17 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1252.eqiad.wmnet with reason: Maintenance [production]
05:17 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1249 (T402925)', diff saved to https://phabricator.wikimedia.org/P82523 and previous config saved to /var/cache/conftool/dbconfig/20250904-051743-ladsgroup.json [production]
05:02 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P82522 and previous config saved to /var/cache/conftool/dbconfig/20250904-050235-ladsgroup.json [production]
04:47 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P82521 and previous config saved to /var/cache/conftool/dbconfig/20250904-044728-ladsgroup.json [production]
04:32 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1249 (T402925)', diff saved to https://phabricator.wikimedia.org/P82520 and previous config saved to /var/cache/conftool/dbconfig/20250904-043220-ladsgroup.json [production]
01:59 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1249 (T402925)', diff saved to https://phabricator.wikimedia.org/P82519 and previous config saved to /var/cache/conftool/dbconfig/20250904-015952-ladsgroup.json [production]
01:59 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1249.eqiad.wmnet with reason: Maintenance [production]
01:59 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248 (T402925)', diff saved to https://phabricator.wikimedia.org/P82518 and previous config saved to /var/cache/conftool/dbconfig/20250904-015929-ladsgroup.json [production]
01:44 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P82517 and previous config saved to /var/cache/conftool/dbconfig/20250904-014422-ladsgroup.json [production]
01:29 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P82516 and previous config saved to /var/cache/conftool/dbconfig/20250904-012914-ladsgroup.json [production]
01:14 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1248 (T402925)', diff saved to https://phabricator.wikimedia.org/P82515 and previous config saved to /var/cache/conftool/dbconfig/20250904-011407-ladsgroup.json [production]
01:13 <kemayo@deploy1003> Finished scap sync-world: Backport for [[gerrit:1184618|EditAttemptStep: don't error if something is blocking session logging (T403656)]], [[gerrit:1184619|EditAttemptStep: don't error if something is blocking session logging (T403656)]] (duration: 12m 07s) [production]
01:08 <kemayo@deploy1003> jforrester, kemayo: Continuing with sync [production]
01:06 <kemayo@deploy1003> jforrester, kemayo: Backport for [[gerrit:1184618|EditAttemptStep: don't error if something is blocking session logging (T403656)]], [[gerrit:1184619|EditAttemptStep: don't error if something is blocking session logging (T403656)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
01:01 <kemayo@deploy1003> Started scap sync-world: Backport for [[gerrit:1184618|EditAttemptStep: don't error if something is blocking session logging (T403656)]], [[gerrit:1184619|EditAttemptStep: don't error if something is blocking session logging (T403656)]] [production]
00:45 <jforrester@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
00:45 <jforrester@deploy1003> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
00:45 <jforrester@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
00:44 <jforrester@deploy1003> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
00:44 <jforrester@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
00:43 <jforrester@deploy1003> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
00:06 <krinkle@deploy1003> Finished scap sync-world: Backport for [[gerrit:1183700|Disable wmgUseMdotRouting on testwiki in prod (T401595)]] (duration: 09m 30s) [production]
00:01 <krinkle@deploy1003> krinkle: Continuing with sync [production]
00:00 <krinkle@deploy1003> krinkle: Backport for [[gerrit:1183700|Disable wmgUseMdotRouting on testwiki in prod (T401595)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
2025-09-03
23:57 <krinkle@deploy1003> Started scap sync-world: Backport for [[gerrit:1183700|Disable wmgUseMdotRouting on testwiki in prod (T401595)]] [production]
23:38 <denisse> Adding slack_bot_token to private repo - T401730 [production]
22:57 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1248 (T402925)', diff saved to https://phabricator.wikimedia.org/P82513 and previous config saved to /var/cache/conftool/dbconfig/20250903-225738-ladsgroup.json [production]
22:57 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1248.eqiad.wmnet with reason: Maintenance [production]
22:57 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247 (T402925)', diff saved to https://phabricator.wikimedia.org/P82512 and previous config saved to /var/cache/conftool/dbconfig/20250903-225714-ladsgroup.json [production]
22:42 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P82511 and previous config saved to /var/cache/conftool/dbconfig/20250903-224206-ladsgroup.json [production]
22:27 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P82510 and previous config saved to /var/cache/conftool/dbconfig/20250903-222659-ladsgroup.json [production]
22:23 <jdlrobson@deploy1003> Finished scap sync-world: Backport for [[gerrit:1184559|Cleanup special wikis (T400066)]] (duration: 11m 47s) [production]
22:18 <jdlrobson@deploy1003> jdlrobson: Continuing with sync [production]
22:16 <jdlrobson@deploy1003> jdlrobson: Backport for [[gerrit:1184559|Cleanup special wikis (T400066)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
22:11 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247 (T402925)', diff saved to https://phabricator.wikimedia.org/P82509 and previous config saved to /var/cache/conftool/dbconfig/20250903-221151-ladsgroup.json [production]
22:11 <jdlrobson@deploy1003> Started scap sync-world: Backport for [[gerrit:1184559|Cleanup special wikis (T400066)]] [production]
21:56 <jhathaway@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host backup1012.eqiad.wmnet with OS bookworm [production]
21:50 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host backup1012.eqiad.wmnet with OS bookworm [production]
21:43 <jhathaway@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host backup1012.eqiad.wmnet with OS bookworm [production]
21:37 <James_F> Running `mwscript-k8s -f -- extensions/WikiLambda/maintenance/updateSecondaryTables.php --wiki=wikifunctionswiki --quick --zType Z4 --verbose` to try to fix T403671 [production]
21:13 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:13 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: modifiy DNS for frm2002 and frdb2002 - pt1979@cumin2002" [production]
21:13 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: modifiy DNS for frm2002 and frdb2002 - pt1979@cumin2002" [production]