3001-3050 of 10000 results (49ms)
2024-05-14 §
07:48 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.worker.drain (exit_code=99) for node tools-k8s-worker-nfs-9 [tools]
07:48 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.worker.drain for node tools-k8s-worker-nfs-9 [tools]
07:46 <moritzm> installing libgd2 security updates [production]
07:44 <kartik@deploy1002> kartik: Continuing with sync [production]
07:42 <kartik@deploy1002> kartik: Backport for [[gerrit:1030325|CX: Add mw.cx.UserPermissionChecker (T349959)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:39 <kartik@deploy1002> Started scap: Backport for [[gerrit:1030325|CX: Add mw.cx.UserPermissionChecker (T349959)]] [production]
07:27 <kartik@deploy1002> Finished scap: Backport for [[gerrit:1030978|Set $wgSignatureValidation to 'disallow' on Polish Wikipedia (T364769)]] (duration: 18m 28s) [production]
07:17 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2185.codfw.wmnet with OS bookworm [production]
07:15 <kartik@deploy1002> kartik and msz2001: Continuing with sync [production]
07:12 <kartik@deploy1002> kartik and msz2001: Backport for [[gerrit:1030978|Set $wgSignatureValidation to 'disallow' on Polish Wikipedia (T364769)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:09 <kartik@deploy1002> Started scap: Backport for [[gerrit:1030978|Set $wgSignatureValidation to 'disallow' on Polish Wikipedia (T364769)]] [production]
07:04 <moritzm> installing glib2.0 security updates [production]
06:56 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2185.codfw.wmnet with reason: host reimage [production]
06:54 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2185.codfw.wmnet with reason: host reimage [production]
06:35 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db2185.codfw.wmnet with OS bookworm [production]
06:33 <marostegui@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host db2185.codfw.wmnet with OS bookworm [production]
06:33 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db2185.codfw.wmnet with OS bookworm [production]
05:31 <kart_> Updated cxserver to 2024-04-23-221507-production (T363263, T333969, T360303, T360310) [production]
05:25 <kartik@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
05:24 <kartik@deploy1002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
05:22 <kartik@deploy1002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
05:22 <kartik@deploy1002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
05:19 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
05:19 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
05:15 <kart_> Updated MinT to 2024-03-28-061726-production (T333969) [production]
05:08 <kartik@deploy1002> helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply [production]
04:59 <kartik@deploy1002> helmfile [eqiad] START helmfile.d/services/machinetranslation: apply [production]
04:33 <kartik@deploy1002> helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply [production]
04:25 <kartik@deploy1002> helmfile [codfw] START helmfile.d/services/machinetranslation: apply [production]
04:18 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
04:14 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
04:00 <mwpresync@deploy1002> Finished scap: testwikis wikis to 1.43.0-wmf.5 refs T361399 (duration: 57m 45s) [production]
03:02 <mwpresync@deploy1002> Started scap: testwikis wikis to 1.43.0-wmf.5 refs T361399 [production]
02:34 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
02:34 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
02:33 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T352010)', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20240514-023316-ladsgroup.json [production]
02:18 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P62375 and previous config saved to /var/cache/conftool/dbconfig/20240514-021809-ladsgroup.json [production]
02:03 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P62374 and previous config saved to /var/cache/conftool/dbconfig/20240514-020301-ladsgroup.json [production]
01:47 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T352010)', diff saved to https://phabricator.wikimedia.org/P62373 and previous config saved to /var/cache/conftool/dbconfig/20240514-014753-ladsgroup.json [production]
01:18 <ejegg> fundraising civicrm upgraded from c854dd3a to c7b0dfbb [production]
00:35 <tstarling@deploy1002> Finished scap: Fix SecurePoll exception T209892 and CodeMirror 5 RTL T363752 (duration: 14m 56s) [production]
00:20 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) [admin]
00:20 <tstarling@deploy1002> Started scap: Fix SecurePoll exception T209892 and CodeMirror 5 RTL T363752 [production]
00:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2152 (T364299)', diff saved to https://phabricator.wikimedia.org/P62372 and previous config saved to /var/cache/conftool/dbconfig/20240514-001956-marostegui.json [production]
00:19 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2152.codfw.wmnet with reason: Maintenance [production]
00:19 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db2152.codfw.wmnet with reason: Maintenance [production]
00:19 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack [admin]
00:18 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) [admin]
00:15 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack [admin]
2024-05-13 §
23:21 <wmbot~bd808@tools-bastion-12> Restarted gitlab job to pick up fix for T364719 [tools.wikibugs]