801-850 of 10000 results (82ms)
2024-07-23 §
07:08 <kartik@deploy1002> Started scap sync-world: Backport for [[gerrit:1055653|uzwiki: Limit publishing in CX to 'patroller' and 'sysop' groups (T370387)]] [production]
06:58 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
06:58 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
05:00 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2173 (T367856)', diff saved to https://phabricator.wikimedia.org/P66892 and previous config saved to /var/cache/conftool/dbconfig/20240723-050042-marostegui.json [production]
05:00 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db2186.codfw.wmnet with reason: Maintenance [production]
05:00 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db2186.codfw.wmnet with reason: Maintenance [production]
05:00 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2173.codfw.wmnet with reason: Maintenance [production]
05:00 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2173.codfw.wmnet with reason: Maintenance [production]
05:00 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66891 and previous config saved to /var/cache/conftool/dbconfig/20240723-050004-marostegui.json [production]
04:44 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P66890 and previous config saved to /var/cache/conftool/dbconfig/20240723-044457-marostegui.json [production]
04:29 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P66889 and previous config saved to /var/cache/conftool/dbconfig/20240723-042950-marostegui.json [production]
04:14 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66888 and previous config saved to /var/cache/conftool/dbconfig/20240723-041442-marostegui.json [production]
04:01 <mwpresync@deploy1002> Pruned MediaWiki: 1.43.0-wmf.12 (duration: 01m 00s) [production]
03:54 <mwpresync@deploy1002> Finished scap: testwikis to 1.43.0-wmf.15 refs T366960 (duration: 51m 50s) [production]
03:03 <mwpresync@deploy1002> Started scap sync-world: testwikis to 1.43.0-wmf.15 refs T366960 [production]
01:28 <dani@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
01:27 <dani@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
01:27 <dani@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
01:27 <dani@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
01:27 <dani@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
01:27 <dani@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
01:24 <dani@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
00:22 <eevans@deploy1002> helmfile [staging] DONE helmfile.d/services/data-gateway: apply [production]
00:22 <eevans@deploy1002> helmfile [staging] START helmfile.d/services/data-gateway: apply [production]
00:05 <cmooney@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM netflow2003.codfw.wmnet [production]
00:02 <cmooney@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM netflow2003.codfw.wmnet [production]
00:00 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on netflow2003.codfw.wmnet with reason: reboot netflow2003 [production]
00:00 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 0:15:00 on netflow2003.codfw.wmnet with reason: reboot netflow2003 [production]
2024-07-22 §
23:08 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "set lsw in codfw to active - cmooney@cumin1002" [production]
23:07 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "set lsw in codfw to active - cmooney@cumin1002" [production]
23:05 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
23:03 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
22:47 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephmon1005.eqiad.wmnet with OS bullseye [production]
22:38 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephmon1004.eqiad.wmnet with OS bullseye [production]
22:37 <bking@cumin2002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: elastic110[0-2]* for T348977 - bking@cumin2002 [production]
22:36 <bking@cumin2002> START - Cookbook sre.elasticsearch.ban Banning hosts: elastic110[0-2]* for T348977 - bking@cumin2002 [production]
22:35 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic[1100-1102].eqiad.wmnet with reason: T348977 [production]
22:34 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic[1100-1102].eqiad.wmnet with reason: T348977 [production]
22:01 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephmon1005.eqiad.wmnet with OS bullseye [production]