1-50 of 10000 results (101ms)
2025-11-22 §
09:17 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1185 (T410589)', diff saved to https://phabricator.wikimedia.org/P85458 and previous config saved to /var/cache/conftool/dbconfig/20251122-091726-ladsgroup.json [production]
09:17 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1185.eqiad.wmnet with reason: Maintenance [production]
09:17 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T410589)', diff saved to https://phabricator.wikimedia.org/P85457 and previous config saved to /var/cache/conftool/dbconfig/20251122-091703-ladsgroup.json [production]
09:01 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P85456 and previous config saved to /var/cache/conftool/dbconfig/20251122-090155-ladsgroup.json [production]
08:46 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P85455 and previous config saved to /var/cache/conftool/dbconfig/20251122-084647-ladsgroup.json [production]
08:31 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T410589)', diff saved to https://phabricator.wikimedia.org/P85454 and previous config saved to /var/cache/conftool/dbconfig/20251122-083140-ladsgroup.json [production]
01:01 <ejegg> fundraising python tools rolled back from fe42b9a2 to 773e8d11 [production]
01:01 <ejegg> fundraising civicrm rolled back from 11e95839 to e4748b9f [production]
01:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
00:16 <rzl@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
00:16 <rzl@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
00:12 <rzl@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
00:12 <rzl@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
00:00 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db1161 (T410589)', diff saved to https://phabricator.wikimedia.org/P85453 and previous config saved to /var/cache/conftool/dbconfig/20251122-000026-ladsgroup.json [production]
00:00 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
00:00 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T410589)', diff saved to https://phabricator.wikimedia.org/P85452 and previous config saved to /var/cache/conftool/dbconfig/20251122-000001-ladsgroup.json [production]
2025-11-21 §
23:44 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P85451 and previous config saved to /var/cache/conftool/dbconfig/20251121-234454-ladsgroup.json [production]
23:29 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P85450 and previous config saved to /var/cache/conftool/dbconfig/20251121-232946-ladsgroup.json [production]
23:14 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T410589)', diff saved to https://phabricator.wikimedia.org/P85449 and previous config saved to /var/cache/conftool/dbconfig/20251121-231439-ladsgroup.json [production]
22:53 <rzl@deploy2002> helmfile [staging] DONE helmfile.d/services/apertium: apply [production]
22:53 <rzl@deploy2002> helmfile [staging] START helmfile.d/services/apertium: apply [production]
22:49 <inflatador> bking@wdqs2007 roll-restart wdqs CODFW for high lag https://w.wiki/GDad [production]
22:24 <inflatador> bking@wdqs1011 `systemctl restart wdqs-blazegraph.service` (responding to ProbeDown) [production]
22:19 <ejegg> fundraising python tools upgraded from 773e8d11 to fe42b9a2 [production]
22:16 <bking@cumin2002> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
22:14 <ejegg> civicrm upgraded from e4748b9f to 11e95839 [production]
22:04 <bking@cumin2002> START - Cookbook sre.wdqs.restart [production]
22:03 <bking@cumin2002> END (ERROR) - Cookbook sre.wdqs.restart (exit_code=97) [production]
22:03 <bking@cumin2002> START - Cookbook sre.wdqs.restart [production]
21:03 <andrew@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host cloudidp2001-dev.codfw.wmnet [production]
21:03 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudidp2001-dev.codfw.wmnet with OS trixie [production]
20:45 <robh@cumin2002> END (ERROR) - Cookbook sre.hardware.upgrade-firmware (exit_code=97) upgrade firmware for hosts sretest1002.eqiad.wmnet [production]
20:45 <robh@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sretest1002.eqiad.wmnet [production]
20:45 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudidp2001-dev.codfw.wmnet with OS trixie [production]
20:45 <andrew@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM cloudidp2001-dev.codfw.wmnet - andrew@cumin2002" [production]
20:44 <andrew@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM cloudidp2001-dev.codfw.wmnet - andrew@cumin2002" [production]
20:44 <andrew@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cloudidp2001-dev.codfw.wmnet on all recursors [production]
20:44 <andrew@cumin2002> START - Cookbook sre.dns.wipe-cache cloudidp2001-dev.codfw.wmnet on all recursors [production]
20:44 <andrew@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:44 <andrew@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM cloudidp2001-dev.codfw.wmnet - andrew@cumin2002" [production]
20:42 <andrew@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM cloudidp2001-dev.codfw.wmnet - andrew@cumin2002" [production]
20:40 <mutante> zuul2002 - rm /lib/systemd/system/zuul* ; systemctl daemon-reload ; systemctl reset-failed - fixes T410756 [production]
20:35 <andrew@cumin2002> START - Cookbook sre.dns.netbox [production]
20:35 <andrew@cumin2002> START - Cookbook sre.ganeti.makevm for new host cloudidp2001-dev.codfw.wmnet [production]
18:38 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudidp2001-dev.codfw.wmnet with OS trixie [production]
18:27 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host relforge1010.eqiad.wmnet with OS bookworm [production]
18:24 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudidp2001-dev.codfw.wmnet with OS trixie [production]
18:23 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host relforge1009.eqiad.wmnet with OS bookworm [production]
18:12 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on relforge1010.eqiad.wmnet with reason: host reimage [production]
18:12 <bking@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host relforge1008.eqiad.wmnet with OS bookworm [production]