51-100 of 10000 results (103ms)
2025-11-19 ยง
09:36 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:36 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM hcaptcha-proxy7001.wikimedia.org - jmm@cumin2002" [production]
09:35 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host hcaptcha-proxy1002.wikimedia.org [production]
09:31 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host hcaptcha-proxy1002.wikimedia.org [production]
09:31 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM hcaptcha-proxy7001.wikimedia.org - jmm@cumin2002" [production]
09:24 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host hcaptcha-proxy1001.wikimedia.org [production]
09:20 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
09:20 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host hcaptcha-proxy7001.wikimedia.org [production]
09:20 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host hcaptcha-proxy1001.wikimedia.org [production]
09:04 <kharlan@deploy2002> Finished scap sync-world: Backport for [[gerrit:1206906|hCaptcha: Validate sitekey of /siteverify API call (T410024)]] (duration: 10m 32s) [production]
09:00 <kharlan@deploy2002> kharlan: Continuing with sync [production]
08:58 <kharlan@deploy2002> kharlan: Backport for [[gerrit:1206906|hCaptcha: Validate sitekey of /siteverify API call (T410024)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:58 <filippo@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcontrol2010-dev.codfw.wmnet with OS trixie [production]
08:54 <kharlan@deploy2002> Started scap sync-world: Backport for [[gerrit:1206906|hCaptcha: Validate sitekey of /siteverify API call (T410024)]] [production]
08:35 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for backup[1006-1007].eqiad.wmnet,ms-backup[1001-1002].eqiad.wmnet [production]
08:35 <jynus@cumin1003> START - Cookbook sre.hosts.remove-downtime for backup[1006-1007].eqiad.wmnet,ms-backup[1001-1002].eqiad.wmnet [production]
08:17 <dcausse@deploy2002> Finished scap sync-world: Backport for [[gerrit:1205130|cirrus: index field to sort on title (T40403)]] (duration: 13m 42s) [production]
08:13 <filippo@cumin1003> START - Cookbook sre.hosts.reimage for host cloudcontrol2010-dev.codfw.wmnet with OS trixie [production]
08:12 <dcausse@deploy2002> dcausse: Continuing with sync [production]
08:12 <filippo@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcontrol2010-dev.codfw.wmnet with OS trixie [production]
08:09 <dcausse@deploy2002> dcausse: Backport for [[gerrit:1205130|cirrus: index field to sort on title (T40403)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['db1169.eqiad.wmnet'] [production]
08:08 <jmm@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db1169.eqiad.wmnet'] [production]
08:04 <dcausse@deploy2002> Started scap sync-world: Backport for [[gerrit:1205130|cirrus: index field to sort on title (T40403)]] [production]
07:59 <moritzm> started OSM import on maps-test2001 T409528 [production]
07:37 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1189 gradually with 4 steps - Repooling after switchover [production]
07:06 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool pc4', diff saved to https://phabricator.wikimedia.org/P85380 and previous config saved to /var/cache/conftool/dbconfig/20251119-070656-marostegui.json [production]
07:05 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1189.eqiad.wmnet,pc2014.codfw.wmnet,pc1014.eqiad.wmnet with reason: network maintenance [production]
06:52 <marostegui@cumin1003> START - Cookbook sre.mysql.pool db1189 gradually with 4 steps - Repooling after switchover [production]
06:52 <marostegui@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) db1189 gradually with 4 steps - Repooling after switchover [production]
06:48 <marostegui@cumin1003> START - Cookbook sre.mysql.pool db1189 gradually with 4 steps - Repooling after switchover [production]
06:48 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db1189 T410283', diff saved to https://phabricator.wikimedia.org/P85378 and previous config saved to /var/cache/conftool/dbconfig/20251119-064838-marostegui.json [production]
06:47 <marostegui@cumin1003> dbctl commit (dc=all): 'Promote db1223 to s3 primary T410283', diff saved to https://phabricator.wikimedia.org/P85377 and previous config saved to /var/cache/conftool/dbconfig/20251119-064755-marostegui.json [production]
06:47 <marostegui> Starting s3 eqiad failover from db1189 to db1223 - T410283 [production]
06:41 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s3 T410283 [production]
06:40 <marostegui@cumin1003> dbctl commit (dc=all): 'Set db1223 with weight 0 T410283', diff saved to https://phabricator.wikimedia.org/P85376 and previous config saved to /var/cache/conftool/dbconfig/20251119-064055-marostegui.json [production]
06:35 <marostegui@cumin1003> dbctl commit (dc=all): 'Repool pc1 after network maint', diff saved to https://phabricator.wikimedia.org/P85375 and previous config saved to /var/cache/conftool/dbconfig/20251119-063522-marostegui.json [production]
06:28 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2144.codfw.wmnet,db1151.eqiad.wmnet with reason: db2144 went down [production]
06:27 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool ms2', diff saved to https://phabricator.wikimedia.org/P85374 and previous config saved to /var/cache/conftool/dbconfig/20251119-062728-marostegui.json [production]
06:26 <marostegui@cumin1003> dbctl commit (dc=all): 'Repool ms3 T405942', diff saved to https://phabricator.wikimedia.org/P85373 and previous config saved to /var/cache/conftool/dbconfig/20251119-062634-marostegui.json [production]
06:25 <marostegui@cumin1003> dbctl commit (dc=all): 'Repool ms3 T405942', diff saved to https://phabricator.wikimedia.org/P85372 and previous config saved to /var/cache/conftool/dbconfig/20251119-062509-marostegui.json [production]
03:09 <eileen> civicrm upgraded from bc100d63 to f471a3ec [production]
02:59 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1074.eqiad.wmnet with OS trixie [production]
02:46 <eileen> config revision changed from c3e95b76 to 8b1a290c [production]
01:53 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1074.eqiad.wmnet with reason: host reimage [production]
01:50 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1074.eqiad.wmnet with reason: host reimage [production]
01:35 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudvirt1074.eqiad.wmnet with OS trixie [production]
01:23 <andrew@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cloudvirt1074.eqiad.wmnet'] [production]
01:23 <andrew@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudvirt1074.eqiad.wmnet'] [production]
01:18 <andrew@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cloudvirt1074.eqiad.wmnet'] [production]