|
2025-11-19
ยง
|
| 09:31 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM hcaptcha-proxy7001.wikimedia.org - jmm@cumin2002" |
[production] |
| 09:29 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.instance.force_reboot (exit_code=0) vm paws-127c-uwce57bvcgrt-node-1 (cluster eqiad1, project paws) |
[paws] |
| 09:29 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.vps.instance.force_reboot vm paws-127c-uwce57bvcgrt-node-1 (cluster eqiad1, project paws) |
[paws] |
| 09:24 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host hcaptcha-proxy1001.wikimedia.org |
[production] |
| 09:20 |
<jmm@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
| 09:20 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host hcaptcha-proxy7001.wikimedia.org |
[production] |
| 09:20 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host hcaptcha-proxy1001.wikimedia.org |
[production] |
| 09:04 |
<kharlan@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1206906|hCaptcha: Validate sitekey of /siteverify API call (T410024)]] (duration: 10m 32s) |
[production] |
| 09:00 |
<kharlan@deploy2002> |
kharlan: Continuing with sync |
[production] |
| 08:58 |
<kharlan@deploy2002> |
kharlan: Backport for [[gerrit:1206906|hCaptcha: Validate sitekey of /siteverify API call (T410024)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:58 |
<filippo@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcontrol2010-dev.codfw.wmnet with OS trixie |
[production] |
| 08:54 |
<kharlan@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1206906|hCaptcha: Validate sitekey of /siteverify API call (T410024)]] |
[production] |
| 08:35 |
<jynus@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for backup[1006-1007].eqiad.wmnet,ms-backup[1001-1002].eqiad.wmnet |
[production] |
| 08:35 |
<jynus@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for backup[1006-1007].eqiad.wmnet,ms-backup[1001-1002].eqiad.wmnet |
[production] |
| 08:17 |
<dcausse@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1205130|cirrus: index field to sort on title (T40403)]] (duration: 13m 42s) |
[production] |
| 08:13 |
<filippo@cumin1003> |
START - Cookbook sre.hosts.reimage for host cloudcontrol2010-dev.codfw.wmnet with OS trixie |
[production] |
| 08:12 |
<dcausse@deploy2002> |
dcausse: Continuing with sync |
[production] |
| 08:12 |
<filippo@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcontrol2010-dev.codfw.wmnet with OS trixie |
[production] |
| 08:09 |
<dcausse@deploy2002> |
dcausse: Backport for [[gerrit:1205130|cirrus: index field to sort on title (T40403)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:09 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['db1169.eqiad.wmnet'] |
[production] |
| 08:08 |
<jmm@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db1169.eqiad.wmnet'] |
[production] |
| 08:04 |
<dcausse@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1205130|cirrus: index field to sort on title (T40403)]] |
[production] |
| 08:04 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-nginx |
[tools] |
| 08:01 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component ingress-nginx |
[tools] |
| 07:59 |
<moritzm> |
started OSM import on maps-test2001 T409528 |
[production] |
| 07:37 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1189 gradually with 4 steps - Repooling after switchover |
[production] |
| 07:06 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool pc4', diff saved to https://phabricator.wikimedia.org/P85380 and previous config saved to /var/cache/conftool/dbconfig/20251119-070656-marostegui.json |
[production] |
| 07:05 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1189.eqiad.wmnet,pc2014.codfw.wmnet,pc1014.eqiad.wmnet with reason: network maintenance |
[production] |
| 06:52 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool db1189 gradually with 4 steps - Repooling after switchover |
[production] |
| 06:52 |
<marostegui@cumin1003> |
END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) db1189 gradually with 4 steps - Repooling after switchover |
[production] |
| 06:48 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool db1189 gradually with 4 steps - Repooling after switchover |
[production] |
| 06:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1189 T410283', diff saved to https://phabricator.wikimedia.org/P85378 and previous config saved to /var/cache/conftool/dbconfig/20251119-064838-marostegui.json |
[production] |
| 06:47 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db1223 to s3 primary T410283', diff saved to https://phabricator.wikimedia.org/P85377 and previous config saved to /var/cache/conftool/dbconfig/20251119-064755-marostegui.json |
[production] |
| 06:47 |
<marostegui> |
Starting s3 eqiad failover from db1189 to db1223 - T410283 |
[production] |
| 06:41 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s3 T410283 |
[production] |
| 06:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db1223 with weight 0 T410283', diff saved to https://phabricator.wikimedia.org/P85376 and previous config saved to /var/cache/conftool/dbconfig/20251119-064055-marostegui.json |
[production] |
| 06:35 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repool pc1 after network maint', diff saved to https://phabricator.wikimedia.org/P85375 and previous config saved to /var/cache/conftool/dbconfig/20251119-063522-marostegui.json |
[production] |
| 06:28 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2144.codfw.wmnet,db1151.eqiad.wmnet with reason: db2144 went down |
[production] |
| 06:27 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool ms2', diff saved to https://phabricator.wikimedia.org/P85374 and previous config saved to /var/cache/conftool/dbconfig/20251119-062728-marostegui.json |
[production] |
| 06:26 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repool ms3 T405942', diff saved to https://phabricator.wikimedia.org/P85373 and previous config saved to /var/cache/conftool/dbconfig/20251119-062634-marostegui.json |
[production] |
| 06:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repool ms3 T405942', diff saved to https://phabricator.wikimedia.org/P85372 and previous config saved to /var/cache/conftool/dbconfig/20251119-062509-marostegui.json |
[production] |
| 03:21 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1050.eqiad.wmnet' |
[admin] |
| 03:09 |
<eileen> |
civicrm upgraded from bc100d63 to f471a3ec |
[production] |
| 03:04 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1050.eqiad.wmnet' |
[admin] |
| 03:04 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1046.eqiad.wmnet' |
[admin] |
| 03:00 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1046.eqiad.wmnet' |
[admin] |
| 02:59 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1074.eqiad.wmnet with OS trixie |
[production] |
| 02:46 |
<eileen> |
config revision changed from c3e95b76 to 8b1a290c |
[production] |
| 02:14 |
<wmftkbot> |
Test Kitchen mw-user experiment (poll 1) - adds: we-3-3-4-reading-list-test1, we-3-3-4-reading-list-test1-en, growthexperiments-get-started-notification; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD |
[analytics] |
| 02:14 |
<wmftkbot> |
Test Kitchen edge-unique experiments (poll 1) - adds: fy25-26-we-4-2-hcaptcha-editing, hcaptcha-on-french-wikipedia, fy2025-26-we3.1-image-browsing-ab-test, image-browsing-enwiki; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD |
[analytics] |