2025-04-14
10:35 |
<vgutierrez> |
upload varnish 7.1.1-1.1~bpo11+wmf3 to apt.wm.o (bullseye-wikimedia) - T391334 |
[production] |
10:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P74933 and previous config saved to /var/cache/conftool/dbconfig/20250414-103253-root.json |
[production] |
10:23 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P74932 and previous config saved to /var/cache/conftool/dbconfig/20250414-102316-fceratto.json |
[production] |
10:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P74931 and previous config saved to /var/cache/conftool/dbconfig/20250414-101748-root.json |
[production] |
10:15 |
<ladsgroup@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1136339|Bump thumbnail steps to 90% (T360589)]], [[gerrit:1135835|CommonSettings: remove outdated SecurePoll comment (T209892)]] |
[production] |
10:08 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1157 (T391056)', diff saved to https://phabricator.wikimedia.org/P74930 and previous config saved to /var/cache/conftool/dbconfig/20250414-100809-fceratto.json |
[production] |
10:04 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1157 (T391056)', diff saved to https://phabricator.wikimedia.org/P74929 and previous config saved to /var/cache/conftool/dbconfig/20250414-100412-fceratto.json |
[production] |
10:04 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1157.eqiad.wmnet with reason: Maintenance |
[production] |
10:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P74928 and previous config saved to /var/cache/conftool/dbconfig/20250414-100242-root.json |
[production] |
10:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repool pc1', diff saved to https://phabricator.wikimedia.org/P74927 and previous config saved to /var/cache/conftool/dbconfig/20250414-100135-marostegui.json |
[production] |
10:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool pc1', diff saved to https://phabricator.wikimedia.org/P74925 and previous config saved to /var/cache/conftool/dbconfig/20250414-100038-marostegui.json |
[production] |
09:58 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1150.eqiad.wmnet with reason: Maintenance |
[production] |
09:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P74924 and previous config saved to /var/cache/conftool/dbconfig/20250414-094737-root.json |
[production] |
09:35 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db2220 gradually with 4 steps - Finished upgrading host |
[production] |
09:33 |
<vgutierrez> |
restarting acme-chief API servers to catch up on liblzma updates |
[production] |
09:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 30%: Repooling', diff saved to https://phabricator.wikimedia.org/P74922 and previous config saved to /var/cache/conftool/dbconfig/20250414-093232-root.json |
[production] |
09:31 |
<vgutierrez> |
restarting acme-chief to catch up on liblzma updates |
[production] |
09:20 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2230.codfw.wmnet |
[production] |
09:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P74919 and previous config saved to /var/cache/conftool/dbconfig/20250414-091727-root.json |
[production] |
09:15 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2230.codfw.wmnet |
[production] |
09:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P74917 and previous config saved to /var/cache/conftool/dbconfig/20250414-090222-root.json |
[production] |
09:00 |
<XioNoX> |
gnmic: bump `num-workers` to 16 on netflow1002 - T388641 |
[production] |
08:48 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.pool db2220 gradually with 4 steps - Finished upgrading host |
[production] |
08:47 |
<moritzm> |
installing Postgres 15 security updates |
[production] |
08:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1178 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P74914 and previous config saved to /var/cache/conftool/dbconfig/20250414-084716-root.json |
[production] |
08:46 |
<fabfur> |
enable-puppet on A:cp (T391670) |
[production] |
08:45 |
<moritzm> |
restart Postfix/Dovecot on outbound MXes to pick up xz security updates |
[production] |
08:41 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.reimage for host an-worker1178.eqiad.wmnet with OS bullseye |
[production] |
08:40 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1178.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
08:39 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp7001.magru.wmnet |
[production] |
08:39 |
<moritzm> |
restarting ircstream on irc1003, clients will reconnect automatically |
[production] |
08:39 |
<fceratto@cumin1002> |
END (FAIL) - Cookbook sre.mysql.upgrade (exit_code=99) for db2220.codfw.wmnet |
[production] |
08:36 |
<fabfur@cumin1002> |
conftool action : set/pooled=no; selector: name=cp7001.magru.wmnet |
[production] |
08:35 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp1111.eqiad.wmnet |
[production] |
08:34 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host an-worker1178.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
08:32 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2220.codfw.wmnet |
[production] |
08:31 |
<fabfur@cumin1002> |
conftool action : set/pooled=no; selector: name=cp1111.eqiad.wmnet |
[production] |
08:31 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db2220 - Upgrading host |
[production] |
08:30 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.depool db2220 - Upgrading host |
[production] |
08:27 |
<fabfur> |
disable-puppet on A:cp to test https://gerrit.wikimedia.org/r/c/operations/puppet/+/1135827 (T391670) |
[production] |
08:26 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db1178.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
08:25 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host db1178.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
08:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1178', diff saved to https://phabricator.wikimedia.org/P74912 and previous config saved to /var/cache/conftool/dbconfig/20250414-082235-marostegui.json |
[production] |
08:20 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1178.eqiad.wmnet with OS bullseye |
[production] |
08:11 |
<moritzm> |
restarting clamav on vrts to pick up liblzma security updates |
[production] |
07:58 |
<moritzm> |
rebalance ganeti/B T391243 |
[production] |
07:53 |
<XioNoX> |
gnmic: bump `num-workers` to 12 on netflow1002 - T388641 |
[production] |
07:48 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1044.eqiad.wmnet |
[production] |
07:42 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host ganeti1044.eqiad.wmnet |
[production] |
07:39 |
<elukey@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/proton: sync |
[production] |