2025-04-07
ยง
|
18:09 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1168 (T391056)', diff saved to https://phabricator.wikimedia.org/P74624 and previous config saved to /var/cache/conftool/dbconfig/20250407-180927-fceratto.json |
[production] |
18:09 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
18:09 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T391056)', diff saved to https://phabricator.wikimedia.org/P74623 and previous config saved to /var/cache/conftool/dbconfig/20250407-180905-fceratto.json |
[production] |
18:08 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1202.eqiad.wmnet with OS bullseye |
[production] |
18:01 |
<wmbot~lucaswerkmeister@tools-bastion-13> |
deployed e106b7b684 (Quechua verbs + l10n updates: es, pa, qu, zh-hant) |
[tools.lexeme-forms] |
17:59 |
<brett> |
Upload varnishkafka 1.2.0-2 to bullseye-wikimedia (T389605) |
[production] |
17:53 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P74622 and previous config saved to /var/cache/conftool/dbconfig/20250407-175358-fceratto.json |
[production] |
17:50 |
<jhathaway@cumin1002> |
DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Xiaoxiao out of all services on: 2397 hosts |
[production] |
17:44 |
<brett> |
Remove libvmod-netmapper, libvmod-querysort, varnish-re2, varnish, varnishkafka, varnish-modules from bullseye-wikimedia component/varnish-staging |
[production] |
17:38 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P74621 and previous config saved to /var/cache/conftool/dbconfig/20250407-173851-fceratto.json |
[production] |
17:32 |
<wmbot~lucaswerkmeister@tools-bastion-13> |
deployed 348dc8edc7 (l10n updates: es, zh-hant) |
[tools.ranker] |
17:27 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp7002.magru.wmnet |
[production] |
17:26 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp7001.magru.wmnet |
[production] |
17:23 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T391056)', diff saved to https://phabricator.wikimedia.org/P74620 and previous config saved to /var/cache/conftool/dbconfig/20250407-172343-fceratto.json |
[production] |
17:22 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1165 (T391056)', diff saved to https://phabricator.wikimedia.org/P74619 and previous config saved to /var/cache/conftool/dbconfig/20250407-172234-fceratto.json |
[production] |
17:22 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1015,1019].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
17:22 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1165.eqiad.wmnet with reason: Maintenance |
[production] |
17:20 |
<bd808> |
`service navtiming stop` to halt "Unhandled exception in main loop, restarting consumer" crash loop (T391272) |
[releng] |
17:17 |
<brett> |
Re-enabling Puppet on A:cp (T378737) |
[production] |
17:15 |
<bd808> |
Reboot deployment-webperf21 (T391272) |
[releng] |
17:04 |
<brett> |
Disabling puppet on A:cp to roll out removal of vanrish 6/7 template switching (T378737) |
[production] |
17:04 |
<dancy@deploy1003> |
Installation of scap version "4.151.0" completed for 190 hosts |
[production] |
16:59 |
<dancy@deploy1003> |
Installing scap version "4.151.0" for 190 host(s) |
[production] |
16:58 |
<bd808> |
`puppet agent -tv` to catch up with missed puppet runs on deployment-webperf21 (T391272) |
[releng] |
16:56 |
<bd808> |
`rm /var/log/user.log.1` on deployment-webperf21 (T391272) |
[releng] |
16:54 |
<slyngshede@cumin1002> |
DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Xiaoxiao out of all services on: 2396 hosts |
[production] |
16:52 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host an-worker1202.eqiad.wmnet with OS bullseye |
[production] |
16:52 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1202.eqiad.wmnet with OS bullseye |
[production] |
16:47 |
<bd808> |
`sudo /usr/local/sbin/clean-stale-puppet-certs --clean` on deployment-puppetserver-1 to clean up dangling certs for deployment-elastic{09,10,11} |
[releng] |
16:33 |
<brett> |
Upload ncmonitor 1.3.4-1 to bookworm-wikimedia |
[production] |
16:30 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: relforge1003* for test ban syntax - bking@cumin2002 - T391151 |
[production] |
16:30 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.ban Banning hosts: relforge1003* for test ban syntax - bking@cumin2002 - T391151 |
[production] |
16:29 |
<mforns@deploy1003> |
helmfile [staging] DONE helmfile.d/services/commons-impact-analytics: apply |
[production] |
16:29 |
<mforns@deploy1003> |
helmfile [staging] START helmfile.d/services/commons-impact-analytics: apply |
[production] |
16:25 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host an-worker1202.eqiad.wmnet with OS bullseye |
[production] |
16:24 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) an-worker1202.eqiad.wmnet on all recursors |
[production] |
16:24 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.wipe-cache an-worker1202.eqiad.wmnet on all recursors |
[production] |
16:23 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) an-worker1202.eqiad.wmnet on all recursors |
[production] |
16:23 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.wipe-cache an-worker1202.eqiad.wmnet on all recursors |
[production] |
16:17 |
<mforns@deploy1003> |
helmfile [staging] DONE helmfile.d/services/commons-impact-analytics: apply |
[production] |
16:17 |
<mforns@deploy1003> |
helmfile [staging] START helmfile.d/services/commons-impact-analytics: apply |
[production] |
16:15 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1202 |
[production] |
16:15 |
<jclark@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host an-worker1202 |
[production] |
16:10 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1202.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:08 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: relforge1004* for test ban syntax - bking@cumin2002 - T391151 |
[production] |
16:08 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.ban Banning hosts: relforge1004* for test ban syntax - bking@cumin2002 - T391151 |
[production] |
16:07 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in relforge |
[production] |
16:07 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.ban Unbanning all hosts in relforge |
[production] |
15:59 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host an-worker1202.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
15:58 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |