401-450 of 10000 results (45ms)
2021-11-24 §
09:13 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on echostore.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on echostore.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on cxserver.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on cxserver.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on citoid.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on citoid.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on blubberoid.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on blubberoid.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on apple-search.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on apple-search.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on api-gateway.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:13 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on api-gateway.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:11 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on apertium.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:11 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on apertium.svc.codfw.wmnet with reason: helm3 de-deploy T251305 [production]
09:10 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM deneb.codfw.wmnet [production]
09:08 <_joe_> switching search.wikimedia.org to be served by the apple-search servcie [production]
09:04 <jelto> start re-deploy procedure in codfw Kubernetes T251305 [production]
09:01 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM deneb.codfw.wmnet [production]
08:59 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
08:56 <_joe_> repooling cp2027 [production]
08:55 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
08:55 <oblivian@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'apple-search' for release 'main' . [production]
08:51 <ladsgroup@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:741082|Set actor migration to write both on all wikis (T275246)]] (duration: 00m 57s) [production]
08:51 <oblivian@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'apple-search' for release 'main' . [production]
08:41 <vgutierrez> depool cp2027 [production]
08:05 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1125.eqiad.wmnet with OS bullseye [production]
07:40 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1125.eqiad.wmnet with OS bullseye [production]
07:23 <elukey> reboot kubernetes1018 (role::insetup) to verify negotiated speed of eth interface [production]
07:12 <elukey> drop /tmp/blockmgr-20fe4b2b-31fb-4a85-b5b1-bebe254120f8 and other blockmgr-* dirs on stat1006 to free space on the root partition [production]
06:47 <Amir1> running optimize table with replication on db1155:3314 (T296143) [production]
06:45 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance (T296143) [production]
06:45 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 5:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance (T296143) [production]
06:32 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db1121 (re)pooling @ 100%: After optimize table (T296143)', diff saved to https://phabricator.wikimedia.org/P17807 and previous config saved to /var/cache/conftool/dbconfig/20211124-063228-root.json [production]
06:17 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db1121 (re)pooling @ 75%: After optimize table (T296143)', diff saved to https://phabricator.wikimedia.org/P17806 and previous config saved to /var/cache/conftool/dbconfig/20211124-061725-root.json [production]
06:05 <marostegui> Upgrade db1128's kernel T288720 [production]
06:02 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db1121 (re)pooling @ 25%: After optimize table (T296143)', diff saved to https://phabricator.wikimedia.org/P17805 and previous config saved to /var/cache/conftool/dbconfig/20211124-060221-root.json [production]
05:47 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db1121 (re)pooling @ 10%: After optimize table (T296143)', diff saved to https://phabricator.wikimedia.org/P17804 and previous config saved to /var/cache/conftool/dbconfig/20211124-054718-root.json [production]
00:25 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2012.codfw.wmnet with OS buster [production]
2021-11-23 §
23:53 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2012.codfw.wmnet with OS buster [production]
23:43 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2011.codfw.wmnet with OS buster [production]
23:12 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2011.codfw.wmnet with OS buster [production]
23:11 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2010.codfw.wmnet with OS buster [production]
22:40 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2010.codfw.wmnet with OS buster [production]
22:28 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2009.codfw.wmnet with OS buster [production]
21:58 <tgr> UTC evening deploys done [production]
21:57 <tgr@deploy1002> Finished scap: (no justification provided) (duration: 10m 03s) [production]
21:57 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2009.codfw.wmnet with OS buster [production]
21:56 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2009.codfw.wmnet with OS buster [production]
21:53 <krinkle@deploy1002> Finished deploy [integration/docroot@a3435a7]: (no justification provided) (duration: 00m 07s) [production]
21:53 <krinkle@deploy1002> Started deploy [integration/docroot@a3435a7]: (no justification provided) [production]