2023-11-28
13:35 <marostegui@cumin1001> dbctl commit (dc=all): 'es2028 (re)pooling @ 1%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P53924 and previous config saved to /var/cache/conftool/dbconfig/20231128-133547-root.json [production]
13:33 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2028.codfw.wmnet with OS bookworm [production]
13:18 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2028.codfw.wmnet with reason: host reimage [production]
13:18 <volans> uploaded python3-wmflib_1.2.4 to apt.wikimedia.org buster-wikimedia,bullseye-wikimedia,bookworm-wikimedia [production]
13:15 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on es2028.codfw.wmnet with reason: host reimage [production]
12:58 <XioNoX> re-enable sampling on cr1-esams:fpc1 [production]
12:56 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es2028.codfw.wmnet with OS bookworm [production]
12:52 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es2028 T351916', diff saved to https://phabricator.wikimedia.org/P53923 and previous config saved to /var/cache/conftool/dbconfig/20231128-125235-root.json [production]
12:35 <kart_> Updated Apertium to 2023-11-23-055425-production (ie Bookworm!) (T346997) [production]
12:32 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/apertium: apply [production]
12:32 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/apertium: apply [production]
12:26 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/apertium: apply [production]
12:26 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/apertium: apply [production]
12:13 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/apertium: apply [production]
12:12 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/apertium: apply [production]
12:02 <kamila@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
12:02 <kamila@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
11:58 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
11:57 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
11:56 <kamila@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
11:55 <kamila@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
11:50 <vgutierrez> pool ncredir4001 [production]
11:42 <kamila@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
11:42 <kamila@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
11:41 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
11:41 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
11:33 <volans@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox interface ID cr2-esams:xe-0/1/2 [production]
11:33 <volans@cumin1001> START - Cookbook sre.network.debug for Netbox interface ID cr2-esams:xe-0/1/2 [production]
11:22 <kamila@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
11:22 <kamila@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
11:21 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
11:21 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
10:52 <vgutierrez> depool ncredir4001 [production]
10:45 <vgutierrez> repool ncredir4001 [production]
10:38 <klausman@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
10:37 <moritzm> installing lua5.3 security updates [production]
10:35 <vgutierrez> depool ncredir4001 [production]
10:21 <vgutierrez> rolling restart of pybal on lvs4010 and lvs4008, effectively disabling IPIP encapsulation on ncredir@ulsfo - T351069 [production]
10:09 <vgutierrez> rolling restart of pybal on lvs4010 and lvs4008, effectively enabling IPIP encapsulation on ncredir@ulsfo - T351069 [production]
10:01 <sg912@deploy2002> Finished deploy [airflow-dags/analytics@0283c11]: (no justification provided) (duration: 00m 47s) [production]
10:00 <sg912@deploy2002> Started deploy [airflow-dags/analytics@0283c11]: (no justification provided) [production]
09:58 <moritzm> installing intel-microcode security updates [production]
09:56 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1001.eqiad.wmnet [production]
09:50 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host sretest1001.eqiad.wmnet [production]
09:40 <hashar@deploy2002> rebuilt and synchronized wikiversions files: group0 wikis to 1.42.0-wmf.7 refs T350083 [production]
09:13 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1024.eqiad.wmnet with OS bookworm [production]
08:55 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1024.eqiad.wmnet with reason: host reimage [production]
08:52 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1024.eqiad.wmnet with reason: host reimage [production]
08:47 <hashar@deploy2002> Finished scap: Backport for [[gerrit:977625|Revert "Parsoid DataAccess: Stop processing extensions as top-level docs"]] (duration: 07m 54s) [production]
08:41 <hashar@deploy2002> hashar and ssastry: Continuing with sync [production]