2851-2900 of 10000 results (67ms)
2022-03-24 ยง
13:57 <aborrero@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw2002-dev.codfw.wmnet with reason: host reimage [production]
13:52 <marostegui@cumin1001> dbctl commit (dc=all): 'db1158 (re)pooling @ 50%: After schema change', diff saved to https://phabricator.wikimedia.org/P23026 and previous config saved to /var/cache/conftool/dbconfig/20220324-135225-root.json [production]
13:43 <aborrero@cumin2002> START - Cookbook sre.hosts.reimage for host cloudgw2002-dev.codfw.wmnet with OS bullseye [production]
13:37 <marostegui@cumin1001> dbctl commit (dc=all): 'db1158 (re)pooling @ 25%: After schema change', diff saved to https://phabricator.wikimedia.org/P23025 and previous config saved to /var/cache/conftool/dbconfig/20220324-133721-root.json [production]
13:34 <aborrero@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudgw2001-dev.codfw.wmnet with OS bullseye [production]
13:26 <reedy@deploy1002> Synchronized wmf-config/CommonSettings.php: T45956 (duration: 00m 49s) [production]
13:23 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:23 <reedy@deploy1002> Synchronized multiversion/: T45956 (duration: 00m 50s) [production]
13:22 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:22 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:22 <marostegui@cumin1001> dbctl commit (dc=all): 'db1158 (re)pooling @ 10%: After schema change', diff saved to https://phabricator.wikimedia.org/P23024 and previous config saved to /var/cache/conftool/dbconfig/20220324-132217-root.json [production]
13:21 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:21 <aborrero@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw2001-dev.codfw.wmnet with reason: host reimage [production]
13:18 <aborrero@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw2001-dev.codfw.wmnet with reason: host reimage [production]
13:16 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:15 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:15 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:15 <reedy@deploy1002> Synchronized tests/: T45956 (duration: 00m 49s) [production]
13:11 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:10 <reedy@deploy1002> Synchronized wmf-config/InitialiseSettings.php: T292802 (duration: 00m 50s) [production]
12:54 <aborrero@cumin2002> START - Cookbook sre.hosts.reimage for host cloudgw2001-dev.codfw.wmnet with OS bullseye [production]
12:52 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1158 for schema change', diff saved to https://phabricator.wikimedia.org/P23023 and previous config saved to /var/cache/conftool/dbconfig/20220324-125225-marostegui.json [production]
11:47 <jynus> updating eqiad swift-commonswiki backups of originals T299764 [production]
11:26 <mmandere> pool cp1076 with HAProxy as TLS termination layer - T290005 [production]
11:22 <jbond> puppet cert clean rendering.svc.eqiad.wmnet [production]
11:21 <jbond> removing old api.svc.codfw.wmnet.pem and appservers.svc.codfw.wmnet.pem from root@puppetmaster1001:/var/lib/puppet/server/ssl/ca/signed# [production]
11:15 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1017.eqiad.wmnet with OS bullseye [production]
11:14 <btullis@cumin1001> START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons for openjdk upgrade. [production]
11:10 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1076.eqiad.wmnet with OS buster [production]
11:04 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1017.eqiad.wmnet with reason: host reimage [production]
11:00 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1017.eqiad.wmnet with reason: host reimage [production]
10:56 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1101.eqiad.wmnet [production]
10:51 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-worker1101.eqiad.wmnet [production]
10:49 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1100.eqiad.wmnet [production]
10:46 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1076.eqiad.wmnet with reason: host reimage [production]
10:45 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kubernetes1017.eqiad.wmnet with OS bullseye [production]
10:43 <mmandere@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1076.eqiad.wmnet with reason: host reimage [production]
10:42 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-worker1100.eqiad.wmnet [production]
10:42 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1099.eqiad.wmnet [production]
10:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1014.eqiad.wmnet with OS bullseye [production]
10:34 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-worker1099.eqiad.wmnet [production]
10:34 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1098.eqiad.wmnet [production]
10:28 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1014.eqiad.wmnet with reason: host reimage [production]
10:27 <mmandere@cumin1001> START - Cookbook sre.hosts.reimage for host cp1076.eqiad.wmnet with OS buster [production]
10:26 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-worker1098.eqiad.wmnet [production]
10:25 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1014.eqiad.wmnet with reason: host reimage [production]
10:20 <mmandere> depool cp1076 for reimage - T290005 [production]
10:10 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1097.eqiad.wmnet [production]
10:09 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kubernetes1014.eqiad.wmnet with OS bullseye [production]
10:01 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-worker1097.eqiad.wmnet [production]