5851-5900 of 10000 results (89ms)
2023-03-13 ยง
18:44 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2145.codfw.wmnet with reason: Maintenance [production]
18:43 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@196e10d]: allow spark3-submit as a valid spark exeutable (duration: 00m 13s) [production]
18:43 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@196e10d]: allow spark3-submit as a valid spark exeutable [production]
18:38 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1002.eqiad.wmnet [production]
18:36 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@a8d066e]: Parameterize streaming updater reconcile start date (duration: 00m 14s) [production]
18:36 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
18:36 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@a8d066e]: Parameterize streaming updater reconcile start date [production]
18:36 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2141.codfw.wmnet with reason: Maintenance [production]
18:36 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2130 (T329260)', diff saved to https://phabricator.wikimedia.org/P45795 and previous config saved to /var/cache/conftool/dbconfig/20230313-183628-marostegui.json [production]
18:33 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts sessionstore1002.eqiad.wmnet [production]
18:32 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1002.eqiad.wmnet [production]
18:21 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P45794 and previous config saved to /var/cache/conftool/dbconfig/20230313-182121-marostegui.json [production]
18:17 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sessionstore1002.eqiad.wmnet [production]
18:11 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host sessionstore1002.eqiad.wmnet [production]
18:07 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
18:07 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sessionstore1001.eqiad.wmnet [production]
18:06 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P45793 and previous config saved to /var/cache/conftool/dbconfig/20230313-180615-marostegui.json [production]
17:56 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host sessionstore1001.eqiad.wmnet [production]
17:55 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:51 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2130 (T329260)', diff saved to https://phabricator.wikimedia.org/P45792 and previous config saved to /var/cache/conftool/dbconfig/20230313-175109-marostegui.json [production]
17:50 <dancy@deploy2002> Finished scap: test cleanup (duration: 06m 40s) [production]
17:44 <dancy@deploy2002> Started scap: test cleanup [production]
17:43 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:40 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db2130 (T329260)', diff saved to https://phabricator.wikimedia.org/P45791 and previous config saved to /var/cache/conftool/dbconfig/20230313-174030-marostegui.json [production]
17:40 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2130.codfw.wmnet with reason: Maintenance [production]
17:40 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2130.codfw.wmnet with reason: Maintenance [production]
17:40 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2116 (T329260)', diff saved to https://phabricator.wikimedia.org/P45790 and previous config saved to /var/cache/conftool/dbconfig/20230313-174009-marostegui.json [production]
17:35 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:33 <eevans@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:32 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:25 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P45789 and previous config saved to /var/cache/conftool/dbconfig/20230313-172503-marostegui.json [production]
17:22 <dancy@deploy2002> Finished scap: testing T329857 (duration: 06m 54s) [production]
17:16 <mvernon@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:codfw and (A:swift-fe or A:swift-fe-canary or A:swift-fe-codfw or A:swift-fe-eqiad) [production]
17:15 <dancy@deploy2002> Started scap: testing T329857 [production]
17:13 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:13 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:12 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:12 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sessionstore1001.eqiad.wmnet [production]
17:11 <bd808@deploy2002> helmfile [codfw] DONE helmfile.d/services/developer-portal: apply [production]
17:11 <mvernon@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:codfw and (A:swift-fe or A:swift-fe-canary or A:swift-fe-codfw or A:swift-fe-eqiad) [production]
17:11 <bd808@deploy2002> helmfile [codfw] START helmfile.d/services/developer-portal: apply [production]
17:10 <Emperor> roll-restart of codfw eqiad frontends [production]
17:10 <bd808@deploy2002> helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply [production]
17:10 <bd808@deploy2002> helmfile [eqiad] START helmfile.d/services/developer-portal: apply [production]
17:10 <bd808@deploy2002> helmfile [staging] DONE helmfile.d/services/developer-portal: apply [production]
17:09 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P45788 and previous config saved to /var/cache/conftool/dbconfig/20230313-170955-marostegui.json [production]
17:09 <bd808@deploy2002> helmfile [staging] START helmfile.d/services/developer-portal: apply [production]
17:08 <dancy@deploy2002> Installation of scap version "4.46.0" completed for 553 hosts [production]
17:07 <dancy@deploy2002> Installing scap version "4.46.0" for 553 hosts [production]
17:04 <bd808> Ran cache.purge_openstack_users() for Striker following deploy of e1f7491 (T331674) [production]