2023-09-07
22:45 <jhuneidi@deploy1002> Installation of scap version "4.59.0" completed for 594 hosts [production]
22:44 <jhuneidi@deploy1002> Installing scap version "4.59.0" for 594 hosts [production]
22:30 <jhuneidi@deploy1002> Installing scap version "4.59.0" for 595 hosts [production]
22:29 <jeena> installing scap v4.59.0 [production]
22:24 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) [production]
21:47 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2128 (T343198)', diff saved to https://phabricator.wikimedia.org/P52313 and previous config saved to /var/cache/conftool/dbconfig/20230907-214717-arnaudb.json [production]
21:47 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2186.codfw.wmnet with reason: Maintenance [production]
21:46 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2186.codfw.wmnet with reason: Maintenance [production]
21:46 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2128.codfw.wmnet with reason: Maintenance [production]
21:46 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2128.codfw.wmnet with reason: Maintenance [production]
21:46 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2123 (T343198)', diff saved to https://phabricator.wikimedia.org/P52312 and previous config saved to /var/cache/conftool/dbconfig/20230907-214640-arnaudb.json [production]
21:31 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2123', diff saved to https://phabricator.wikimedia.org/P52311 and previous config saved to /var/cache/conftool/dbconfig/20230907-213134-arnaudb.json [production]
21:16 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2123', diff saved to https://phabricator.wikimedia.org/P52310 and previous config saved to /var/cache/conftool/dbconfig/20230907-211628-arnaudb.json [production]
21:01 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2123 (T343198)', diff saved to https://phabricator.wikimedia.org/P52309 and previous config saved to /var/cache/conftool/dbconfig/20230907-210122-arnaudb.json [production]
20:56 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:955792|Preserve Gadget prefs when they can't be enabled (T341421)]], [[gerrit:955791|Fix settings button not working on reference previews (T345829)]] (duration: 11m 12s) [production]
20:50 <thcipriani@deploy1002> jdlrobson and thcipriani: Continuing with sync [production]
20:49 <taavi@cumin1001> conftool action : set/pooled=inactive; selector: name=mw2444.codfw.wmnet [production]
20:46 <bking@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
20:46 <thcipriani@deploy1002> jdlrobson and thcipriani: Backport for [[gerrit:955792|Preserve Gadget prefs when they can't be enabled (T341421)]], [[gerrit:955791|Fix settings button not working on reference previews (T345829)]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD o [production]
20:46 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
20:46 <bking@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
20:45 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:955792|Preserve Gadget prefs when they can't be enabled (T341421)]], [[gerrit:955791|Fix settings button not working on reference previews (T345829)]] [production]
20:41 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:954724|Pre-deploy Reader Demographics 2 pilot survey (T344393)]] (duration: 10m 59s) [production]
20:33 <thcipriani@deploy1002> dani and thcipriani: Continuing with sync [production]
20:31 <thcipriani@deploy1002> dani and thcipriani: Backport for [[gerrit:954724|Pre-deploy Reader Demographics 2 pilot survey (T344393)]] synced to the testservers mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:30 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:954724|Pre-deploy Reader Demographics 2 pilot survey (T344393)]] [production]
20:23 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:955388|Undeploy Campaigns Event Discovery survey (T345158)]] (duration: 17m 58s) [production]
20:23 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs1016.eqiad.wmnet with OS bullseye [production]
20:11 <thcipriani@deploy1002> thcipriani and dani: Continuing with sync [production]
20:07 <thcipriani@deploy1002> thcipriani and dani: Backport for [[gerrit:955388|Undeploy Campaigns Event Discovery survey (T345158)]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:05 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:955388|Undeploy Campaigns Event Discovery survey (T345158)]] [production]
19:41 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1016.eqiad.wmnet with reason: host reimage [production]
19:38 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1016.eqiad.wmnet with reason: host reimage [production]
19:37 <eevans@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe [production]
19:33 <eevans@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe [production]
19:13 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1016.eqiad.wmnet with OS bullseye [production]
18:50 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1010.eqiad.wmnet with reason: T342361 [production]
18:49 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1010.eqiad.wmnet with reason: T342361 [production]
18:31 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2123 (T343198)', diff saved to https://phabricator.wikimedia.org/P52308 and previous config saved to /var/cache/conftool/dbconfig/20230907-183153-arnaudb.json [production]
18:31 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2123.codfw.wmnet with reason: Maintenance [production]
18:31 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2123.codfw.wmnet with reason: Maintenance [production]
18:31 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2111 (T343198)', diff saved to https://phabricator.wikimedia.org/P52307 and previous config saved to /var/cache/conftool/dbconfig/20230907-183132-arnaudb.json [production]
18:16 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2111', diff saved to https://phabricator.wikimedia.org/P52306 and previous config saved to /var/cache/conftool/dbconfig/20230907-181626-arnaudb.json [production]
18:01 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2111', diff saved to https://phabricator.wikimedia.org/P52305 and previous config saved to /var/cache/conftool/dbconfig/20230907-180120-arnaudb.json [production]
17:46 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2111 (T343198)', diff saved to https://phabricator.wikimedia.org/P52304 and previous config saved to /var/cache/conftool/dbconfig/20230907-174613-arnaudb.json [production]
17:43 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2111 (T343198)', diff saved to https://phabricator.wikimedia.org/P52303 and previous config saved to /var/cache/conftool/dbconfig/20230907-174351-arnaudb.json [production]
17:43 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2111.codfw.wmnet with reason: Maintenance [production]
17:43 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2111.codfw.wmnet with reason: Maintenance [production]
16:45 <Amir1> running moveToExternal on all wikis [production]
15:58 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['lists1004.eqiad.wmnet'] [production]