2023-05-26 §
10:54 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary [production]
10:38 <cmooney@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
10:27 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
09:54 <effie> pool parse1013-parse1016 to the jobrunner cluster - T329366 [production]
09:29 <jbond> disable puppet fleet wide to deploy minor puppet change https://gerrit.wikimedia.org/r/c/operations/puppet/+/923353 [production]
09:28 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1016.eqiad.wmnet with OS buster [production]
09:26 <effie> parse1013-parse1016 have been depooled and removed from the parsoid-php service - T329366 [production]
09:26 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1014.eqiad.wmnet with OS buster [production]
09:24 <jnuche@deploy1002> Installation of scap version "4.52.3" completed for 596 hosts [production]
09:23 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1013.eqiad.wmnet with OS buster [production]
09:23 <jnuche@deploy1002> Installing scap version "4.52.3" for 596 hosts [production]
09:13 <elukey@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: sync [production]
09:13 <elukey@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: sync [production]
09:08 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host parse1015.eqiad.wmnet with OS buster [production]
08:59 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1016.eqiad.wmnet with reason: host reimage [production]
08:56 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1014.eqiad.wmnet with reason: host reimage [production]
08:54 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1013.eqiad.wmnet with reason: host reimage [production]
08:54 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on parse1015.eqiad.wmnet with reason: host reimage [production]
08:52 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1016.eqiad.wmnet with reason: host reimage [production]
08:52 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1015.eqiad.wmnet with reason: host reimage [production]
08:51 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1014.eqiad.wmnet with reason: host reimage [production]
08:51 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1013.eqiad.wmnet with reason: host reimage [production]
08:39 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host parse1016.eqiad.wmnet with OS buster [production]
08:39 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host parse1015.eqiad.wmnet with OS buster [production]
08:39 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host parse1014.eqiad.wmnet with OS buster [production]
08:39 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host parse1013.eqiad.wmnet with OS buster [production]
08:10 <jiji@cumin1001> conftool action : set/pooled=inactive; selector: dc=eqiad,name=parse101[3-6].eqiad.wmnet [production]
07:59 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 100%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48591 and previous config saved to /var/cache/conftool/dbconfig/20230526-075903-root.json [production]
07:58 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 100%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48590 and previous config saved to /var/cache/conftool/dbconfig/20230526-075809-root.json [production]
07:43 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 75%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48589 and previous config saved to /var/cache/conftool/dbconfig/20230526-074358-root.json [production]
07:43 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 75%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48588 and previous config saved to /var/cache/conftool/dbconfig/20230526-074304-root.json [production]
07:28 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 50%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48587 and previous config saved to /var/cache/conftool/dbconfig/20230526-072854-root.json [production]
07:28 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 50%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48586 and previous config saved to /var/cache/conftool/dbconfig/20230526-072759-root.json [production]
07:13 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 25%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48585 and previous config saved to /var/cache/conftool/dbconfig/20230526-071349-root.json [production]
07:12 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 25%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48584 and previous config saved to /var/cache/conftool/dbconfig/20230526-071255-root.json [production]
06:58 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 10%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48583 and previous config saved to /var/cache/conftool/dbconfig/20230526-065844-root.json [production]
06:57 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 10%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48582 and previous config saved to /var/cache/conftool/dbconfig/20230526-065750-root.json [production]
06:43 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 5%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48581 and previous config saved to /var/cache/conftool/dbconfig/20230526-064340-root.json [production]
06:42 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 5%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48580 and previous config saved to /var/cache/conftool/dbconfig/20230526-064245-root.json [production]
06:42 <elukey> `apt-get clean` on stat1008 to clean up some space in the root partition [production]
06:36 <elukey> `truncate /var/log/kerberos/krb5kdc.log -s 10g` on krb1001 to keep the root partition from filling up [production]
06:28 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 2%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48579 and previous config saved to /var/cache/conftool/dbconfig/20230526-062835-root.json [production]
06:27 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 3%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48578 and previous config saved to /var/cache/conftool/dbconfig/20230526-062741-root.json [production]
06:13 <marostegui@cumin1001> dbctl commit (dc=all): 'db1156 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48577 and previous config saved to /var/cache/conftool/dbconfig/20230526-061330-root.json [production]
06:12 <marostegui@cumin1001> dbctl commit (dc=all): 'db1161 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48576 and previous config saved to /var/cache/conftool/dbconfig/20230526-061236-root.json [production]
03:51 <fab@deploy1002> Finished deploy [airflow-dags/research@77cf676]: (no justification provided) (duration: 00m 17s) [production]
03:51 <fab@deploy1002> Started deploy [airflow-dags/research@77cf676]: (no justification provided) [production]
2023-05-25 §
22:14 <zabe@deploy1002> Finished scap: Backport for [[gerrit:923283|Replace deprecated Hooks::runWithoutAbort (T335536)]], [[gerrit:923276|BannerRenderer: Make sure the language variant is valid (T337427)]] (duration: 09m 14s) [production]
22:06 <zabe@deploy1002> zabe and ladsgroup: Backport for [[gerrit:923283|Replace deprecated Hooks::runWithoutAbort (T335536)]], [[gerrit:923276|BannerRenderer: Make sure the language variant is valid (T337427)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
22:05 <zabe@deploy1002> Started scap: Backport for [[gerrit:923283|Replace deprecated Hooks::runWithoutAbort (T335536)]], [[gerrit:923276|BannerRenderer: Make sure the language variant is valid (T337427)]] [production]