251-300 of 10000 results (47ms)
2022-04-04 ยง
10:38 <volans> uploaded python3-wmflib_1.2.0 to apt.wikimedia.org buster-wikimedia,bullseye-wikimedia [production]
10:32 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-druid1003.eqiad.wmnet [production]
10:26 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1119 (T298565)', diff saved to https://phabricator.wikimedia.org/P24035 and previous config saved to /var/cache/conftool/dbconfig/20220404-102616-ladsgroup.json [production]
10:26 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance [production]
10:26 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance [production]
10:26 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24034 and previous config saved to /var/cache/conftool/dbconfig/20220404-102609-ladsgroup.json [production]
10:26 <moritzm> installing libxml2 security updates [production]
10:14 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-druid1004.eqiad.wmnet [production]
10:11 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1106', diff saved to https://phabricator.wikimedia.org/P24033 and previous config saved to /var/cache/conftool/dbconfig/20220404-101104-ladsgroup.json [production]
10:09 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-druid1004.eqiad.wmnet [production]
10:08 <moritzm> installing icu bugfix updates from buster 10.12 point release [production]
09:58 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-druid1005.eqiad.wmnet [production]
09:55 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1106', diff saved to https://phabricator.wikimedia.org/P24032 and previous config saved to /var/cache/conftool/dbconfig/20220404-095558-ladsgroup.json [production]
09:55 <jelto@cumin1001> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM gitlab1001.wikimedia.org [production]
09:54 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
09:52 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-druid1005.eqiad.wmnet [production]
09:51 <mmandere> pool cp6008 with HAProxy as TLS termination layer - T290005 [production]
09:48 <jelto@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM gitlab1001.wikimedia.org [production]
09:47 <moritzm> installing zlib security updates [production]
09:44 <mmandere> pool cp5003 with HAProxy as TLS termination layer - T290005 [production]
09:40 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24031 and previous config saved to /var/cache/conftool/dbconfig/20220404-094053-ladsgroup.json [production]
09:31 <moritzm> rolling restart of FPM/Apache on mw canaries to pick up updated zlib/glibc/openssl/libxml [production]
09:29 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-test-presto1001.eqiad.wmnet [production]
09:26 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-test-presto1001.eqiad.wmnet [production]
09:26 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6008.drmrs.wmnet with OS buster [production]
09:26 <btullis@cumin1001> END (PASS) - Cookbook sre.presto.roll-restart-workers (exit_code=0) for Presto analytics cluster: Roll restart of all Presto's jvm daemons. [production]
09:25 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5003.eqsin.wmnet with OS buster [production]
09:16 <btullis@cumin1001> START - Cookbook sre.presto.roll-restart-workers for Presto analytics cluster: Roll restart of all Presto's jvm daemons. [production]
09:12 <moritzm> installing openssl updates from Buster 10.12 point release [production]
09:03 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6008.drmrs.wmnet with reason: host reimage [production]
08:59 <mmandere@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp6008.drmrs.wmnet with reason: host reimage [production]
08:59 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5003.eqsin.wmnet with reason: host reimage [production]
08:56 <moritzm> installing glibc updates from buster 10.12 point release [production]
08:55 <mmandere@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5003.eqsin.wmnet with reason: host reimage [production]
08:45 <marostegui@cumin1001> dbctl commit (dc=all): 'db1130 (re)pooling @ 5%: After reimage', diff saved to https://phabricator.wikimedia.org/P24030 and previous config saved to /var/cache/conftool/dbconfig/20220404-084523-root.json [production]
08:43 <elukey@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
08:42 <mmandere@cumin1001> START - Cookbook sre.hosts.reimage for host cp6008.drmrs.wmnet with OS buster [production]
08:39 <elukey@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
08:37 <moritzm> installing flac security updates [production]
08:37 <elukey@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
08:37 <mmandere> depool cp6008 for reimage - T290005 [production]
08:35 <elukey@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
08:34 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
08:31 <elukey@deploy1002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
08:31 <elukey@deploy1002> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
08:31 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
08:31 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
08:31 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:31 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:30 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24029 and previous config saved to /var/cache/conftool/dbconfig/20220404-083031-ladsgroup.json [production]