8101-8150 of 10000 results (35ms)
2024-01-18 ยง
12:51 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1221.eqiad.wmnet with reason: Maintenance [production]
12:50 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1221.eqiad.wmnet with reason: Maintenance [production]
12:50 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199 (T352010)', diff saved to https://phabricator.wikimedia.org/P54912 and previous config saved to /var/cache/conftool/dbconfig/20240118-125048-ladsgroup.json [production]
12:49 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P54911 and previous config saved to /var/cache/conftool/dbconfig/20240118-124945-marostegui.json [production]
12:41 <godog> grafana restarted on grafana1002 after https://gerrit.wikimedia.org/r/c/operations/puppet/+/991573 [production]
12:38 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker role in the tools cluster [admin]
12:38 <taavi@cloudcumin1001> Added a new k8s worker tools-k8s-worker-101.tools.eqiad1.wikimedia.cloud to the cluster [admin]
12:35 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P54910 and previous config saved to /var/cache/conftool/dbconfig/20240118-123541-ladsgroup.json [production]
12:35 <stran@deploy2002> helmfile [codfw] START helmfile.d/services/ipoid: apply [production]
12:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P54909 and previous config saved to /var/cache/conftool/dbconfig/20240118-123439-marostegui.json [production]
12:34 <stran@deploy2002> helmfile [eqiad] DONE helmfile.d/services/ipoid: apply [production]
12:33 <stran@deploy2002> helmfile [eqiad] START helmfile.d/services/ipoid: apply [production]
12:31 <stran@deploy2002> helmfile [staging] DONE helmfile.d/services/ipoid: apply [production]
12:28 <stran@deploy2002> helmfile [staging] START helmfile.d/services/ipoid: apply [production]
12:27 <Dreamy_Jazz> Finished security deploy for T347742 [production]
12:27 <dreamyjazz@deploy2002> Finished scap: Backport for [[gerrit:991552|SECURITY: Use message label instead of sanitized text output for massmessage-form-page-help message (T347742)]] (duration: 08m 28s) [production]
12:27 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1047.eqiad.wmnet [production]
12:26 <stran@deploy2002> helmfile [staging] START helmfile.d/services/ipoid: apply [production]
12:24 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.add_k8s_node for a worker role in the tools cluster [tools]
12:24 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2047.codfw.wmnet [production]
12:21 <dreamyjazz@deploy2002> dreamyjazz: Continuing with sync [production]
12:21 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.remove_grid_node for tools-sgeexec-10-17 [tools]
12:20 <dreamyjazz@deploy2002> dreamyjazz: Backport for [[gerrit:991552|SECURITY: Use message label instead of sanitized text output for massmessage-form-page-help message (T347742)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
12:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P54908 and previous config saved to /var/cache/conftool/dbconfig/20240118-122035-ladsgroup.json [production]
12:20 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc2047.codfw.wmnet [production]
12:20 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc1047.eqiad.wmnet [production]
12:20 <taavi> comment newly added crontab entries and add a hopefully-unmissable warning to the crontab about the grid engine deprecation, T319626 [tools.chobot]
12:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189 (T354336)', diff saved to https://phabricator.wikimedia.org/P54907 and previous config saved to /var/cache/conftool/dbconfig/20240118-121932-marostegui.json [production]
12:18 <dreamyjazz@deploy2002> Started scap: Backport for [[gerrit:991552|SECURITY: Use message label instead of sanitized text output for massmessage-form-page-help message (T347742)]] [production]
12:17 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:17 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:16 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:16 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:16 <jynus> depooled db2146, lot of lag, should be investigated later [production]
12:15 <jynus@cumin1002> dbctl commit (dc=all): 'Depool db2146', diff saved to https://phabricator.wikimedia.org/P54906 and previous config saved to /var/cache/conftool/dbconfig/20240118-121541-jynus.json [production]
12:07 <Dreamy_Jazz> Doing security deploy for T347742 [production]
12:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199 (T352010)', diff saved to https://phabricator.wikimedia.org/P54905 and previous config saved to /var/cache/conftool/dbconfig/20240118-120528-ladsgroup.json [production]
11:54 <urbanecm> deployment-prep: `mwscript userOptions.php --wiki=enwiki --delete --old '' --fromuserid=906 --nowarn 'echo-subscriptions-web-reverted'` (T353225) [releng]
11:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2189 (T354336)', diff saved to https://phabricator.wikimedia.org/P54904 and previous config saved to /var/cache/conftool/dbconfig/20240118-114551-marostegui.json [production]
11:45 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2189.codfw.wmnet with reason: Maintenance [production]
11:45 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db2189.codfw.wmnet with reason: Maintenance [production]
11:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T354336)', diff saved to https://phabricator.wikimedia.org/P54903 and previous config saved to /var/cache/conftool/dbconfig/20240118-114528-marostegui.json [production]
11:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P54902 and previous config saved to /var/cache/conftool/dbconfig/20240118-113022-marostegui.json [production]
11:21 <godog> bounce apache2 on logstash1025 / logstash1031 - T337818 [production]
11:15 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P54901 and previous config saved to /var/cache/conftool/dbconfig/20240118-111516-marostegui.json [production]
11:04 <cmooney@cumin1002> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1001-1002].eqiad.wmnet with reason: Release v0.6.5 - cmooney@cumin1002 [production]
11:01 <cmooney@cumin1002> START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1001-1002].eqiad.wmnet with reason: Release v0.6.5 - cmooney@cumin1002 [production]
11:00 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T354336)', diff saved to https://phabricator.wikimedia.org/P54900 and previous config saved to /var/cache/conftool/dbconfig/20240118-110009-marostegui.json [production]
10:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2175 (T354336)', diff saved to https://phabricator.wikimedia.org/P54899 and previous config saved to /var/cache/conftool/dbconfig/20240118-104335-marostegui.json [production]
10:43 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2175.codfw.wmnet with reason: Maintenance [production]