3751-3800 of 10000 results (47ms)
2023-06-06 ยง
12:19 <cgoubert@deploy1002> Started scap: (no justification provided) [production]
12:19 <claime> redeploying 927218 to mw-on-k8s - T338121 [production]
12:15 <eoghan@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 [production]
12:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2159 (T336886)', diff saved to https://phabricator.wikimedia.org/P48884 and previous config saved to /var/cache/conftool/dbconfig/20230606-121405-ladsgroup.json [production]
12:09 <eoghan@cumin1001> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 [production]
12:00 <kamila@deploy1002> Finished scap: Backport for [[gerrit:927218|OAuthRateLimiter: Add rate limiting class for WME using LiftWing (T338121)]] (duration: 08m 54s) [production]
11:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2159 (T336886)', diff saved to https://phabricator.wikimedia.org/P48881 and previous config saved to /var/cache/conftool/dbconfig/20230606-115911-ladsgroup.json [production]
11:59 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
11:58 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
11:58 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2159.codfw.wmnet with reason: Maintenance [production]
11:58 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2159.codfw.wmnet with reason: Maintenance [production]
11:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2150 (T336886)', diff saved to https://phabricator.wikimedia.org/P48880 and previous config saved to /var/cache/conftool/dbconfig/20230606-115833-ladsgroup.json [production]
11:53 <kamila@deploy1002> kamila and klausman: Backport for [[gerrit:927218|OAuthRateLimiter: Add rate limiting class for WME using LiftWing (T338121)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
11:51 <kamila@deploy1002> Started scap: Backport for [[gerrit:927218|OAuthRateLimiter: Add rate limiting class for WME using LiftWing (T338121)]] [production]
11:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P48879 and previous config saved to /var/cache/conftool/dbconfig/20230606-114327-ladsgroup.json [production]
11:38 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
11:37 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
11:31 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
11:31 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
11:29 <stevemunene> service hadoop-yarn-resourcemanager restart for T317861 [analytics]
11:28 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P48878 and previous config saved to /var/cache/conftool/dbconfig/20230606-112819-ladsgroup.json [production]
11:13 <btullis> restart airflow-scheduler service on an-test-client1001 for analytics_test instance [analytics]
11:13 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2150 (T336886)', diff saved to https://phabricator.wikimedia.org/P48877 and previous config saved to /var/cache/conftool/dbconfig/20230606-111313-ladsgroup.json [production]
11:12 <btullis> restart airflow-scheduler service on an-airflow1006 for product_analytics instance [analytics]
11:12 <btullis> restart airflow-scheduler service on an-airflow1005 for search instance [analytics]
11:08 <btullis> restart airflow-scheduler service on an-airflow1002 for research instance [analytics]
11:07 <btullis> (correction) that should have read an-airflow1004 for platform_eng instance [analytics]
11:06 <btullis> restart airflow-scheduler service on an-launcher1004 for postgresql restart [analytics]
11:05 <btullis> restart airflow-scheduler service on an-launcher1002 for postgresql restart [analytics]
11:03 <eoghan@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 [production]
10:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2150 (T336886)', diff saved to https://phabricator.wikimedia.org/P48876 and previous config saved to /var/cache/conftool/dbconfig/20230606-105756-ladsgroup.json [production]
10:57 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2150.codfw.wmnet with reason: Maintenance [production]
10:57 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2150.codfw.wmnet with reason: Maintenance [production]
10:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2122 (T336886)', diff saved to https://phabricator.wikimedia.org/P48875 and previous config saved to /var/cache/conftool/dbconfig/20230606-105724-ladsgroup.json [production]
10:53 <urbanecm@deploy1002> helmfile [codfw] DONE helmfile.d/services/linkrecommendation: apply [production]
10:53 <urbanecm@deploy1002> helmfile [codfw] START helmfile.d/services/linkrecommendation: apply [production]
10:52 <urbanecm@deploy1002> helmfile [eqiad] DONE helmfile.d/services/linkrecommendation: apply [production]
10:51 <zabe@deploy1002> Finished scap: Backport for [[gerrit:927594|Stop writing to revision_comment_temp in group1 wikis (T299954)]] (duration: 07m 03s) [production]
10:51 <urbanecm@deploy1002> helmfile [eqiad] START helmfile.d/services/linkrecommendation: apply [production]
10:50 <urbanecm@deploy1002> helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply [production]
10:50 <urbanecm@deploy1002> helmfile [staging] START helmfile.d/services/linkrecommendation: apply [production]
10:50 <urbanecm@deploy1002> helmfile [eqiad] DONE helmfile.d/services/linkrecommendation: apply [production]
10:50 <urbanecm@deploy1002> helmfile [eqiad] START helmfile.d/services/linkrecommendation: apply [production]
10:46 <zabe@deploy1002> zabe: Backport for [[gerrit:927594|Stop writing to revision_comment_temp in group1 wikis (T299954)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
10:44 <zabe@deploy1002> Started scap: Backport for [[gerrit:927594|Stop writing to revision_comment_temp in group1 wikis (T299954)]] [production]
10:42 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P48874 and previous config saved to /var/cache/conftool/dbconfig/20230606-104218-ladsgroup.json [production]
10:30 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: sync [production]
10:30 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: sync [production]
10:28 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
10:28 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]