2023-06-06
ยง
|
13:58 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging AndyRussG out of all services on: 1259 hosts |
[production] |
13:57 |
<jmm@cumin2002> |
START - Cookbook sre.idm.logout Logging AndyRussG out of all services on: 1259 hosts |
[production] |
13:55 |
<oblivian@deploy1002> |
oblivian and daniel: Backport for [[gerrit:927236|Enable parser cache warming jobs for parsoid on enwiki (T329366)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
13:53 |
<oblivian@deploy1002> |
Started scap: Backport for [[gerrit:927236|Enable parser cache warming jobs for parsoid on enwiki (T329366)]] |
[production] |
13:51 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbproxy1022.eqiad.wmnet with OS bullseye |
[production] |
13:50 |
<oblivian@deploy1002> |
Finished scap: Backport for [[gerrit:927671|Drop wmgMemoryLimitParsoid from IS.php]] (duration: 07m 21s) |
[production] |
13:49 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbproxy1023.eqiad.wmnet with OS bullseye |
[production] |
13:45 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2168:3317', diff saved to https://phabricator.wikimedia.org/P48891 and previous config saved to /var/cache/conftool/dbconfig/20230606-134524-ladsgroup.json |
[production] |
13:45 |
<oblivian@deploy1002> |
oblivian: Backport for [[gerrit:927671|Drop wmgMemoryLimitParsoid from IS.php]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
13:43 |
<oblivian@deploy1002> |
Started scap: Backport for [[gerrit:927671|Drop wmgMemoryLimitParsoid from IS.php]] |
[production] |
13:41 |
<oblivian@deploy1002> |
Finished scap: Backport for [[gerrit:927670|Raise memory limit to match parsoid (T334980)]] (duration: 07m 53s) |
[production] |
13:41 |
<elukey@deploy1002> |
helmfile [staging] DONE helmfile.d/services/changeprop: sync |
[production] |
13:41 |
<elukey@deploy1002> |
helmfile [staging] START helmfile.d/services/changeprop: sync |
[production] |
13:35 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on lsw1-e1-eqiad.mgmt,lsw1-f[1-2]-eqiad.mgmt with reason: Migrate lsw1-f2-eqiad uplinks to spine |
[production] |
13:35 |
<oblivian@deploy1002> |
oblivian: Backport for [[gerrit:927670|Raise memory limit to match parsoid (T334980)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
13:34 |
<cmooney@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on lsw1-e1-eqiad.mgmt,lsw1-f[1-2]-eqiad.mgmt with reason: Migrate lsw1-f2-eqiad uplinks to spine |
[production] |
13:33 |
<oblivian@deploy1002> |
Started scap: Backport for [[gerrit:927670|Raise memory limit to match parsoid (T334980)]] |
[production] |
13:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2168:3317', diff saved to https://phabricator.wikimedia.org/P48890 and previous config saved to /var/cache/conftool/dbconfig/20230606-133018-ladsgroup.json |
[production] |
13:15 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T336886)', diff saved to https://phabricator.wikimedia.org/P48889 and previous config saved to /var/cache/conftool/dbconfig/20230606-131512-ladsgroup.json |
[production] |
13:11 |
<eoghan@cumin1001> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 |
[production] |
13:06 |
<otto@deploy1002> |
Synchronized wmf-config/ext-EventStreamConfig.php: EventStreamConfig - Disable canary events and hadoop ingestion for development.network.probe - T332024 (duration: 07m 17s) |
[production] |
13:00 |
<eoghan@cumin1001> |
END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab2002.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 |
[production] |
12:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2168:3317 (T336886)', diff saved to https://phabricator.wikimedia.org/P48888 and previous config saved to /var/cache/conftool/dbconfig/20230606-125944-ladsgroup.json |
[production] |
12:59 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2168.codfw.wmnet with reason: Maintenance |
[production] |
12:59 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2168.codfw.wmnet with reason: Maintenance |
[production] |
12:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2159 (T336886)', diff saved to https://phabricator.wikimedia.org/P48887 and previous config saved to /var/cache/conftool/dbconfig/20230606-125923-ladsgroup.json |
[production] |
12:56 |
<fabfur@cumin1001> |
END (PASS) - Cookbook sre.cdn.run-puppet-restart-varnish (exit_code=0) rolling custom on A:cp-upload_esams and A:cp |
[production] |
12:55 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host dbproxy1022.eqiad.wmnet with OS bullseye |
[production] |
12:53 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host dbproxy1023.eqiad.wmnet with OS bullseye |
[production] |
12:44 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P48886 and previous config saved to /var/cache/conftool/dbconfig/20230606-124417-ladsgroup.json |
[production] |
12:29 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P48885 and previous config saved to /var/cache/conftool/dbconfig/20230606-122911-ladsgroup.json |
[production] |
12:21 |
<cgoubert@deploy1002> |
Finished scap: (no justification provided) (duration: 02m 10s) |
[production] |
12:19 |
<cgoubert@deploy1002> |
Started scap: (no justification provided) |
[production] |
12:19 |
<claime> |
redeploying 927218 to mw-on-k8s - T338121 |
[production] |
12:15 |
<eoghan@cumin1001> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 |
[production] |
12:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2159 (T336886)', diff saved to https://phabricator.wikimedia.org/P48884 and previous config saved to /var/cache/conftool/dbconfig/20230606-121405-ladsgroup.json |
[production] |
12:09 |
<eoghan@cumin1001> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Upgrading Gitlab to 15.10.8 |
[production] |
12:00 |
<kamila@deploy1002> |
Finished scap: Backport for [[gerrit:927218|OAuthRateLimiter: Add rate limiting class for WME using LiftWing (T338121)]] (duration: 08m 54s) |
[production] |
11:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2159 (T336886)', diff saved to https://phabricator.wikimedia.org/P48881 and previous config saved to /var/cache/conftool/dbconfig/20230606-115911-ladsgroup.json |
[production] |
11:59 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
11:58 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
11:58 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2159.codfw.wmnet with reason: Maintenance |
[production] |
11:58 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2159.codfw.wmnet with reason: Maintenance |
[production] |
11:58 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2150 (T336886)', diff saved to https://phabricator.wikimedia.org/P48880 and previous config saved to /var/cache/conftool/dbconfig/20230606-115833-ladsgroup.json |
[production] |
11:53 |
<kamila@deploy1002> |
kamila and klausman: Backport for [[gerrit:927218|OAuthRateLimiter: Add rate limiting class for WME using LiftWing (T338121)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
11:51 |
<kamila@deploy1002> |
Started scap: Backport for [[gerrit:927218|OAuthRateLimiter: Add rate limiting class for WME using LiftWing (T338121)]] |
[production] |
11:43 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P48879 and previous config saved to /var/cache/conftool/dbconfig/20230606-114327-ladsgroup.json |
[production] |
11:38 |
<cgoubert@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-debug: apply |
[production] |
11:37 |
<cgoubert@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-debug: apply |
[production] |
11:31 |
<cgoubert@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-debug: apply |
[production] |