2025-08-29
|
08:19 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on install2004.wikimedia.org with reason: being replaced by install2005 |
[production] |
08:02 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2194 (T402925)', diff saved to https://phabricator.wikimedia.org/P82092 and previous config saved to /var/cache/conftool/dbconfig/20250829-080216-ladsgroup.json |
[production] |
08:02 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2194.codfw.wmnet with reason: Maintenance |
[production] |
08:01 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2190 (T402925)', diff saved to https://phabricator.wikimedia.org/P82091 and previous config saved to /var/cache/conftool/dbconfig/20250829-080153-ladsgroup.json |
[production] |
07:49 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
07:46 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2190', diff saved to https://phabricator.wikimedia.org/P82090 and previous config saved to /var/cache/conftool/dbconfig/20250829-074645-ladsgroup.json |
[production] |
07:46 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . |
[production] |
07:31 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2190', diff saved to https://phabricator.wikimedia.org/P82089 and previous config saved to /var/cache/conftool/dbconfig/20250829-073138-ladsgroup.json |
[production] |
07:16 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2190 (T402925)', diff saved to https://phabricator.wikimedia.org/P82088 and previous config saved to /var/cache/conftool/dbconfig/20250829-071630-ladsgroup.json |
[production] |
06:13 |
<arnaudb@cumin1003> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Security Update |
[production] |
06:06 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2190 (T402925)', diff saved to https://phabricator.wikimedia.org/P82087 and previous config saved to /var/cache/conftool/dbconfig/20250829-060644-ladsgroup.json |
[production] |
06:06 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2190.codfw.wmnet with reason: Maintenance |
[production] |
06:06 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T402925)', diff saved to https://phabricator.wikimedia.org/P82086 and previous config saved to /var/cache/conftool/dbconfig/20250829-060621-ladsgroup.json |
[production] |
05:51 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P82085 and previous config saved to /var/cache/conftool/dbconfig/20250829-055113-ladsgroup.json |
[production] |
05:36 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P82084 and previous config saved to /var/cache/conftool/dbconfig/20250829-053606-ladsgroup.json |
[production] |
05:20 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T402925)', diff saved to https://phabricator.wikimedia.org/P82083 and previous config saved to /var/cache/conftool/dbconfig/20250829-052059-ladsgroup.json |
[production] |
04:08 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2177 (T402925)', diff saved to https://phabricator.wikimedia.org/P82082 and previous config saved to /var/cache/conftool/dbconfig/20250829-040849-ladsgroup.json |
[production] |
04:08 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
04:08 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T402925)', diff saved to https://phabricator.wikimedia.org/P82081 and previous config saved to /var/cache/conftool/dbconfig/20250829-040826-ladsgroup.json |
[production] |
03:53 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P82080 and previous config saved to /var/cache/conftool/dbconfig/20250829-035319-ladsgroup.json |
[production] |
03:38 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P82079 and previous config saved to /var/cache/conftool/dbconfig/20250829-033811-ladsgroup.json |
[production] |
03:23 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T402925)', diff saved to https://phabricator.wikimedia.org/P82078 and previous config saved to /var/cache/conftool/dbconfig/20250829-032304-ladsgroup.json |
[production] |
02:11 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2156 (T402925)', diff saved to https://phabricator.wikimedia.org/P82077 and previous config saved to /var/cache/conftool/dbconfig/20250829-021120-ladsgroup.json |
[production] |
02:11 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |
02:10 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T402925)', diff saved to https://phabricator.wikimedia.org/P82076 and previous config saved to /var/cache/conftool/dbconfig/20250829-021056-ladsgroup.json |
[production] |
01:55 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P82075 and previous config saved to /var/cache/conftool/dbconfig/20250829-015549-ladsgroup.json |
[production] |
01:40 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P82074 and previous config saved to /var/cache/conftool/dbconfig/20250829-014041-ladsgroup.json |
[production] |
01:25 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T402925)', diff saved to https://phabricator.wikimedia.org/P82073 and previous config saved to /var/cache/conftool/dbconfig/20250829-012534-ladsgroup.json |
[production] |
01:12 |
<mwpresync@deploy1003> |
Finished scap build-images: Publishing wmf/next image (duration: 11m 47s) |
[production] |
01:00 |
<mwpresync@deploy1003> |
Started scap build-images: Publishing wmf/next image |
[production] |
00:13 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2149 (T402925)', diff saved to https://phabricator.wikimedia.org/P82072 and previous config saved to /var/cache/conftool/dbconfig/20250829-001328-ladsgroup.json |
[production] |
00:13 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2149.codfw.wmnet with reason: Maintenance |
[production] |
2025-08-28
|
23:38 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host maps2012.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
23:22 |
<krinkle@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1181310|Enable wmgUseMdotRouting in Beta Cluster for testwiki only (T401595)]] (duration: 10m 54s) |
[production] |
23:20 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host maps2012.codfw.wmnet with OS bookworm |
[production] |
23:16 |
<krinkle@deploy1003> |
krinkle: Continuing with sync |
[production] |
23:16 |
<rzl@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
23:16 |
<rzl@deploy1003> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
23:14 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host maps2012.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
23:13 |
<krinkle@deploy1003> |
krinkle: Backport for [[gerrit:1181310|Enable wmgUseMdotRouting in Beta Cluster for testwiki only (T401595)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
23:13 |
<rzl@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
23:12 |
<rzl@deploy1003> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
23:12 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance |
[production] |
23:11 |
<krinkle@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1181310|Enable wmgUseMdotRouting in Beta Cluster for testwiki only (T401595)]] |
[production] |
23:08 |
<rzl@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply |
[production] |
23:07 |
<rzl@deploy1003> |
helmfile [eqiad] START helmfile.d/services/api-gateway: apply |
[production] |
23:05 |
<rzl@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/api-gateway: apply |
[production] |
23:05 |
<rzl@deploy1003> |
helmfile [codfw] START helmfile.d/services/api-gateway: apply |
[production] |
22:56 |
<rzl@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
22:55 |
<rzl@deploy1003> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |