2024-03-01
§
|
13:12 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw1389.eqiad.wmnet with OS bullseye |
[production] |
13:11 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw1387.eqiad.wmnet with OS bullseye |
[production] |
13:03 |
<jynus> |
refreshing image metadata of commons Алтарна_частина.jpg |
[production] |
13:02 |
<claime> |
Depooling mw1387.eqiad.wmnet,mw1389.eqiad.wmnet,mw1391.eqiad.wmnet,mw1393.eqiad.wmnet,mw1395.eqiad.wmnet,mw1397.eqiad.wmnet for reimage to k8s nodes - T351074 |
[production] |
12:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P58283 and previous config saved to /var/cache/conftool/dbconfig/20240301-125812-marostegui.json |
[production] |
12:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1163 (T354015)', diff saved to https://phabricator.wikimedia.org/P58282 and previous config saved to /var/cache/conftool/dbconfig/20240301-124306-marostegui.json |
[production] |
11:58 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply |
[production] |
11:58 |
<cgoubert@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-api-int: apply |
[production] |
11:56 |
<cgoubert@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply |
[production] |
11:55 |
<cgoubert@deploy2002> |
helmfile [codfw] START helmfile.d/services/mw-api-int: apply |
[production] |
11:54 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1173.eqiad.wmnet |
[production] |
11:48 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply |
[production] |
11:48 |
<cgoubert@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-api-int: apply |
[production] |
11:47 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host an-worker1173.eqiad.wmnet |
[production] |
11:33 |
<mfossati@deploy2002> |
Finished deploy [airflow-dags/platform_eng@241457d]: (no justification provided) (duration: 00m 28s) |
[production] |
11:32 |
<mfossati@deploy2002> |
Started deploy [airflow-dags/platform_eng@241457d]: (no justification provided) |
[production] |
11:16 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2194 (T352010)', diff saved to https://phabricator.wikimedia.org/P58281 and previous config saved to /var/cache/conftool/dbconfig/20240301-111610-ladsgroup.json |
[production] |
11:16 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2194.codfw.wmnet with reason: Maintenance |
[production] |
11:15 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2194.codfw.wmnet with reason: Maintenance |
[production] |
10:34 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2117.codfw.wmnet with reason: Silence for maintenance |
[production] |
10:34 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2117.codfw.wmnet with reason: Silence for maintenance |
[production] |
08:43 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply |
[production] |
08:42 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply |
[production] |
08:42 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
08:42 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
07:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 100%: After schema change', diff saved to https://phabricator.wikimedia.org/P58280 and previous config saved to /var/cache/conftool/dbconfig/20240301-075212-root.json |
[production] |
07:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 75%: After schema change', diff saved to https://phabricator.wikimedia.org/P58279 and previous config saved to /var/cache/conftool/dbconfig/20240301-073707-root.json |
[production] |
07:23 |
<eoghan@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on vrts1002.eqiad.wmnet with reason: Not in production, silencing alarms until we decide whether to decom or not |
[production] |
07:23 |
<eoghan@cumin1002> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on vrts1002.eqiad.wmnet with reason: Not in production, silencing alarms until we decide whether to decom or not |
[production] |
07:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 50%: After schema change', diff saved to https://phabricator.wikimedia.org/P58278 and previous config saved to /var/cache/conftool/dbconfig/20240301-072202-root.json |
[production] |
07:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 25%: After schema change', diff saved to https://phabricator.wikimedia.org/P58277 and previous config saved to /var/cache/conftool/dbconfig/20240301-070657-root.json |
[production] |
06:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 10%: After schema change', diff saved to https://phabricator.wikimedia.org/P58276 and previous config saved to /var/cache/conftool/dbconfig/20240301-065152-root.json |
[production] |
06:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db1118.eqiad.wmnet |
[production] |
06:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
06:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1118.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" |
[production] |
06:47 |
<marostegui@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1118.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" |
[production] |
06:44 |
<marostegui@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
06:39 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts db1118.eqiad.wmnet |
[production] |
06:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 5%: After schema change', diff saved to https://phabricator.wikimedia.org/P58275 and previous config saved to /var/cache/conftool/dbconfig/20240301-063647-root.json |
[production] |
06:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1163 (T354015)', diff saved to https://phabricator.wikimedia.org/P58274 and previous config saved to /var/cache/conftool/dbconfig/20240301-063633-marostegui.json |
[production] |
06:36 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1163.eqiad.wmnet with reason: Maintenance |
[production] |
06:36 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1163.eqiad.wmnet with reason: Maintenance |
[production] |
01:09 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2117.codfw.wmnet with reason: Maintenance |
[production] |
01:09 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2117.codfw.wmnet with reason: Maintenance |
[production] |
01:09 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114 (T352010)', diff saved to https://phabricator.wikimedia.org/P58273 and previous config saved to /var/cache/conftool/dbconfig/20240301-010936-ladsgroup.json |
[production] |
00:54 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114', diff saved to https://phabricator.wikimedia.org/P58271 and previous config saved to /var/cache/conftool/dbconfig/20240301-005429-ladsgroup.json |
[production] |
00:39 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114', diff saved to https://phabricator.wikimedia.org/P58270 and previous config saved to /var/cache/conftool/dbconfig/20240301-003923-ladsgroup.json |
[production] |
00:24 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114 (T352010)', diff saved to https://phabricator.wikimedia.org/P58269 and previous config saved to /var/cache/conftool/dbconfig/20240301-002417-ladsgroup.json |
[production] |