2024-04-11
ยง
|
20:38 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=0) |
[toolsbeta] |
20:27 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
20:27 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
20:20 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:1018317|ext-EventLogging: Add mediawiki.product_metrics.wikifunctions_ui to $wgEventLoggingStreamNames]] (duration: 17m 38s) |
[production] |
20:18 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
20:17 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
20:17 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
20:08 |
<urbanecm@deploy1002> |
urbanecm and phuedx: Continuing with sync |
[production] |
20:08 |
<mutante> |
zuul-1001 - switching to new puppetmaster-1003 in puppet.conf manually, switched project defaults in repo too |
[devtools] |
20:05 |
<urbanecm@deploy1002> |
urbanecm and phuedx: Backport for [[gerrit:1018317|ext-EventLogging: Add mediawiki.product_metrics.wikifunctions_ui to $wgEventLoggingStreamNames]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:05 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
20:04 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
20:03 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
20:03 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:1018317|ext-EventLogging: Add mediawiki.product_metrics.wikifunctions_ui to $wgEventLoggingStreamNames]] |
[production] |
20:02 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
20:02 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
20:01 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
20:00 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
19:59 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
19:58 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
19:58 |
<mutante> |
manually editing puppet.conf to use puppetmaster-1003 instead of -1001 because you can't switch the puppetmaster via puppet if puppet is already broken :) |
[devtools] |
19:46 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
19:41 |
<mutante> |
switching gitlab-runner-1005 from puppetmaster-1001 to puppetmaster-1003 via web Hiera |
[devtools] |
19:41 |
<eevans@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply |
[production] |
19:40 |
<eevans@deploy1002> |
helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply |
[production] |
19:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1248 (T356166)', diff saved to https://phabricator.wikimedia.org/P60448 and previous config saved to /var/cache/conftool/dbconfig/20240411-193537-marostegui.json |
[production] |
19:35 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1248.eqiad.wmnet with reason: Maintenance |
[production] |
19:35 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1248.eqiad.wmnet with reason: Maintenance |
[production] |
19:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1247 (T356166)', diff saved to https://phabricator.wikimedia.org/P60447 and previous config saved to /var/cache/conftool/dbconfig/20240411-193514-marostegui.json |
[production] |
19:34 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.add_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
19:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P60446 and previous config saved to /var/cache/conftool/dbconfig/20240411-192006-marostegui.json |
[production] |
19:17 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_etcd_node |
[toolsbeta] |
19:16 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.add_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
19:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P60445 and previous config saved to /var/cache/conftool/dbconfig/20240411-190459-marostegui.json |
[production] |
19:03 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_etcd_node |
[toolsbeta] |
19:03 |
<mutante> |
- deleting instance contint-bullseye which was only used by me for a test before we created contint1003 in prod T334517 T361224 |
[devtools] |
18:58 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=0) |
[toolsbeta] |
18:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1247 (T356166)', diff saved to https://phabricator.wikimedia.org/P60443 and previous config saved to /var/cache/conftool/dbconfig/20240411-184951-marostegui.json |
[production] |
18:49 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
18:38 |
<mutante> |
- attempting to fix puppet run on vrts-1001 related to switching prod to cfssl for SSL cers |
[devtools] |
18:23 |
<mutante> |
- shutting down puppetmaster-1001 on buster - should now be replaced by puppetmaster-1003 on bookworm (thanks brennen) T360964 T360470 |
[devtools] |
18:10 |
<wmbot~bd808@tools-bastion-12> |
Restarted phorge task to test changes from MR!30 |
[tools.wikibugs-testing] |
18:01 |
<mutante> |
- shutting down instance devtools-puppetdb1001 - which is on buster - basically to see what breaks of complains, if anything |
[devtools] |
17:50 |
<dancy@deploy1002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.26 refs T360158 |
[production] |
17:33 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=0) |
[toolsbeta] |
17:27 |
<swfrench@deploy1002> |
Finished scap: (no justification provided) (duration: 07m 57s) |
[production] |
17:23 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_etcd_node |
[toolsbeta] |
17:23 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) |
[toolsbeta] |
17:20 |
<swfrench@deploy1002> |
Started scap: (no justification provided) |
[production] |
17:14 |
<JJMC89> |
copypatrol-backend-prod-01 deploy 40d8da9..1622949 |
[copypatrol] |