2024-04-12
§
|
14:22 |
<hashar@deploy1002> |
Finished scap: Backport for [[gerrit:1018692|Parser::statelessFetchTemplate: don't add interwiki redirects to dependencies (T362221)]] (duration: 16m 29s) |
[production] |
14:19 |
<elukey@deploy1002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . |
[production] |
14:18 |
<elukey@deploy1002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
14:18 |
<elukey@deploy1002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
14:17 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reimage for host matomo1003.eqiad.wmnet with OS bookworm |
[production] |
14:09 |
<hashar@deploy1002> |
hashar and jforrester: Continuing with sync |
[production] |
14:08 |
<hashar@deploy1002> |
hashar and jforrester: Backport for [[gerrit:1018692|Parser::statelessFetchTemplate: don't add interwiki redirects to dependencies (T362221)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:08 |
<sukhe> |
depool cp1115 for PXE boot issue testing: T350179 |
[production] |
14:07 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp1115.eqiad.wmnet,service=(cdn|ats-be) |
[production] |
14:05 |
<hashar@deploy1002> |
Started scap: Backport for [[gerrit:1018692|Parser::statelessFetchTemplate: don't add interwiki redirects to dependencies (T362221)]] |
[production] |
12:53 |
<jayme> |
updated rsyslog to 8.2404.0-1~bpo11+1 on staging-codfw and staging-eqiad k8s clusters - T357616 |
[production] |
12:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P60466 and previous config saved to /var/cache/conftool/dbconfig/20240412-122045-marostegui.json |
[production] |
12:05 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P60464 and previous config saved to /var/cache/conftool/dbconfig/20240412-120537-marostegui.json |
[production] |
12:02 |
<btullis@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host matomo1003.eqiad.wmnet with OS bookworm |
[production] |
11:50 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1249 (T356166)', diff saved to https://phabricator.wikimedia.org/P60463 and previous config saved to /var/cache/conftool/dbconfig/20240412-115029-marostegui.json |
[production] |
11:33 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reimage for host matomo1003.eqiad.wmnet with OS bookworm |
[production] |
11:06 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab to new version |
[production] |
10:55 |
<urbanecm> |
mwmaint1002: mwscript extensions/GrowthExperiments/maintenance/fixLinkRecommendationData.php --wiki=frwiki --search-index (T362367) |
[production] |
09:58 |
<urbanecm> |
mwmaint1002: mwscript extensions/GrowthExperiments/maintenance/fixLinkRecommendationData.php --wiki=eswiki --search-index (T362367) |
[production] |
09:36 |
<moritzm> |
installing postgresql-common bugfix updates from Bullseye point release |
[production] |
09:26 |
<moritzm> |
installing debootstrap bugfix updates from Bullseye point release |
[production] |
09:25 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on matomo1003.eqiad.wmnet with reason: Still in setup |
[production] |
09:25 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on matomo1003.eqiad.wmnet with reason: Still in setup |
[production] |
08:56 |
<jelto@cumin1002> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab to new version |
[production] |
07:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P60461 and previous config saved to /var/cache/conftool/dbconfig/20240412-072435-root.json |
[production] |
07:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P60460 and previous config saved to /var/cache/conftool/dbconfig/20240412-070930-root.json |
[production] |
06:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P60459 and previous config saved to /var/cache/conftool/dbconfig/20240412-065424-root.json |
[production] |
06:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P60458 and previous config saved to /var/cache/conftool/dbconfig/20240412-063918-root.json |
[production] |
06:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P60457 and previous config saved to /var/cache/conftool/dbconfig/20240412-062412-root.json |
[production] |
06:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P60456 and previous config saved to /var/cache/conftool/dbconfig/20240412-060907-root.json |
[production] |
05:56 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2109.codfw.wmnet with OS bookworm |
[production] |
05:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2109 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P60455 and previous config saved to /var/cache/conftool/dbconfig/20240412-055401-root.json |
[production] |
05:35 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2109.codfw.wmnet with reason: host reimage |
[production] |
05:33 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2109.codfw.wmnet with reason: host reimage |
[production] |
05:23 |
<moritzm> |
prune obsolete nginx debs on apt-staging after switch to new nginx provider scheme T329529 |
[production] |
05:17 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2109.codfw.wmnet with OS bookworm |
[production] |
05:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2109', diff saved to https://phabricator.wikimedia.org/P60454 and previous config saved to /var/cache/conftool/dbconfig/20240412-051606-root.json |
[production] |
03:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1249 (T356166)', diff saved to https://phabricator.wikimedia.org/P60453 and previous config saved to /var/cache/conftool/dbconfig/20240412-033317-marostegui.json |
[production] |
03:33 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1249.eqiad.wmnet with reason: Maintenance |
[production] |
03:32 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1249.eqiad.wmnet with reason: Maintenance |
[production] |
03:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1248 (T356166)', diff saved to https://phabricator.wikimedia.org/P60452 and previous config saved to /var/cache/conftool/dbconfig/20240412-033254-marostegui.json |
[production] |
03:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P60451 and previous config saved to /var/cache/conftool/dbconfig/20240412-031744-marostegui.json |
[production] |
03:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P60450 and previous config saved to /var/cache/conftool/dbconfig/20240412-030237-marostegui.json |
[production] |
02:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1248 (T356166)', diff saved to https://phabricator.wikimedia.org/P60449 and previous config saved to /var/cache/conftool/dbconfig/20240412-024729-marostegui.json |
[production] |
01:05 |
<denisse> |
Manually deleting /srv/syslog/.linux.dhcp.DictModel/syslog.log from November 30 on centrallog1002 and centrallog2002 after the prune_old_srv_syslog_directories.service failed to delete the non-empty directory - T362376 |
[production] |