2025-09-30
ยง
|
20:12 |
<SandraEbele_> |
Deploying Refinery as part of deployment weekly train |
[production] |
20:12 |
<SandraEbele_> |
Deploying Refinery as part of deployment weekly train |
[analytics] |
20:07 |
<dani@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1191691|Remove reader foundational survey on enwiki (beta) (T405410)]], [[gerrit:1192555|Increase coverage of Design Research participant recruitment survey on jawiki (T405577)]], [[gerrit:1191510|Update reader foundational survey on enwiki (T405410)]], [[gerrit:1192595|Enable USERLANGUAGE for sourceswiki (T406050)]] |
[production] |
20:07 |
<SandraEbele_> |
refinery-source deployment paused due to maven release error |
[analytics] |
20:07 |
<SandraEbele_> |
refinery-source deployment paused due to maven release error |
[production] |
20:06 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host dbprov1007.eqiad.wmnet with OS bookworm |
[production] |
20:05 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1065.eqiad.wmnet}' |
[admin] |
20:05 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1064.eqiad.wmnet}' |
[admin] |
19:46 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1064.eqiad.wmnet}' |
[admin] |
19:46 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1063.eqiad.wmnet}' |
[admin] |
19:23 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1063.eqiad.wmnet}' |
[admin] |
19:23 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirtlocal1003.eqiad.wmnet |
[production] |
19:16 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host cloudvirtlocal1003.eqiad.wmnet |
[production] |
19:16 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirtlocal1002.eqiad.wmnet |
[production] |
19:09 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host cloudvirtlocal1002.eqiad.wmnet |
[production] |
19:09 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirtlocal1001.eqiad.wmnet |
[production] |
19:02 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host cloudvirtlocal1001.eqiad.wmnet |
[production] |
19:01 |
<andrew@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host cloudvirtlocal1001.eqiad.wmnet |
[production] |
19:01 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host cloudvirtlocal1001.eqiad.wmnet |
[production] |
19:00 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance |
[production] |
19:00 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1245.eqiad.wmnet with reason: Maintenance |
[production] |
19:00 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1230 (T401906)', diff saved to https://phabricator.wikimedia.org/P83515 and previous config saved to /var/cache/conftool/dbconfig/20250930-190012-fceratto.json |
[production] |
18:55 |
<James_F> |
Zuul: [mediawiki/extensions/WikimediaEvents] Add dependency on ConfirmEdit, for T405239 |
[releng] |
18:53 |
<swfrench@cumin2002> |
END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Deploy DSL rendering for known_client objects - swfrench@cumin2002" |
[production] |
18:53 |
<swfrench@cumin2002> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Deploy DSL rendering for known_client objects - swfrench@cumin2002 |
[production] |
18:52 |
<swfrench@cumin2002> |
START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Deploy DSL rendering for known_client objects - swfrench@cumin2002 |
[production] |
18:52 |
<swfrench@cumin2002> |
START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Deploy DSL rendering for known_client objects - swfrench@cumin2002" |
[production] |
18:51 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
18:45 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1230', diff saved to https://phabricator.wikimedia.org/P83514 and previous config saved to /var/cache/conftool/dbconfig/20250930-184504-fceratto.json |
[production] |
18:38 |
<cdanis@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/eventgate-logging-external: sync |
[production] |
18:37 |
<cdanis@deploy2002> |
helmfile [codfw] START helmfile.d/services/eventgate-logging-external: sync |
[production] |
18:36 |
<cdanis@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: sync |
[production] |
18:36 |
<cdanis@deploy2002> |
helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: sync |
[production] |
18:29 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1230', diff saved to https://phabricator.wikimedia.org/P83513 and previous config saved to /var/cache/conftool/dbconfig/20250930-182957-fceratto.json |
[production] |
18:14 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1230 (T401906)', diff saved to https://phabricator.wikimedia.org/P83512 and previous config saved to /var/cache/conftool/dbconfig/20250930-181449-fceratto.json |
[production] |
18:13 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1230 (T401906)', diff saved to https://phabricator.wikimedia.org/P83511 and previous config saved to /var/cache/conftool/dbconfig/20250930-181340-fceratto.json |
[production] |
18:13 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1230.eqiad.wmnet with reason: Maintenance |
[production] |
18:13 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
18:13 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1207 (T401906)', diff saved to https://phabricator.wikimedia.org/P83510 and previous config saved to /var/cache/conftool/dbconfig/20250930-181300-fceratto.json |
[production] |
17:58 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
17:58 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
17:58 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
17:57 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P83509 and previous config saved to /var/cache/conftool/dbconfig/20250930-175752-fceratto.json |
[production] |
17:57 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
17:57 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
17:56 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wdqs2016.codfw.wmnet with OS bullseye |
[production] |
17:49 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbprov1007.eqiad.wmnet with OS bookworm |
[production] |
17:46 |
<swfrench@deploy2002> |
Finished scap sync-world: Non-image-build scap run to switch next and migration releases to PHP 8.3 - T405955 (duration: 04m 29s) |
[production] |
17:42 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P83508 and previous config saved to /var/cache/conftool/dbconfig/20250930-174245-fceratto.json |
[production] |
17:42 |
<swfrench@deploy2002> |
Started scap sync-world: Non-image-build scap run to switch next and migration releases to PHP 8.3 - T405955 |
[production] |