|
2026-06-11
§
|
| 00:53 |
<jasmine@deploy1003> |
helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 00:53 |
<jasmine@deploy1003> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:53 |
<jasmine@deploy1003> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:52 |
<jasmine@deploy1003> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 00:51 |
<jasmine@deploy1003> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 00:41 |
<jasmine@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host kafka-main1009 |
[production] |
| 00:41 |
<jasmine@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host kafka-main1009 |
[production] |
| 00:41 |
<jasmine@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host kafka-main1009 |
[production] |
| 00:41 |
<jasmine@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) kafka-main1009.eqiad.wmnet 37.48.64.10.in-addr.arpa 7.3.0.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 00:41 |
<jasmine@cumin2002> |
START - Cookbook sre.dns.wipe-cache kafka-main1009.eqiad.wmnet 37.48.64.10.in-addr.arpa 7.3.0.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 00:41 |
<jasmine@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 00:41 |
<jasmine@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host kafka-main1009 - jasmine@cumin2002" |
[production] |
| 00:40 |
<jasmine@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host kafka-main1009 - jasmine@cumin2002" |
[production] |
| 00:39 |
<cdanis@cumin1003> |
dbctl commit (dc=all): 'depool db1262', diff saved to https://phabricator.wikimedia.org/P94032 and previous config saved to /var/cache/conftool/dbconfig/20260611-003950-cdanis.json |
[production] |
| 00:36 |
<jasmine@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
| 00:34 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp5020.* |
[production] |
| 00:30 |
<jasmine@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host kafka-main1009 |
[production] |
| 00:30 |
<jasmine@cumin2002> |
START - Cookbook sre.hosts.reimage for host kafka-main1009.eqiad.wmnet with OS trixie |
[production] |
| 00:03 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp5024.* |
[production] |
|
2026-06-10
§
|
| 23:53 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5024.* |
[production] |
| 23:15 |
<krinkle@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1300154|Disable ShortUrl on bdwikimedia, bhwiki, bnwiki, bnwikisource, eswikibooks, gomwiki (T107188)]] (duration: 11m 37s) |
[production] |
| 23:11 |
<krinkle@deploy1003> |
krinkle: Continuing with deployment |
[production] |
| 23:06 |
<krinkle@deploy1003> |
krinkle: Backport for [[gerrit:1300154|Disable ShortUrl on bdwikimedia, bhwiki, bnwiki, bnwikisource, eswikibooks, gomwiki (T107188)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 23:04 |
<krinkle@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1300154|Disable ShortUrl on bdwikimedia, bhwiki, bnwiki, bnwikisource, eswikibooks, gomwiki (T107188)]] |
[production] |
| 22:57 |
<ladsgroup@dns1004> |
END - running authdns-update |
[production] |
| 22:55 |
<ladsgroup@dns1004> |
START - running authdns-update |
[production] |
| 22:13 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5024.eqsin.wmnet with OS trixie |
[production] |
| 22:13 |
<mutante> |
gerrit - restarting service for logging change |
[production] |
| 22:11 |
<dzahn@cumin2002> |
DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:10:00 on gerrit.wikimedia.org with reason: service restart |
[production] |
| 22:09 |
<dzahn@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on gerrit2003.wikimedia.org with reason: service restart |
[production] |
| 22:06 |
<mutante> |
gerrit-spare: restarting gerrit |
[production] |
| 22:06 |
<mutante> |
gerrit-replica: restarting gerrit |
[production] |
| 21:44 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5024.eqsin.wmnet with reason: host reimage |
[production] |
| 21:37 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5024.eqsin.wmnet with reason: host reimage |
[production] |
| 21:21 |
<jforrester@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1300250|ExecuteTestAndCacheJob: Fix stdClasses serialised wrongly by JobQueue (T428801)]], [[gerrit:1300248|tests: Fix StandaloneHooksTest ordering, now broken by DB upgrade]] (duration: 08m 23s) |
[production] |
| 21:17 |
<jforrester@deploy1003> |
jforrester: Continuing with deployment |
[production] |