|
2025-02-25
§
|
| 21:51 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1122662|Remove more wikitech specific stuff]] |
[production] |
| 21:36 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1122622|cowikimedia: Change the logo v2 (T386872)]] (duration: 11m 12s) |
[production] |
| 21:30 |
<ladsgroup@deploy2002> |
ladsgroup, zhaofjx: Continuing with sync |
[production] |
| 21:29 |
<volans> |
upgraded spicerack on the cumin hosts to v9.1.3 |
[production] |
| 21:28 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 21:28 |
<ladsgroup@deploy2002> |
ladsgroup, zhaofjx: Backport for [[gerrit:1122622|cowikimedia: Change the logo v2 (T386872)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
| 21:25 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1122622|cowikimedia: Change the logo v2 (T386872)]] |
[production] |
| 21:21 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1122279|Add various settings for new wikis (T386464 T386631)]] (duration: 14m 58s) |
[production] |
| 21:14 |
<ladsgroup@deploy2002> |
pppery, ladsgroup: Continuing with sync |
[production] |
| 21:11 |
<ladsgroup@deploy2002> |
pppery, ladsgroup: Backport for [[gerrit:1122279|Add various settings for new wikis (T386464 T386631)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
| 21:06 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1122279|Add various settings for new wikis (T386464 T386631)]] |
[production] |
| 21:00 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 20:50 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 20:41 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
| 20:25 |
<jhathaway@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on ms-be2088.codfw.wmnet with reason: T381919 |
[production] |
| 20:23 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
| 20:17 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
| 20:01 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
| 19:59 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
| 19:59 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 19:57 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
| 19:55 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 19:55 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 19:54 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 19:50 |
<dduvall@deploy2002> |
rebuilt and synchronized wikiversions files: group0 to 1.44.0-wmf.18 refs T382369 |
[production] |
| 19:44 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
| 19:42 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 19:38 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 19:38 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 19:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1186 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73599 and previous config saved to /var/cache/conftool/dbconfig/20250225-193155-root.json |
[production] |
| 19:20 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 19:20 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 19:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1186 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73598 and previous config saved to /var/cache/conftool/dbconfig/20250225-191650-root.json |
[production] |
| 19:11 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
| 19:09 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:restbase-eqiad: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
| 19:09 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
| 19:08 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp4047.ulsfo.wmnet,service=(cdn|ats-be) |
[production] |
| 19:08 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=no; selector: name=host down,service=(cdn|ats-be) |
[production] |
| 19:08 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=no; selector: name=--reason,service=(cdn|ats-be) |
[production] |
| 19:08 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=no; selector: name=host down,service=(cdn|ats-be) |
[production] |
| 19:08 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=no; selector: name=--reason,service=(cdn|ats-be) |
[production] |
| 19:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1186 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73597 and previous config saved to /var/cache/conftool/dbconfig/20250225-190145-root.json |
[production] |
| 18:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1181 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73596 and previous config saved to /var/cache/conftool/dbconfig/20250225-184758-root.json |
[production] |
| 18:46 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1186 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73595 and previous config saved to /var/cache/conftool/dbconfig/20250225-184640-root.json |
[production] |
| 18:36 |
<fabfur> |
re-enabled puppet on cp4050 (T329332) |
[production] |
| 18:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1181 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73594 and previous config saved to /var/cache/conftool/dbconfig/20250225-183252-root.json |
[production] |
| 18:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1186 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73593 and previous config saved to /var/cache/conftool/dbconfig/20250225-183134-root.json |
[production] |
| 18:22 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host backup1014.eqiad.wmnet with OS bookworm |
[production] |
| 18:22 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1077048|Allow users to sign up on Wikitech (T377074)]] (duration: 14m 05s) |
[production] |
| 18:21 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1014.eqiad.wmnet with OS bookworm |
[production] |