2023-09-22
§
|
08:51 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on 20 hosts with reason: Schema change |
[production] |
08:51 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on 20 hosts with reason: Schema change |
[production] |
07:45 |
<hashar> |
Upgrading CI Jenkins from 2.401.3 to 2.414.2 |
[production] |
07:36 |
<hashar> |
Restarting Gerrit to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/953967 "Link account creation to IDM" # T345226 |
[production] |
07:06 |
<moritzm> |
installing mutt security updates |
[production] |
06:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1132', diff saved to https://phabricator.wikimedia.org/P52577 and previous config saved to /var/cache/conftool/dbconfig/20230922-063617-root.json |
[production] |
06:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1132', diff saved to https://phabricator.wikimedia.org/P52576 and previous config saved to /var/cache/conftool/dbconfig/20230922-063212-root.json |
[production] |
05:13 |
<isaranto@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . |
[production] |
00:43 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
00:43 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
00:43 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1202 (T343198)', diff saved to https://phabricator.wikimedia.org/P52575 and previous config saved to /var/cache/conftool/dbconfig/20230922-004330-arnaudb.json |
[production] |
00:28 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P52574 and previous config saved to /var/cache/conftool/dbconfig/20230922-002823-arnaudb.json |
[production] |
00:13 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P52573 and previous config saved to /var/cache/conftool/dbconfig/20230922-001316-arnaudb.json |
[production] |
2023-09-21
§
|
23:58 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1202 (T343198)', diff saved to https://phabricator.wikimedia.org/P52572 and previous config saved to /var/cache/conftool/dbconfig/20230921-235810-arnaudb.json |
[production] |
22:02 |
<ejegg> |
Standalone (listener) SmashPig upgraded from ca5b6218 to 2412df22 |
[production] |
20:28 |
<brennen> |
end of UTC late backport & config window |
[production] |
20:27 |
<brennen@deploy2002> |
Finished scap: Backport for [[gerrit:956931|Update Reader Demographics 2 pilot survey (T345951)]] (duration: 21m 36s) |
[production] |
20:18 |
<brennen@deploy2002> |
dani and brennen: Continuing with sync |
[production] |
20:17 |
<brennen@deploy2002> |
dani and brennen: Backport for [[gerrit:956931|Update Reader Demographics 2 pilot survey (T345951)]] synced to the testservers mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) |
[production] |
20:06 |
<brennen@deploy2002> |
Started scap: Backport for [[gerrit:956931|Update Reader Demographics 2 pilot survey (T345951)]] |
[production] |
20:04 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db1202 (T343198)', diff saved to https://phabricator.wikimedia.org/P52570 and previous config saved to /var/cache/conftool/dbconfig/20230921-200439-arnaudb.json |
[production] |
20:04 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1202.eqiad.wmnet with reason: Maintenance |
[production] |
20:04 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1202.eqiad.wmnet with reason: Maintenance |
[production] |
20:04 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1194 (T343198)', diff saved to https://phabricator.wikimedia.org/P52569 and previous config saved to /var/cache/conftool/dbconfig/20230921-200417-arnaudb.json |
[production] |
20:01 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:00 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add reords for codfw test servers - cmooney@cumin1001" |
[production] |
19:59 |
<cmooney@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add reords for codfw test servers - cmooney@cumin1001" |
[production] |
19:49 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P52568 and previous config saved to /var/cache/conftool/dbconfig/20230921-194911-arnaudb.json |
[production] |
19:47 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:34 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P52567 and previous config saved to /var/cache/conftool/dbconfig/20230921-193404-arnaudb.json |
[production] |
19:18 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1194 (T343198)', diff saved to https://phabricator.wikimedia.org/P52566 and previous config saved to /var/cache/conftool/dbconfig/20230921-191858-arnaudb.json |
[production] |
19:17 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "add codfw new switches - cmooney@cumin1001" |
[production] |
19:13 |
<cmooney@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "add codfw new switches - cmooney@cumin1001" |
[production] |
18:54 |
<ladsgroup@deploy2002> |
Finished scap: Backport for [[gerrit:959713|Enable Url shortener in sidebar in all wikis (T267921)]] (duration: 20m 47s) |
[production] |
18:47 |
<ejegg> |
payments-wiki upgraded from 9cd3e4cd to 5596c7fd |
[production] |
18:45 |
<ladsgroup@deploy2002> |
ladsgroup: Continuing with sync |
[production] |
18:45 |
<ladsgroup@deploy2002> |
ladsgroup: Backport for [[gerrit:959713|Enable Url shortener in sidebar in all wikis (T267921)]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) |
[production] |
18:40 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 100%: Maint over', diff saved to https://phabricator.wikimedia.org/P52565 and previous config saved to /var/cache/conftool/dbconfig/20230921-184000-ladsgroup.json |
[production] |
18:33 |
<ladsgroup@deploy2002> |
Started scap: Backport for [[gerrit:959713|Enable Url shortener in sidebar in all wikis (T267921)]] |
[production] |
18:24 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 75%: Maint over', diff saved to https://phabricator.wikimedia.org/P52564 and previous config saved to /var/cache/conftool/dbconfig/20230921-182455-ladsgroup.json |
[production] |
18:15 |
<brennen@deploy2002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.41.0-wmf.27 refs T345888 |
[production] |
18:09 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 25%: Maint over', diff saved to https://phabricator.wikimedia.org/P52562 and previous config saved to /var/cache/conftool/dbconfig/20230921-180949-ladsgroup.json |
[production] |
18:05 |
<brennen> |
train 1.41.0-wmf.27 (T345888): no current blockers, logs clean, rolling to group2 shortly. |
[production] |
18:00 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repool db1166 (T346365)', diff saved to https://phabricator.wikimedia.org/P52561 and previous config saved to /var/cache/conftool/dbconfig/20230921-180003-ladsgroup.json |
[production] |
17:59 |
<xcollazo@deploy2002> |
Finished deploy [airflow-dags/analytics@ddcc518]: Deploy latest DAGs to analytics Airflow instance (duration: 00m 40s) |
[production] |
17:58 |
<xcollazo@deploy2002> |
Started deploy [airflow-dags/analytics@ddcc518]: Deploy latest DAGs to analytics Airflow instance |
[production] |
17:56 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depool db1166 (T346365)', diff saved to https://phabricator.wikimedia.org/P52560 and previous config saved to /var/cache/conftool/dbconfig/20230921-175634-ladsgroup.json |
[production] |
17:54 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 10%: Maint over', diff saved to https://phabricator.wikimedia.org/P52559 and previous config saved to /var/cache/conftool/dbconfig/20230921-175444-ladsgroup.json |
[production] |
17:49 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depool db2149 (T346365)', diff saved to https://phabricator.wikimedia.org/P52558 and previous config saved to /var/cache/conftool/dbconfig/20230921-174934-ladsgroup.json |
[production] |
17:41 |
<eevans@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for restbase2014.codfw.wmnet |
[production] |