1051-1100 of 10000 results (86ms)
2023-12-05 ยง
12:11 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135 (T348183)', diff saved to https://phabricator.wikimedia.org/P54163 and previous config saved to /var/cache/conftool/dbconfig/20231205-121121-arnaudb.json [production]
12:10 <kamila@cumin1001> START - Cookbook sre.hosts.reimage for host mw2424.codfw.wmnet with OS bullseye [production]
12:07 <pfischer@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
12:07 <pfischer@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
12:06 <kamila@cumin1001> START - Cookbook sre.hosts.reimage for host mw2423.codfw.wmnet with OS bullseye [production]
12:04 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host cp4039.ulsfo.wmnet [production]
12:02 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1135 (T348183)', diff saved to https://phabricator.wikimedia.org/P54162 and previous config saved to /var/cache/conftool/dbconfig/20231205-120206-arnaudb.json [production]
12:02 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1135.eqiad.wmnet with reason: Maintenance [production]
12:01 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1135.eqiad.wmnet with reason: Maintenance [production]
12:01 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134 (T348183)', diff saved to https://phabricator.wikimedia.org/P54161 and previous config saved to /var/cache/conftool/dbconfig/20231205-120145-arnaudb.json [production]
12:01 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cp4049.ulsfo.wmnet [production]
11:53 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
11:52 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
11:51 <kamila@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
11:51 <kamila@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
11:50 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host cp4049.ulsfo.wmnet [production]
11:46 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P54160 and previous config saved to /var/cache/conftool/dbconfig/20231205-114638-arnaudb.json [production]
11:40 <kamila@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
11:40 <kamila@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
11:40 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
11:40 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
11:38 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:979920|Bump ParserCache TTL back to 30 days (T280604)]] (duration: 07m 47s) [production]
11:33 <pfischer@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:32 <pfischer@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:32 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
11:32 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:979920|Bump ParserCache TTL back to 30 days (T280604)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
11:31 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P54159 and previous config saved to /var/cache/conftool/dbconfig/20231205-113132-arnaudb.json [production]
11:30 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:979920|Bump ParserCache TTL back to 30 days (T280604)]] [production]
11:30 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1023.eqiad.wmnet with OS bookworm [production]
11:17 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:16 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:16 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:16 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134 (T348183)', diff saved to https://phabricator.wikimedia.org/P54158 and previous config saved to /var/cache/conftool/dbconfig/20231205-111625-arnaudb.json [production]
11:16 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:15 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:15 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:12 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1023.eqiad.wmnet with reason: host reimage [production]
11:08 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1023.eqiad.wmnet with reason: host reimage [production]
11:08 <hnowlan@deploy2002> helmfile [codfw] [main] DONE helmfile.d/services/mw-jobrunner : sync [production]
11:08 <hnowlan@deploy2002> helmfile [codfw] [main] START helmfile.d/services/mw-jobrunner : sync [production]
11:07 <hnowlan@deploy2002> helmfile [eqiad] [main] DONE helmfile.d/services/mw-jobrunner : sync [production]
11:07 <hnowlan@deploy2002> helmfile [eqiad] [main] START helmfile.d/services/mw-jobrunner : sync [production]
11:04 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1134 (T348183)', diff saved to https://phabricator.wikimedia.org/P54157 and previous config saved to /var/cache/conftool/dbconfig/20231205-110448-arnaudb.json [production]
11:04 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
11:04 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
11:04 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1132 (T348183)', diff saved to https://phabricator.wikimedia.org/P54156 and previous config saved to /var/cache/conftool/dbconfig/20231205-110426-arnaudb.json [production]
11:02 <mvernon@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host moss-be1002.eqiad.wmnet with OS bookworm [production]
10:54 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host dbproxy1023.eqiad.wmnet with OS bookworm [production]
10:49 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1132', diff saved to https://phabricator.wikimedia.org/P54155 and previous config saved to /var/cache/conftool/dbconfig/20231205-104919-arnaudb.json [production]
10:45 <aikochou@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]