101-150 of 10000 results (72ms)
2023-10-30 §
14:37 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on search-loader2001.codfw.wmnet with reason: T346039 [production]
14:36 <inflatador> bking@search-loader2001 disabling services as part of bullseye migration T346039 [production]
14:34 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4052.ulsfo.wmnet with reason: host reimage [production]
14:32 <elukey@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: sync [production]
14:31 <elukey@deploy2002> helmfile [staging] START helmfile.d/services/changeprop: sync [production]
14:06 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host cp4052.ulsfo.wmnet with OS bookworm [production]
12:55 <arnaudb@cumin1001> END (PASS) - Cookbook sre.mysql.clone (exit_code=0) of db1130.eqiad.wmnet onto db1230.eqiad.wmnet [production]
12:47 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1217.eqiad.wmnet with OS bookworm [production]
12:28 <marostegui@cumin1001> dbctl commit (dc=all): 'New host', diff saved to https://phabricator.wikimedia.org/P53065 and previous config saved to /var/cache/conftool/dbconfig/20231030-122855-marostegui.json [production]
12:26 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1217.eqiad.wmnet with reason: host reimage [production]
12:24 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1217.eqiad.wmnet with reason: host reimage [production]
12:11 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1217.eqiad.wmnet with OS bookworm [production]
11:52 <arnaudb@cumin1001> START - Cookbook sre.mysql.clone of db1130.eqiad.wmnet onto db1230.eqiad.wmnet [production]
11:34 <arnaudb@cumin1001> dbctl commit (dc=all): 'Adding db1230 depooled, depooling db1130', diff saved to https://phabricator.wikimedia.org/P53064 and previous config saved to /var/cache/conftool/dbconfig/20231030-113401-arnaudb.json [production]
11:28 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1230.eqiad.wmnet with reason: provisionning db1230.eqiad.wmnet - T344036 [production]
11:28 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1230.eqiad.wmnet with reason: provisionning db1230.eqiad.wmnet - T344036 [production]
11:28 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1130.eqiad.wmnet with reason: provisionning db1230.eqiad.wmnet - T344036 [production]
11:28 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1130.eqiad.wmnet with reason: provisionning db1230.eqiad.wmnet - T344036 [production]
09:42 <jnuche@deploy2002> Finished deploy [releng/jenkins-deploy@af33784] (releasing): (no justification provided) (duration: 00m 40s) [production]
09:42 <jnuche@deploy2002> Started deploy [releng/jenkins-deploy@af33784] (releasing): (no justification provided) [production]
08:29 <vgutierrez> switched to digicert-2023 in esams, eqsin and drmrs - T341119 [production]
08:17 <wmde-fisch@deploy2002> Finished scap: Backport for [[gerrit:966520|Cleanup Kartographer Nearby flags (T332785)]] (duration: 07m 35s) [production]
08:12 <wmde-fisch@deploy2002> wmde-fisch: Continuing with sync [production]
08:11 <wmde-fisch@deploy2002> wmde-fisch: Backport for [[gerrit:966520|Cleanup Kartographer Nearby flags (T332785)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:10 <wmde-fisch@deploy2002> Started scap: Backport for [[gerrit:966520|Cleanup Kartographer Nearby flags (T332785)]] [production]
08:10 <vgutierrez> triggering a puppet run on cp hosts in esams, eqsin and drmrs to switch to the new unified digicert certificates - T341119 [production]
08:06 <vgutierrez> repool cp5025 - T341119 [production]
08:06 <marostegui@deploy2002> Finished scap: Backport for [[gerrit:969361|Revert "ProductionServices.php: Promote pc1014 to pc1 master"]] (duration: 06m 41s) [production]
08:01 <marostegui@deploy2002> marostegui: Continuing with sync [production]
08:00 <marostegui@deploy2002> marostegui: Backport for [[gerrit:969361|Revert "ProductionServices.php: Promote pc1014 to pc1 master"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:59 <marostegui@deploy2002> Started scap: Backport for [[gerrit:969361|Revert "ProductionServices.php: Promote pc1014 to pc1 master"]] [production]
07:52 <vgutierrez> depool cp5025 to perform some digicert-2023 related sanity checks - T341119 [production]
07:49 <marostegui@deploy2002> Finished scap: Backport for [[gerrit:969667|ProductionServices.php: Promote pc1014 to pc1 master]] (duration: 06m 36s) [production]
07:48 <marostegui@deploy2002> marostegui: Continuing with sync [production]
07:44 <marostegui@deploy2002> marostegui: Backport for [[gerrit:969667|ProductionServices.php: Promote pc1014 to pc1 master]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:43 <marostegui@deploy2002> Started scap: Backport for [[gerrit:969667|ProductionServices.php: Promote pc1014 to pc1 master]] [production]
07:35 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on an-airflow1007.eqiad.wmnet with reason: Downtime as we setup the new WMDE Airflow instance [production]
07:34 <stevemunene@cumin1001> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on an-airflow1007.eqiad.wmnet with reason: Downtime as we setup the new WMDE Airflow instance [production]
07:29 <marostegui@deploy2002> Finished scap: Backport for [[gerrit:969360|Revert "ProductionServices.php: Promote pc1014 to pc1 master"]] (duration: 06m 33s) [production]
07:24 <marostegui@deploy2002> marostegui: Continuing with sync [production]
07:24 <marostegui@deploy2002> marostegui: Backport for [[gerrit:969360|Revert "ProductionServices.php: Promote pc1014 to pc1 master"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:22 <marostegui@deploy2002> Started scap: Backport for [[gerrit:969360|Revert "ProductionServices.php: Promote pc1014 to pc1 master"]] [production]
07:22 <marostegui@deploy2002> Finished scap: Backport for [[gerrit:969533|ProductionServices.php: Promote pc1014 to pc1 master]] (duration: 14m 04s) [production]
07:18 <elukey> arm keyholder on acmechief2002 and deploy1002 [production]
07:16 <marostegui@deploy2002> marostegui: Continuing with sync [production]
07:16 <marostegui@deploy2002> marostegui: Backport for [[gerrit:969533|ProductionServices.php: Promote pc1014 to pc1 master]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:08 <marostegui@deploy2002> Started scap: Backport for [[gerrit:969533|ProductionServices.php: Promote pc1014 to pc1 master]] [production]
2023-10-28 §
21:25 <fabfur> re-pooled cp1089 and cp3069 [production]
21:05 <fabfur> depooled cp1089 and cp3069 to restart varnish|haproxy and let purged process incoming messages [production]
20:20 <fabfur> restarted purged on cp1089, cp6005, cp3069 [production]