751-800 of 10000 results (45ms)
2023-03-16 §
11:16 <cgoubert@cumin1001> START - Cookbook sre.hosts.downtime for 5:00:00 on 32 hosts with reason: new_install [production]
11:10 <hnowlan@puppetmaster1001> conftool action : set/weight=2; selector: service=thumbor,name=kubernetes101[0123].eqiad.wmnet [production]
11:07 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_eqsin [production]
11:06 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_drmrs [production]
11:06 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_drmrs [production]
11:04 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes:weight=4; selector: service=thumbor,name=kubernetes101[0123].eqiad.wmnet [production]
10:52 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_codfw [production]
10:50 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_codfw [production]
10:42 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
10:42 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
10:40 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
10:39 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
10:38 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_eqsin [production]
10:37 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_eqsin [production]
10:33 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
10:33 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 32 hosts with reason: new_install [production]
10:32 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
10:32 <cgoubert@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 32 hosts with reason: new_install [production]
10:32 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_codfw [production]
10:31 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_codfw [production]
10:31 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . [production]
10:31 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . [production]
10:31 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . [production]
10:31 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . [production]
10:30 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
10:29 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
10:28 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
10:26 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
10:09 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1179 to move it to x1', diff saved to https://phabricator.wikimedia.org/P45885 and previous config saved to /var/cache/conftool/dbconfig/20230316-100945-root.json [production]
08:51 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db1105.eqiad.wmnet [production]
08:51 <marostegui@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:51 <marostegui@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1105.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1001" [production]
08:49 <marostegui@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1105.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1001" [production]
08:48 <marostegui@cumin1001> START - Cookbook sre.dns.netbox [production]
08:43 <marostegui@cumin1001> START - Cookbook sre.hosts.decommission for hosts db1105.eqiad.wmnet [production]
08:40 <kostajh> UTC morning deploys (second round) done [production]
08:40 <kharlan@deploy2002> Finished scap: Backport for [[gerrit:900126|SuggestedEditSession: Fix handling of post-save data refresh]], [[gerrit:899605|Leveling up: always set wgGELevelingUpEnabledForUser (T332227)]] (duration: 12m 30s) [production]
08:29 <kharlan@deploy2002> kharlan: Backport for [[gerrit:900126|SuggestedEditSession: Fix handling of post-save data refresh]], [[gerrit:899605|Leveling up: always set wgGELevelingUpEnabledForUser (T332227)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
08:27 <kharlan@deploy2002> Started scap: Backport for [[gerrit:900126|SuggestedEditSession: Fix handling of post-save data refresh]], [[gerrit:899605|Leveling up: always set wgGELevelingUpEnabledForUser (T332227)]] [production]
08:11 <apergos> additional deployments for the UTC morning backport and config training window, running into the next hour, so window re-opened [production]
07:36 <tgr_> UTC morning deploys done [production]
07:34 <tgr@deploy2002> Finished scap: Backport for [[gerrit:900026|Leveling up: Backport recent changes]] (duration: 08m 13s) [production]
07:28 <tgr@deploy2002> tgr: Backport for [[gerrit:900026|Leveling up: Backport recent changes]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
07:26 <tgr@deploy2002> Started scap: Backport for [[gerrit:900026|Leveling up: Backport recent changes]] [production]
06:23 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove db1105 from dbctl T331874', diff saved to https://phabricator.wikimedia.org/P45883 and previous config saved to /var/cache/conftool/dbconfig/20230316-062307-root.json [production]
06:03 <marostegui> Failover m5 from db1106 to db1176 - T332155 [production]
05:59 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: m5 master switch T332155 [production]
05:59 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: m5 master switch T332155 [production]
03:29 <ejegg> payments-wiki upgraded from 1532b107 to 0fd66b1f [production]
2023-03-15 §
22:55 <tzatziki> Removing 1 file for legal compliance [production]