7951-8000 of 10000 results (86ms)
2022-12-05 ยง
14:02 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2127.codfw.wmnet with reason: Maintenance [production]
13:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool db2127 T324180', diff saved to https://phabricator.wikimedia.org/P42247 and previous config saved to /var/cache/conftool/dbconfig/20221205-135932-ladsgroup.json [production]
13:55 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Promote db2105 to s3 primary T324180', diff saved to https://phabricator.wikimedia.org/P42246 and previous config saved to /var/cache/conftool/dbconfig/20221205-135539-ladsgroup.json [production]
13:54 <Amir1> Starting s3 codfw failover from db2127 to db2105 - T324180 [production]
13:51 <dcausse> repooling wdqs1004 [production]
13:44 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 55818 [production]
13:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Set db2105 with weight 0 T324180', diff saved to https://phabricator.wikimedia.org/P42245 and previous config saved to /var/cache/conftool/dbconfig/20221205-134346-ladsgroup.json [production]
13:43 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Primary switchover s3 T324180 [production]
13:42 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 23 hosts with reason: Primary switchover s3 T324180 [production]
13:32 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 55818 [production]
13:31 <TheresNoTime> T302486 : [samtar@mwmaint1002 ~]$ mwscript maintenance/fixMergeHistoryCorruption.php --wiki enwiki --ns 828 --delete [production]
13:24 <moritzm> installing postgresql-common bugfix updates from Buster 10.13 point release [production]
13:17 <moritzm> installing distro-info-data bugfix updates from Buster 10.13 point release [production]
13:12 <moritzm> installing libnet-ssleay-perl bugfix updates from Buster 10.13 point release [production]
12:50 <moritzm> installing python-keystoneauth1 bugfix updates from Buster 10.13 point release [production]
12:41 <root@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:41 <root@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'sync'. [production]
12:41 <root@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
12:39 <root@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
11:59 <oblivian@deploy1002> helmfile [codfw] DONE helmfile.d/services/shellbox: apply [production]
11:59 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/api-gateway: sync [production]
11:59 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/api-gateway: sync [production]
11:58 <oblivian@deploy1002> helmfile [codfw] START helmfile.d/services/shellbox: apply [production]
11:53 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/services/shellbox: apply [production]
11:52 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/services/shellbox: apply [production]
11:51 <oblivian@deploy1002> helmfile [staging] DONE helmfile.d/services/shellbox: apply [production]
11:50 <oblivian@deploy1002> helmfile [staging] START helmfile.d/services/shellbox: apply [production]
11:37 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1206 with more weight', diff saved to https://phabricator.wikimedia.org/P42243 and previous config saved to /var/cache/conftool/dbconfig/20221205-113746-marostegui.json [production]
11:31 <moritzm> installing librsvg bugfix updates from buster point release [production]
11:18 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1206 with more weight', diff saved to https://phabricator.wikimedia.org/P42242 and previous config saved to /var/cache/conftool/dbconfig/20221205-111836-marostegui.json [production]
11:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on idp-test1002.wikimedia.org with reason: Various tests which may cause temporary breakage on idp-test.w.o [production]
11:09 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on idp-test1002.wikimedia.org with reason: Various tests which may cause temporary breakage on idp-test.w.o [production]
11:07 <hashar> Restarted Zuul to clear a stuck ssh connection with Gerrit - T309376 [production]
10:33 <kostajh> UTC morning deploys done [production]
10:32 <godog> contint1001 - racadm serveraction powercyle - crashed [production]
10:31 <kharlan@deploy1002> Finished scap: Backport for [[gerrit:864713|User impact: Show discovery notice to mobile users (T323619)]] (duration: 09m 30s) [production]
10:30 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1206 with more weight', diff saved to https://phabricator.wikimedia.org/P42241 and previous config saved to /var/cache/conftool/dbconfig/20221205-103028-marostegui.json [production]
10:23 <kharlan@deploy1002> kharlan and kharlan: Backport for [[gerrit:864713|User impact: Show discovery notice to mobile users (T323619)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
10:22 <kharlan@deploy1002> Started scap: Backport for [[gerrit:864713|User impact: Show discovery notice to mobile users (T323619)]] [production]
10:14 <Emperor> rebalance thanos rings T311690 [production]
10:06 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1206 with more weight', diff saved to https://phabricator.wikimedia.org/P42240 and previous config saved to /var/cache/conftool/dbconfig/20221205-100607-marostegui.json [production]
10:05 <kharlan@deploy1002> Finished scap: Backport for [[gerrit:864712|User impact: Show discovery tour to desktop users who had old module (T323619)]] (duration: 27m 33s) [production]
09:50 <kharlan@deploy1002> kharlan and kharlan: Backport for [[gerrit:864712|User impact: Show discovery tour to desktop users who had old module (T323619)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
09:39 <moritzm> restarting mediawiki canaries to pick up freetype security updates [production]
09:38 <godog> force a puppet run on physical hosts to pick up https://gerrit.wikimedia.org/r/c/operations/puppet/+/860572 [production]
09:37 <kharlan@deploy1002> Started scap: Backport for [[gerrit:864712|User impact: Show discovery tour to desktop users who had old module (T323619)]] [production]
09:36 <moritzm> installing freetype security updates [production]
09:15 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1206 with more weight', diff saved to https://phabricator.wikimedia.org/P42239 and previous config saved to /var/cache/conftool/dbconfig/20221205-091547-marostegui.json [production]
09:15 <kharlan@deploy1002> backport aborted: (duration: 00m 25s) [production]
09:14 <kharlan@deploy1002> Finished scap: Backport for [[gerrit:864666|Fix ExpensiveUserImpact input validation (T324312)]] (duration: 09m 10s) [production]