2022-08-05
14:43 <jbond> upload fressian to puppet7 component [production]
14:40 <pt1979@cumin1001> START - Cookbook sre.hosts.reimage for host db1185.eqiad.wmnet with OS bullseye [production]
14:40 <jbond> upload test-generative-clojure to puppet7 component [production]
14:35 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:34 <jbond> upload data-generators-clojure to puppet7 component [production]
14:31 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
14:23 <jbond> upload encore-clojure to puppet7 component [production]
14:17 <jbond> upload truss-clojure to puppet7 component [production]
14:13 <jbond> upload structured-logging-clojure to puppet7 component [production]
14:06 <jbond> upload murphy-clojure to puppet7 component [production]
13:57 <jbond> upload logstash-logback-encoder-7.2 to puppet7 component [production]
13:49 <jbond> upload kitchensink-clojure to puppet7 component [production]
13:27 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool hosts with fragile power supply (T314559 T314628)', diff saved to https://phabricator.wikimedia.org/P32292 and previous config saved to /var/cache/conftool/dbconfig/20220805-132709-ladsgroup.json [production]
13:12 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db2095.codfw.wmnet with reason: Maintenance [production]
13:12 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db2095.codfw.wmnet with reason: Maintenance [production]
13:09 <sukhe> repool codfw [production]
13:02 <jbond> upload honeysql-clojure to puppet7 component [production]
12:53 <_joe_> progressive repool of services in codfw [production]
12:24 <moritzm> installing nano bugfix updates from bullseye point release [production]
11:50 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
11:40 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
11:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repool after PDU maint on D3 (T310146)', diff saved to https://phabricator.wikimedia.org/P32291 and previous config saved to /var/cache/conftool/dbconfig/20220805-113729-ladsgroup.json [production]
11:35 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repool after PDU maint on C6 (T310145)', diff saved to https://phabricator.wikimedia.org/P32290 and previous config saved to /var/cache/conftool/dbconfig/20220805-113555-ladsgroup.json [production]
11:34 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repool after PDU maint on C5 (T310145)', diff saved to https://phabricator.wikimedia.org/P32289 and previous config saved to /var/cache/conftool/dbconfig/20220805-113436-ladsgroup.json [production]
10:46 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
10:36 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
10:17 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
10:12 <Amir1> dbmaint at s4@codfw (T312863) [production]
10:07 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
09:04 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on 12 hosts with reason: Maintenance [production]
09:03 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on 12 hosts with reason: Maintenance [production]
09:03 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2110.codfw.wmnet with reason: Maintenance [production]
09:03 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2110.codfw.wmnet with reason: Maintenance [production]
00:53 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on gerrit2001.wikimedia.org with reason: decom, replaced by gerrit2002 [production]
00:53 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on gerrit2001.wikimedia.org with reason: decom, replaced by gerrit2002 [production]
00:53 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for gerrit2002.wikimedia.org [production]
00:53 <dzahn@cumin1001> START - Cookbook sre.hosts.remove-downtime for gerrit2002.wikimedia.org [production]
00:52 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on gerrit2002.wikimedia.org with reason: decom, replaced by gerrit2002 [production]
00:52 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on gerrit2002.wikimedia.org with reason: decom, replaced by gerrit2002 [production]
00:18 <mutante> restarting gerrit for config change - removing old replica T313250 [production]
2022-08-04
23:06 <mutante> switching gerrit-replica.wikimedia.org to new machine gerrit2002, dropping gerrit-replica-new.wikimedia.org T313250 [production]
21:07 <ryankemper@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
20:59 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:57 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:57 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:56 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:56 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:819774]] tkwiki: Update wordmark (duration: 06m 12s) [production]
20:51 <ryankemper@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
20:51 <ryankemper@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
20:51 <ryankemper@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]