2022-08-05
§
|
14:31 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
14:23 |
<jbond> |
upload encore-clojure to puppet7 component |
[production] |
14:17 |
<jbond> |
upload truss-clojure to puppet7 component |
[production] |
14:13 |
<jbond> |
upload structured-logging-clojure to puppet7 component |
[production] |
14:06 |
<jbond> |
upload murphy-clojure to puppet7 component |
[production] |
13:57 |
<jbond> |
upload logstash-logback-encoder-7.2 to puppet7 component |
[production] |
13:49 |
<jbond> |
upload kitchensink-clojure to puppet7 component |
[production] |
13:27 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depool hosts with fragile power supply (T314559 T314628)', diff saved to https://phabricator.wikimedia.org/P32292 and previous config saved to /var/cache/conftool/dbconfig/20220805-132709-ladsgroup.json |
[production] |
13:12 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db2095.codfw.wmnet with reason: Maintenance |
[production] |
13:12 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db2095.codfw.wmnet with reason: Maintenance |
[production] |
13:09 |
<sukhe> |
repool codfw |
[production] |
13:02 |
<jbond> |
upload honeysql-clojure to puppet7 component |
[production] |
12:53 |
<_joe_> |
progressive repool of services in codfw |
[production] |
12:24 |
<moritzm> |
installing nano bugfix updates from bullseye point release |
[production] |
11:50 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
11:40 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
11:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repool after PDU maint on D3 (T310146)', diff saved to https://phabricator.wikimedia.org/P32291 and previous config saved to /var/cache/conftool/dbconfig/20220805-113729-ladsgroup.json |
[production] |
11:35 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repool after PDU maint on C6 (T310145)', diff saved to https://phabricator.wikimedia.org/P32290 and previous config saved to /var/cache/conftool/dbconfig/20220805-113555-ladsgroup.json |
[production] |
11:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repool after PDU maint on C5 (T310145)', diff saved to https://phabricator.wikimedia.org/P32289 and previous config saved to /var/cache/conftool/dbconfig/20220805-113436-ladsgroup.json |
[production] |
10:46 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
10:36 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
10:17 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
10:12 |
<Amir1> |
dbmaint at s4@codfw (T312863) |
[production] |
10:07 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
09:04 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on 12 hosts with reason: Maintenance |
[production] |
09:03 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on 12 hosts with reason: Maintenance |
[production] |
09:03 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2110.codfw.wmnet with reason: Maintenance |
[production] |
09:03 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2110.codfw.wmnet with reason: Maintenance |
[production] |
00:53 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on gerrit2001.wikimedia.org with reason: decom, replaced by gerrit2002 |
[production] |
00:53 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on gerrit2001.wikimedia.org with reason: decom, replaced by gerrit2002 |
[production] |
00:53 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for gerrit2002.wikimedia.org |
[production] |
00:53 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for gerrit2002.wikimedia.org |
[production] |
00:52 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on gerrit2002.wikimedia.org with reason: decom, replaced by gerrit2002 |
[production] |
00:52 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on gerrit2002.wikimedia.org with reason: decom, replaced by gerrit2002 |
[production] |
00:18 |
<mutante> |
restarting gerrit for config change - removing old replica T313250 |
[production] |
2022-08-04
§
|
23:06 |
<mutante> |
switching gerrit-replica.wikimedia.org to new machine gerrit2002, dropping gerrit-replica-new.wikimedia.org T313250 |
[production] |
21:07 |
<ryankemper@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply |
[production] |
20:59 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:57 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
20:57 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:56 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
20:56 |
<thcipriani@deploy1002> |
Finished scap: Backport for [[gerrit:819774]] tkwiki: Update wordmark (duration: 06m 12s) |
[production] |
20:51 |
<ryankemper@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply |
[production] |
20:51 |
<ryankemper@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply |
[production] |
20:51 |
<ryankemper@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply |
[production] |
20:50 |
<thcipriani@deploy1002> |
Started scap: Backport for [[gerrit:819774]] tkwiki: Update wordmark |
[production] |
20:48 |
<thcipriani@deploy1002> |
Finished scap: Backport for [[gerrit:812391]] [config]: Add click event logging for mobile and desktop (duration: 39m 16s) |
[production] |
20:45 |
<ryankemper@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply |
[production] |
20:24 |
<ryankemper@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply |
[production] |
20:23 |
<ryankemper@deploy1002> |
helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply |
[production] |