2023-05-22
§
|
05:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Promote es2020 to es4 codfw primaryT337203', diff saved to https://phabricator.wikimedia.org/P48411 and previous config saved to /var/cache/conftool/dbconfig/20230522-053554-marostegui.json |
[production] |
05:34 |
<marostegui> |
Starting es4 codfw failover from es2021 to es2020 - T337203 |
[production] |
05:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set es2020 with weight 0 T337203', diff saved to https://phabricator.wikimedia.org/P48410 and previous config saved to /var/cache/conftool/dbconfig/20230522-052938-root.json |
[production] |
05:29 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es4 T337203 |
[production] |
05:29 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Primary switchover es4 T337203 |
[production] |
05:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1031 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48409 and previous config saved to /var/cache/conftool/dbconfig/20230522-052800-root.json |
[production] |
05:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1030 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48408 and previous config saved to /var/cache/conftool/dbconfig/20230522-052753-root.json |
[production] |
05:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1029 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48407 and previous config saved to /var/cache/conftool/dbconfig/20230522-052746-root.json |
[production] |
05:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es1029, es1030, es1031 for kernel reboots', diff saved to https://phabricator.wikimedia.org/P48406 and previous config saved to /var/cache/conftool/dbconfig/20230522-051957-root.json |
[production] |
05:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Failover es1, es2 and es3 masters for kernel reboots', diff saved to https://phabricator.wikimedia.org/P48405 and previous config saved to /var/cache/conftool/dbconfig/20230522-051723-marostegui.json |
[production] |
2023-05-20
§
|
18:25 |
<effie> |
restart varnish cp3061 |
[production] |
16:39 |
<akosiaris@cumin1001> |
conftool action : set/pooled=yes; selector: name=parse1018.eqiad.wmnet |
[production] |
15:17 |
<hoo@deploy1002> |
Finished scap: Backport for [[gerrit:921549|Remove linkitem dependency on jquery.wikibase.wbtooltip (T337081)]] (duration: 08m 47s) |
[production] |
15:10 |
<hoo@deploy1002> |
hoo: Backport for [[gerrit:921549|Remove linkitem dependency on jquery.wikibase.wbtooltip (T337081)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
15:08 |
<hoo@deploy1002> |
Started scap: Backport for [[gerrit:921549|Remove linkitem dependency on jquery.wikibase.wbtooltip (T337081)]] |
[production] |
14:41 |
<akosiaris@cumin1001> |
conftool action : set/pooled=no; selector: name=parse1018.eqiad.wmnet |
[production] |
09:08 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
09:08 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Added records for the new private.codfw.wikimedia.cloud domain - volans@cumin1001" |
[production] |
09:07 |
<volans@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Added records for the new private.codfw.wikimedia.cloud domain - volans@cumin1001" |
[production] |
09:00 |
<volans@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
2023-05-19
§
|
21:22 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:22 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add entries for ssw link addresses in eqiad - cmooney@cumin1001" |
[production] |
21:21 |
<cmooney@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add entries for ssw link addresses in eqiad - cmooney@cumin1001" |
[production] |
21:19 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
20:52 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: cluster=jobrunner,name=mw1495.eqiad.wmnet |
[production] |
19:46 |
<mutante> |
mw1469 - sudo pkill ffmpeg (per runbook) |
[production] |
19:45 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: cluster=jobrunner,name=mw1469.eqiad.wmnet |
[production] |
19:45 |
<mutante> |
depooled mw1469 from videoscaler, dedicating to just jobrunner |
[production] |
19:45 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw1469.eqiad.wmnet |
[production] |
19:36 |
<htriedman@deploy1002> |
Finished deploy [airflow-dags/platform_eng@b34c529]: (no justification provided) (duration: 00m 09s) |
[production] |
19:36 |
<htriedman@deploy1002> |
Started deploy [airflow-dags/platform_eng@b34c529]: (no justification provided) |
[production] |
16:55 |
<mutante> |
mw2448 - scap pull - T2334429 |
[production] |
15:31 |
<taavi@deploy1002> |
Finished scap: Backport for [[gerrit:921150|i18n: Add link to help page (T322717)]], [[gerrit:921326|Enable RealMe (T324535)]] (duration: 22m 02s) |
[production] |
15:21 |
<taavi@deploy1002> |
legoktm and taavi: Backport for [[gerrit:921150|i18n: Add link to help page (T322717)]], [[gerrit:921326|Enable RealMe (T324535)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
15:09 |
<taavi@deploy1002> |
Started scap: Backport for [[gerrit:921150|i18n: Add link to help page (T322717)]], [[gerrit:921326|Enable RealMe (T324535)]] |
[production] |
15:06 |
<legoktm@deploy1002> |
Finished scap: Backport for [[gerrit:921252|Disable GWToolset from Commons (T270911)]] (duration: 09m 46s) |
[production] |
15:06 |
<isaranto@deploy1002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
14:59 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:ml-serve-worker-eqiad |
[production] |
14:58 |
<legoktm@deploy1002> |
legoktm: Backport for [[gerrit:921252|Disable GWToolset from Commons (T270911)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
14:57 |
<legoktm@deploy1002> |
Started scap: Backport for [[gerrit:921252|Disable GWToolset from Commons (T270911)]] |
[production] |
14:40 |
<isaranto@deploy1002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
14:36 |
<stevemunene@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on stat1009.eqiad.wmnet with reason: Bringing stat1009 into service |
[production] |
14:36 |
<stevemunene@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on stat1009.eqiad.wmnet with reason: Bringing stat1009 into service |
[production] |
14:35 |
<sukhe> |
enable puppet on A:lvs, finished rolling out change |
[production] |