2023-05-22
05:37 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es2021 T337203', diff saved to https://phabricator.wikimedia.org/P48412 and previous config saved to /var/cache/conftool/dbconfig/20230522-053705-marostegui.json [production]
05:35 <marostegui@cumin1001> dbctl commit (dc=all): 'Promote es2020 to es4 codfw primary T337203', diff saved to https://phabricator.wikimedia.org/P48411 and previous config saved to /var/cache/conftool/dbconfig/20230522-053554-marostegui.json [production]
05:34 <marostegui> Starting es4 codfw failover from es2021 to es2020 - T337203 [production]
05:29 <marostegui@cumin1001> dbctl commit (dc=all): 'Set es2020 with weight 0 T337203', diff saved to https://phabricator.wikimedia.org/P48410 and previous config saved to /var/cache/conftool/dbconfig/20230522-052938-root.json [production]
05:29 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es4 T337203 [production]
05:29 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Primary switchover es4 T337203 [production]
05:28 <marostegui@cumin1001> dbctl commit (dc=all): 'es1031 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48409 and previous config saved to /var/cache/conftool/dbconfig/20230522-052800-root.json [production]
05:27 <marostegui@cumin1001> dbctl commit (dc=all): 'es1030 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48408 and previous config saved to /var/cache/conftool/dbconfig/20230522-052753-root.json [production]
05:27 <marostegui@cumin1001> dbctl commit (dc=all): 'es1029 (re)pooling @ 1%: Repooling after maintenance', diff saved to https://phabricator.wikimedia.org/P48407 and previous config saved to /var/cache/conftool/dbconfig/20230522-052746-root.json [production]
05:19 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es1029, es1030, es1031 for kernel reboots', diff saved to https://phabricator.wikimedia.org/P48406 and previous config saved to /var/cache/conftool/dbconfig/20230522-051957-root.json [production]
05:17 <marostegui@cumin1001> dbctl commit (dc=all): 'Failover es1, es2 and es3 masters for kernel reboots', diff saved to https://phabricator.wikimedia.org/P48405 and previous config saved to /var/cache/conftool/dbconfig/20230522-051723-marostegui.json [production]
2023-05-21
07:45 <jelto@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
07:44 <jelto@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
07:43 <jelto@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
07:42 <jelto@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
07:41 <jelto@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
07:40 <jelto@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
2023-05-20
18:25 <effie> restart varnish cp3061 [production]
16:39 <akosiaris@cumin1001> conftool action : set/pooled=yes; selector: name=parse1018.eqiad.wmnet [production]
15:17 <hoo@deploy1002> Finished scap: Backport for [[gerrit:921549|Remove linkitem dependency on jquery.wikibase.wbtooltip (T337081)]] (duration: 08m 47s) [production]
15:10 <hoo@deploy1002> hoo: Backport for [[gerrit:921549|Remove linkitem dependency on jquery.wikibase.wbtooltip (T337081)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
15:08 <hoo@deploy1002> Started scap: Backport for [[gerrit:921549|Remove linkitem dependency on jquery.wikibase.wbtooltip (T337081)]] [production]
14:41 <akosiaris@cumin1001> conftool action : set/pooled=no; selector: name=parse1018.eqiad.wmnet [production]
09:08 <volans@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:08 <volans@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Added records for the new private.codfw.wikimedia.cloud domain - volans@cumin1001" [production]
09:07 <volans@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Added records for the new private.codfw.wikimedia.cloud domain - volans@cumin1001" [production]
09:00 <volans@cumin1001> START - Cookbook sre.dns.netbox [production]
2023-05-19
21:22 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:22 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add entries for ssw link addresses in eqiad - cmooney@cumin1001" [production]
21:21 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add entries for ssw link addresses in eqiad - cmooney@cumin1001" [production]
21:19 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
20:52 <dzahn@cumin1001> conftool action : set/pooled=no; selector: cluster=jobrunner,name=mw1495.eqiad.wmnet [production]
19:46 <mutante> mw1469 - sudo pkill ffmpeg (per runbook) [production]
19:45 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: cluster=jobrunner,name=mw1469.eqiad.wmnet [production]
19:45 <mutante> depooled mw1469 from videoscaler, dedicating to just jobrunner [production]
19:45 <dzahn@cumin1001> conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw1469.eqiad.wmnet [production]
19:36 <htriedman@deploy1002> Finished deploy [airflow-dags/platform_eng@b34c529]: (no justification provided) (duration: 00m 09s) [production]
19:36 <htriedman@deploy1002> Started deploy [airflow-dags/platform_eng@b34c529]: (no justification provided) [production]
16:55 <mutante> mw2448 - scap pull - T2334429 [production]
15:31 <taavi@deploy1002> Finished scap: Backport for [[gerrit:921150|i18n: Add link to help page (T322717)]], [[gerrit:921326|Enable RealMe (T324535)]] (duration: 22m 02s) [production]
15:21 <taavi@deploy1002> legoktm and taavi: Backport for [[gerrit:921150|i18n: Add link to help page (T322717)]], [[gerrit:921326|Enable RealMe (T324535)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
15:09 <taavi@deploy1002> Started scap: Backport for [[gerrit:921150|i18n: Add link to help page (T322717)]], [[gerrit:921326|Enable RealMe (T324535)]] [production]
15:06 <legoktm@deploy1002> Finished scap: Backport for [[gerrit:921252|Disable GWToolset from Commons (T270911)]] (duration: 09m 46s) [production]
15:06 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
14:59 <elukey@cumin1001> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:ml-serve-worker-eqiad [production]
14:58 <legoktm@deploy1002> legoktm: Backport for [[gerrit:921252|Disable GWToolset from Commons (T270911)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet [production]
14:57 <legoktm@deploy1002> Started scap: Backport for [[gerrit:921252|Disable GWToolset from Commons (T270911)]] [production]
14:40 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
14:36 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on stat1009.eqiad.wmnet with reason: Bringing stat1009 into service [production]
14:36 <stevemunene@cumin1001> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on stat1009.eqiad.wmnet with reason: Bringing stat1009 into service [production]