2022-08-01
08:50 <jelto@cumin1001> START - Cookbook sre.hosts.reboot-single for host gitlab-runner2002.codfw.wmnet [production]
08:50 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab-runner1004.eqiad.wmnet [production]
08:48 <godog> thanos-be2004: copy quarantined and tmp off sdb3 and into sdb4 for analysis and to free space - T314275 [production]
08:48 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
08:47 <ladsgroup@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:818998|Stop writing to the old templatelinks columns in itwikisource (T312865)]] (duration: 03m 12s) [production]
08:43 <vgutierrez> rolling upgrade of HAProxy to version 2.4.18 [production]
08:43 <kevinbazira@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
08:41 <kevinbazira@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
08:39 <jelto@cumin1001> START - Cookbook sre.hosts.reboot-single for host gitlab-runner1004.eqiad.wmnet [production]
08:39 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab-runner1003.eqiad.wmnet [production]
08:28 <jelto@cumin1001> START - Cookbook sre.hosts.reboot-single for host gitlab-runner1003.eqiad.wmnet [production]
08:25 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab-runner1002.eqiad.wmnet [production]
08:14 <jelto@cumin1001> START - Cookbook sre.hosts.reboot-single for host gitlab-runner1002.eqiad.wmnet [production]
06:19 <oblivian@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=(appservers|api)-ro,name=codfw [production]
06:14 <oblivian@puppetmaster1001> conftool action : set/ttl=10; selector: dnsdisc=appservers-ro [production]
06:13 <oblivian@puppetmaster1001> conftool action : set/ttl=10; selector: dnsdisc=appserver-ro [production]
06:13 <oblivian@puppetmaster1001> conftool action : set/ttl=10; selector: dnsdisc=(appserver|api)-ro [production]
05:43 <moritzm> installing Linux 5.10.127-2 on Gitlab runners [production]
01:00 <krinkle@deploy1002> Synchronized multiversion/: Ic0dbcba9f60f20a (duration: 03m 31s) [production]
00:57 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
00:56 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
00:56 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
00:53 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
00:45 <krinkle@deploy1002> Synchronized multiversion/MWMultiVersion.php: I9d363abd7cfef (duration: 03m 17s) [production]
00:43 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
00:42 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
00:42 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
00:39 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
2022-07-31
23:29 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
23:25 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
23:25 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
23:22 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
23:20 <krinkle@deploy1002> Synchronized dblists-index.php: I814ee93b5c (duration: 03m 20s) [production]
23:17 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
23:13 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
23:13 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
23:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
22:54 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
22:53 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
22:53 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
22:52 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
18:19 <vgutierrez@puppetmaster1001> conftool action : set/pooled=inactive; selector: name=cp5001.eqsin.wmnet [production]
18:14 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on cp5001.eqsin.wmnet with reason: depooled: faulty DIMM: T314256 [production]
18:13 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on cp5001.eqsin.wmnet with reason: depooled: faulty DIMM: T314256 [production]
18:12 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5001.eqsin.wmnet,service=ats-tls [production]
18:12 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5001.eqsin.wmnet,service=varnish-fe [production]
18:12 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5001.eqsin.wmnet,service=ats-be [production]
2022-07-30
01:44 <bking@cumin1001> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_codfw: codfw cluster reimage (bullseye upgrade) - bking@cumin1001 - T289135 [production]
01:44 <bking@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2028.codfw.wmnet with OS bullseye [production]
00:55 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host elastic2028.codfw.wmnet with OS bullseye [production]