2023-01-09
14:55 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host registry2003.codfw.wmnet [production]
14:52 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1004.eqiad.wmnet [production]
14:48 <lucaswerkmeister-wmde@deploy1002> lucaswerkmeister-wmde and stang: Backport for [[gerrit:876364|jawikisource: Update project logo and wordmark (T326488)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
14:48 <SandraEbele> reran webrequest failed jobs ‘sudo -u analytics kerberos-run-command analytics oozie job --oozie $OOZIE_URL -Dstart_time=2023-01-08T07:00Z -Dstop_time=2023-01-08T14:59Z -Dwebrequest_source=text -Derror_incomplete_data_threshold=100 -Dwarning_incomplete_data_threshold=100 -Derror_data_loss_threshold=100 -Dwarning_data_loss_threshold=100 -submit -config /home/ebysans/webrequest_text_coordinator.properties’ [analytics]
14:47 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host registry1004.eqiad.wmnet [production]
14:47 <lucaswerkmeister-wmde@deploy1002> Started scap: Backport for [[gerrit:876364|jawikisource: Update project logo and wordmark (T326488)]] [production]
14:45 <lucaswerkmeister-wmde@deploy1002> Finished scap: Backport for [[gerrit:876310|arwiki: Create extendedmover group (T326434)]] (duration: 08m 56s) [production]
14:38 <lucaswerkmeister-wmde@deploy1002> lucaswerkmeister-wmde and stang: Backport for [[gerrit:876310|arwiki: Create extendedmover group (T326434)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet [production]
14:36 <lucaswerkmeister-wmde@deploy1002> Started scap: Backport for [[gerrit:876310|arwiki: Create extendedmover group (T326434)]] [production]
14:31 <godog> upgrade thanos to 0.30.1 on prometheus2005 - T303154 [production]
14:27 <lucaswerkmeister-wmde@deploy1002> Finished scap: Backport for [[gerrit:871286|mediawikiwiki: Disable Flow on new pages by default (T325907)]] (duration: 18m 19s) [production]
14:19 <lucaswerkmeister-wmde@deploy1002> lucaswerkmeister-wmde and stang: Backport for [[gerrit:871286|mediawikiwiki: Disable Flow on new pages by default (T325907)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
14:09 <lucaswerkmeister-wmde@deploy1002> Started scap: Backport for [[gerrit:871286|mediawikiwiki: Disable Flow on new pages by default (T325907)]] [production]
13:55 <moritzm> installing systemd bugfix updates from Bullseye point release [production]
13:41 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1003.eqiad.wmnet [production]
13:36 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host registry1003.eqiad.wmnet [production]
13:35 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) [production]
13:35 <jayme@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=helm-charts,name=eqiad [production]
12:53 <hnowlan@deploy1002> Finished deploy [restbase/deploy@bcb0a69]: New wikis T321284 T321290 T321296 T326140 (duration: 18m 56s) [production]
12:34 <hnowlan@deploy1002> Started deploy [restbase/deploy@bcb0a69]: New wikis T321284 T321290 T321296 T326140 [production]
12:18 <vgutierrez> repool cp5025 [production]
11:41 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 15954 [production]
11:40 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 15954 [production]
11:29 <vgutierrez> restart purged on cp5025 [production]
11:28 <vgutierrez> depool cp5025 due to purging issues [production]
11:23 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host chartmuseum1001.eqiad.wmnet [production]
11:19 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host chartmuseum1001.eqiad.wmnet [production]
11:06 <XioNoX> repool ulsfo - T316532 [production]
11:01 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2050.codfw.wmnet [production]
10:55 <cgoubert@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
10:55 <jiji@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=codfw [production]
10:54 <jayme@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=helm-charts,name=eqiad [production]
10:54 <claime> Starting codfw appserver rolling reboot [production]
10:54 <jayme@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=helm-charts,name=codfw [production]
10:54 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be2050.codfw.wmnet [production]
10:54 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host chartmuseum2001.codfw.wmnet [production]
10:51 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode1001.eqiad.wmnet [production]
10:49 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host chartmuseum2001.codfw.wmnet [production]
10:49 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode1001.eqiad.wmnet [production]
10:48 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode2001.codfw.wmnet [production]
10:46 <jiji@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=eqiad [production]
10:46 <effie> switching maps to eqiad [production]
10:45 <moritzm> installing avahi security updates [production]
10:44 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode2001.codfw.wmnet [production]
10:41 <jayme@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=helm-charts,name=codfw [production]
10:34 <hashar> Restarted zuul-merger on contint1002 it was listening but not processing any requests [releng]
10:21 <aqu> backfilling with refine_event on an-launcher1002 `sudo -u analytics kerberos-run-command analytics /usr/local/bin/refine_event --ignore_failure_flag=true --since=2023-01-07T16:00:00 --until=2023-01-09T09:00:00 --verbose` [analytics]
09:48 <aqu> killing refine_event yarn application `sudo -u analytics yarn application -kill application_1663082229270_682638` [analytics]
09:39 <aqu> Manually kill the Spark process on an-launcher1002 `sudo -u analytics kill -9 28538` [analytics]
09:35 <dcausse> restarting blazegraph on wdqs1006 (BlazegraphFreeAllocatorsDecreasingRapidly) [production]