|
2023-01-09
§
|
| 14:55 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host registry2003.codfw.wmnet |
[production] |
| 14:52 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1004.eqiad.wmnet |
[production] |
| 14:48 |
<lucaswerkmeister-wmde@deploy1002> |
lucaswerkmeister-wmde and stang: Backport for [[gerrit:876364|jawikisource: Update project logo and wordmark (T326488)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
| 14:48 |
<SandraEbele> |
reran webrequest failed jobs ‘sudo -u analytics kerberos-run-command analytics oozie job --oozie $OOZIE_URL -Dstart_time=2023-01-08T07:00Z -Dstop_time=2023-01-08T14:59Z -Dwebrequest_source=text -Derror_incomplete_data_threshold=100 -Dwarning_incomplete_data_threshold=100 -Derror_data_loss_threshold=100 -Dwarning_data_loss_threshold=100 -submit -config /home/ebysans/webrequest_text_coordinator.properties’ |
[analytics] |
| 14:47 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host registry1004.eqiad.wmnet |
[production] |
| 14:47 |
<lucaswerkmeister-wmde@deploy1002> |
Started scap: Backport for [[gerrit:876364|jawikisource: Update project logo and wordmark (T326488)]] |
[production] |
| 14:45 |
<lucaswerkmeister-wmde@deploy1002> |
Finished scap: Backport for [[gerrit:876310|arwiki: Create extendedmover group (T326434)]] (duration: 08m 56s) |
[production] |
| 14:38 |
<lucaswerkmeister-wmde@deploy1002> |
lucaswerkmeister-wmde and stang: Backport for [[gerrit:876310|arwiki: Create extendedmover group (T326434)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
| 14:36 |
<lucaswerkmeister-wmde@deploy1002> |
Started scap: Backport for [[gerrit:876310|arwiki: Create extendedmover group (T326434)]] |
[production] |
| 14:31 |
<godog> |
upgrade thanos to 0.30.1 on prometheus2005 - T303154 |
[production] |
| 14:27 |
<lucaswerkmeister-wmde@deploy1002> |
Finished scap: Backport for [[gerrit:871286|mediawikiwiki: Disable Flow on new pages by default (T325907)]] (duration: 18m 19s) |
[production] |
| 14:19 |
<lucaswerkmeister-wmde@deploy1002> |
lucaswerkmeister-wmde and stang: Backport for [[gerrit:871286|mediawikiwiki: Disable Flow on new pages by default (T325907)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
| 14:09 |
<lucaswerkmeister-wmde@deploy1002> |
Started scap: Backport for [[gerrit:871286|mediawikiwiki: Disable Flow on new pages by default (T325907)]] |
[production] |
| 13:55 |
<moritzm> |
installing systemd bugfix updates from Bullseye point release |
[production] |
| 13:41 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1003.eqiad.wmnet |
[production] |
| 13:36 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host registry1003.eqiad.wmnet |
[production] |
| 13:35 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) |
[production] |
| 13:35 |
<jayme@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=helm-charts,name=eqiad |
[production] |
| 12:53 |
<hnowlan@deploy1002> |
Finished deploy [restbase/deploy@bcb0a69]: New wikis T321284 T321290 T321296 T326140 (duration: 18m 56s) |
[production] |
| 12:34 |
<hnowlan@deploy1002> |
Started deploy [restbase/deploy@bcb0a69]: New wikis T321284 T321290 T321296 T326140 |
[production] |
| 12:18 |
<vgutierrez> |
repool cp5025 |
[production] |
| 11:41 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 15954 |
[production] |
| 11:40 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 15954 |
[production] |
| 11:29 |
<vgutierrez> |
restart purged on cp5025 |
[production] |
| 11:28 |
<vgutierrez> |
depool cp5025 due to purging issues |
[production] |
| 11:23 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host chartmuseum1001.eqiad.wmnet |
[production] |
| 11:19 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host chartmuseum1001.eqiad.wmnet |
[production] |
| 11:06 |
<XioNoX> |
repool ulsfo - T316532 |
[production] |
| 11:01 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2050.codfw.wmnet |
[production] |
| 10:55 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.reboot-cluster |
[production] |
| 10:55 |
<jiji@cumin1001> |
conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=codfw |
[production] |
| 10:54 |
<jayme@cumin1001> |
conftool action : set/pooled=false; selector: dnsdisc=helm-charts,name=eqiad |
[production] |
| 10:54 |
<claime> |
Starting codfw appserver rolling reboot |
[production] |
| 10:54 |
<jayme@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=helm-charts,name=codfw |
[production] |
| 10:54 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ms-be2050.codfw.wmnet |
[production] |
| 10:54 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host chartmuseum2001.codfw.wmnet |
[production] |
| 10:51 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode1001.eqiad.wmnet |
[production] |
| 10:49 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host chartmuseum2001.codfw.wmnet |
[production] |
| 10:49 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode1001.eqiad.wmnet |
[production] |
| 10:48 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode2001.codfw.wmnet |
[production] |
| 10:46 |
<jiji@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=eqiad |
[production] |
| 10:46 |
<effie> |
switching maps to eqiad |
[production] |
| 10:45 |
<moritzm> |
installing avahi security updates |
[production] |
| 10:44 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode2001.codfw.wmnet |
[production] |
| 10:41 |
<jayme@cumin1001> |
conftool action : set/pooled=false; selector: dnsdisc=helm-charts,name=codfw |
[production] |
| 10:34 |
<hashar> |
Restarted zuul-merger on contint1002 it was listening but not processing any requests |
[releng] |
| 10:21 |
<aqu> |
backfilling with refine_event on an-launcher1002 `sudo -u analytics kerberos-run-command analytics /usr/local/bin/refine_event --ignore_failure_flag=true --since=2023-01-07T16:00:00 --until=2023-01-09T09:00:00 --verbose` |
[analytics] |
| 09:48 |
<aqu> |
killing refine_event yarn application `sudo -u analytics yarn application -kill application_1663082229270_682638` |
[analytics] |
| 09:39 |
<aqu> |
Manually kill the Spark process on an-launcher1002 `sudo -u analytics kill -9 28538` |
[analytics] |
| 09:35 |
<dcausse> |
restarting blazegraph on wdqs1006 (BlazegraphFreeAllocatorsDecreasingRapidly) |
[production] |