2022-06-02
ยง
|
12:15 |
<joal@deploy1002> |
Started deploy [airflow-dags/analytics@19b943d]: (no justification provided) |
[production] |
12:13 |
<wm-bot2> |
created node tools-sgeweblight-10-7.tools.eqiad1.wikimedia.cloud and added it to the grid - cookbook ran by taavi@runko |
[tools] |
12:03 |
<dcaro> |
refresh prometheus certs (T308402) |
[tools] |
12:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Give more weight to db1137 in x1 to test 10.6.8 T309679 ', diff saved to https://phabricator.wikimedia.org/P29343 and previous config saved to /var/cache/conftool/dbconfig/20220602-120320-marostegui.json |
[production] |
11:47 |
<dcaro> |
refresh registry-admission-controller certs (T308402) |
[tools] |
11:44 |
<moritzm> |
installing python-pip bugfix updates from bullseye point release |
[production] |
11:42 |
<dcaro> |
refresh ingress-admission-controller certs (T308402) |
[tools] |
11:40 |
<moritzm> |
installing tasksel updates from bullseye point release |
[production] |
11:36 |
<dcaro> |
refresh volume-admission-controller certs (T308402) |
[tools] |
11:31 |
<hashar> |
Restarted Gerrit on replica gerrit2001 |
[production] |
11:26 |
<hashar> |
Restarting Jenkins on contint2001 |
[releng] |
11:24 |
<wm-bot2> |
created node tools-sgeweblight-10-6.tools.eqiad1.wikimedia.cloud and added it to the grid - cookbook ran by taavi@runko |
[tools] |
11:23 |
<moritzm> |
installing sysvinit-utils bugfix updates from last bullseye point release |
[production] |
11:19 |
<hashar> |
Restarting Jenkins on releases1002 |
[releng] |
11:17 |
<taavi> |
publish jobutils 1.44 that updates the grid default from stretch to buster T277653 |
[tools] |
11:03 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
11:03 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:51 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:51 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:40 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:28 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:28 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:16 |
<taavi> |
publish tools-webservice 0.84 that updates the grid default from stretch to buster T277653 |
[tools] |
10:14 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:14 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:02 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
10:02 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
09:56 |
<joal> |
Relaunch sqoop after having deployed a corrective patch |
[analytics] |
09:54 |
<wm-bot2> |
created node tools-sgeexec-10-14.tools.eqiad1.wikimedia.cloud and added it to the grid - cookbook ran by taavi@runko |
[tools] |
09:53 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
09:53 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
09:46 |
<joal> |
Manually mark interlaguage historical tasks failed in airflow |
[analytics] |
09:39 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
09:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |
08:54 |
<joal> |
Deploy airflow with spark3 jobs |
[analytics] |
08:54 |
<joal@deploy1002> |
Finished deploy [airflow-dags/analytics@19cd054]: (no justification provided) (duration: 00m 09s) |
[production] |
08:54 |
<joal@deploy1002> |
Started deploy [airflow-dags/analytics@19cd054]: (no justification provided) |
[production] |
08:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Give more weight to db1137 in x1 to test 10.6.8 T309679 ', diff saved to https://phabricator.wikimedia.org/P29340 and previous config saved to /var/cache/conftool/dbconfig/20220602-085357-marostegui.json |
[production] |
08:47 |
<joal> |
Merging 2 airflow spark3 jobs now that their refinery counterpart is dpeloyed |
[analytics] |
08:32 |
<jayme> |
imported scap 4.8.1 to stretch-/buster-/bullseye-wikimedia - T309116 |
[production] |
08:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Give more weight to db1137 in x1 to test 10.6.8 T309679 ', diff saved to https://phabricator.wikimedia.org/P29339 and previous config saved to /var/cache/conftool/dbconfig/20220602-082700-marostegui.json |
[production] |
08:07 |
<joal> |
Deploy refinery onto HDFS |
[analytics] |
08:03 |
<joal@deploy1002> |
Finished deploy [analytics/refinery@ef68481] (hadoop-test): Additional analytics weekly train TEST [analytics/refinery@ef68481] (duration: 07m 33s) |
[production] |
07:55 |
<joal@deploy1002> |
Started deploy [analytics/refinery@ef68481] (hadoop-test): Additional analytics weekly train TEST [analytics/refinery@ef68481] |
[production] |
07:54 |
<joal@deploy1002> |
Finished deploy [analytics/refinery@ef68481] (thin): Additional analytics weekly train THIN [analytics/refinery@ef68481] (duration: 00m 08s) |
[production] |
07:54 |
<joal@deploy1002> |
Started deploy [analytics/refinery@ef68481] (thin): Additional analytics weekly train THIN [analytics/refinery@ef68481] |
[production] |
07:51 |
<taavi> |
restart neutron-linuxbridge-agent.service on cloudvirt1034 T309732 |
[admin] |
07:51 |
<joal@deploy1002> |
Finished deploy [analytics/refinery@ef68481]: Additional analytics weekly train [analytics/refinery@ef68481] (duration: 24m 33s) |
[production] |
07:26 |
<joal> |
Deploy refinery using scap |
[analytics] |