2022-10-05
§
|
14:33 |
<mforns> |
finished refinery deploy - regular weekly train |
[analytics] |
14:30 |
<papaul> |
ongoing maintenance on msw1-eqiad |
[production] |
14:28 |
<arturo> |
adding cloudinstances2b-gw router to l3 agents on cloudnet1005/1006 (T316284) |
[admin] |
14:26 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1032.eqiad.wmnet with OS bullseye |
[production] |
14:20 |
<mforns@deploy1002> |
Finished deploy [analytics/refinery@7e16d2a] (thin): Regular analytics weekly train THIN [analytics/refinery@7e16d2a] (duration: 04m 24s) |
[production] |
14:16 |
<mforns@deploy1002> |
Started deploy [analytics/refinery@7e16d2a] (thin): Regular analytics weekly train THIN [analytics/refinery@7e16d2a] |
[production] |
14:16 |
<mforns@deploy1002> |
Finished deploy [analytics/refinery@7e16d2a]: Regular analytics weekly train [analytics/refinery@7e16d2a] (duration: 10m 27s) |
[production] |
14:15 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudnet1006.eqiad.wmnet with OS bullseye |
[production] |
14:11 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti1032.eqiad.wmnet with reason: host reimage |
[production] |
14:08 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1032.eqiad.wmnet with reason: host reimage |
[production] |
14:07 |
<elukey@deploy1002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
14:06 |
<elukey@deploy1002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
14:05 |
<mforns@deploy1002> |
Started deploy [analytics/refinery@7e16d2a]: Regular analytics weekly train [analytics/refinery@7e16d2a] |
[production] |
14:05 |
<mforns> |
starting refinery deploy - regular weekly train |
[analytics] |
13:55 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1032.eqiad.wmnet with OS bullseye |
[production] |
13:49 |
<SandraEbele> |
Started Airflow projectview_geo job |
[analytics] |
13:48 |
<SandraEbele> |
killed Oozie projectview-geo-coord job |
[analytics] |
13:37 |
<ebysans@deploy1002> |
Finished deploy [airflow-dags/analytics@f7a68c2]: (no justification provided) (duration: 00m 12s) |
[production] |
13:36 |
<ebysans@deploy1002> |
Started deploy [airflow-dags/analytics@f7a68c2]: (no justification provided) |
[production] |
13:27 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:26 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
13:26 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:25 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
13:22 |
<SandraEbele> |
deploying fix for projectview dags on airflow |
[production] |
13:21 |
<SandraEbele> |
deploying fix for projectview dags on airflow. |
[analytics] |
13:21 |
<hoo@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Disable UnconnectedPagePagePropMigrationLegacyFormat for enwiktionary/frwiki (duration: 03m 38s) |
[production] |
13:20 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:19 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
13:19 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:18 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
13:18 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1031.eqiad.wmnet with OS bullseye |
[production] |
13:11 |
<wm-bot2> |
Added 1 new OSDs ['cloudcephosd1034.eqiad.wmnet'] (T314870) - cookbook ran by fran@wmf3169 |
[admin] |
13:11 |
<wm-bot2> |
Added OSD cloudcephosd1034.eqiad.wmnet... (1/1) (T314870) - cookbook ran by fran@wmf3169 |
[admin] |
13:07 |
<moritzm> |
draining ganeti1012 T311687 |
[production] |
13:04 |
<hoo> |
Running extensions/Wikibase/client/maintenance/populateUnexpectedUnconnectedPagePageProp.php for zhwiki |
[production] |
13:03 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti1031.eqiad.wmnet with reason: host reimage |
[production] |
13:02 |
<wm-bot2> |
Finished rebooting node cloudcephosd1034.eqiad.wmnet (T314870) - cookbook ran by fran@wmf3169 |
[admin] |
13:00 |
<vgutierrez> |
test HAProxy 2.4.19 in cp4026 && cp4032 |
[production] |
12:59 |
<vgutierrez> |
vgutierrez@apt1001:~$ sudo -i reprepro --component thirdparty/haproxy24 update buster-wikimedia # fetch HAProxy 2.4.19 |
[production] |
12:59 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1031.eqiad.wmnet with reason: host reimage |
[production] |
12:58 |
<wm-bot2> |
Rebooting node cloudcephosd1034.eqiad.wmnet (T314870) - cookbook ran by fran@wmf3169 |
[admin] |
12:58 |
<wm-bot2> |
Adding OSD cloudcephosd1034.eqiad.wmnet... (1/1) (T314870) - cookbook ran by fran@wmf3169 |
[admin] |
12:58 |
<wm-bot2> |
Adding new OSDs ['cloudcephosd1034.eqiad.wmnet'] to the cluster (T314870) - cookbook ran by fran@wmf3169 |
[admin] |
12:48 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet1006.eqiad.wmnet with reason: host reimage |
[production] |
12:47 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1031.eqiad.wmnet with OS bullseye |
[production] |
12:46 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1031.eqiad.wmnet with OS bullseye |
[production] |
12:46 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1031.eqiad.wmnet with OS bullseye |
[production] |
12:45 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet1006.eqiad.wmnet with reason: host reimage |
[production] |
12:41 |
<hoo> |
Running extensions/Wikibase/client/maintenance/populateUnexpectedUnconnectedPagePageProp.php for enwiki |
[production] |
12:33 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |