2024-02-01
ยง
|
11:02 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db2137.codfw.wmnet with reason: Maintenance |
[production] |
11:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56053 and previous config saved to /var/cache/conftool/dbconfig/20240201-110252-marostegui.json |
[production] |
10:54 |
<phuedx@deploy2002> |
Finished deploy [analytics/refinery@0d8e976] (hadoop-test): Remove trvwikisource from scoop list (duration: 03m 30s) |
[production] |
10:52 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: analytics_cluster::hadoop::yarn |
[production] |
10:51 |
<phuedx@deploy2002> |
Started deploy [analytics/refinery@0d8e976] (hadoop-test): Remove trvwikisource from scoop list |
[production] |
10:50 |
<phuedx@deploy2002> |
Finished deploy [analytics/refinery@0d8e976] (thin): Remove trvwikisource from scoop list (duration: 00m 05s) |
[production] |
10:50 |
<phuedx@deploy2002> |
Started deploy [analytics/refinery@0d8e976] (thin): Remove trvwikisource from scoop list |
[production] |
10:49 |
<phuedx@deploy2002> |
Finished deploy [analytics/refinery@0d8e976]: analytics/refinery: Remove trvwikisource from scoop list (duration: 10m 20s) |
[production] |
10:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P56052 and previous config saved to /var/cache/conftool/dbconfig/20240201-104746-marostegui.json |
[production] |
10:39 |
<phuedx@deploy2002> |
Started deploy [analytics/refinery@0d8e976]: analytics/refinery: Remove trvwikisource from scoop list |
[production] |
10:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P56051 and previous config saved to /var/cache/conftool/dbconfig/20240201-103239-marostegui.json |
[production] |
10:32 |
<moritzm> |
installing openjdk-11 security updates |
[production] |
10:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56049 and previous config saved to /var/cache/conftool/dbconfig/20240201-101733-marostegui.json |
[production] |
10:11 |
<hashar> |
Restarting CI Jenkins on contint2002 |
[production] |
10:10 |
<btullis@deploy2002> |
Finished deploy [analytics/superset/deploy@26c0d49]: (no justification provided) (duration: 00m 59s) |
[production] |
10:09 |
<btullis@deploy2002> |
Started deploy [analytics/superset/deploy@26c0d49]: (no justification provided) |
[production] |
10:01 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: JRE update for DSA 5604 - klausman@cumin2002 |
[production] |
09:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56048 and previous config saved to /var/cache/conftool/dbconfig/20240201-095150-marostegui.json |
[production] |
09:51 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2136.codfw.wmnet with reason: Maintenance |
[production] |
09:51 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db2136.codfw.wmnet with reason: Maintenance |
[production] |
09:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56047 and previous config saved to /var/cache/conftool/dbconfig/20240201-095128-marostegui.json |
[production] |
09:49 |
<joal@deploy2002> |
Finished deploy [airflow-dags/analytics@6b84b7a]: (no justification provided) (duration: 00m 28s) |
[production] |
09:49 |
<joal@deploy2002> |
Started deploy [airflow-dags/analytics@6b84b7a]: (no justification provided) |
[production] |
09:43 |
<klausman@cumin2002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: JRE update for DSA 5604 - klausman@cumin2002 |
[production] |
09:43 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: JRE update for DSA 5604 - klausman@cumin2002 |
[production] |
09:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P56046 and previous config saved to /var/cache/conftool/dbconfig/20240201-093621-marostegui.json |
[production] |
09:30 |
<vgutierrez@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet |
[production] |
09:26 |
<vgutierrez@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet |
[production] |
09:25 |
<klausman@cumin2002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: JRE update for DSA 5604 - klausman@cumin2002 |
[production] |
09:24 |
<vgutierrez@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1002.eqiad.wmnet |
[production] |
09:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P56045 and previous config saved to /var/cache/conftool/dbconfig/20240201-092115-marostegui.json |
[production] |
09:20 |
<vgutierrez@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host acmechief1002.eqiad.wmnet |
[production] |
09:18 |
<vgutierrez@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief2001.codfw.wmnet |
[production] |
09:14 |
<vgutierrez@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host acmechief2001.codfw.wmnet |
[production] |
09:12 |
<vgutierrez@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief2002.codfw.wmnet |
[production] |
09:08 |
<vgutierrez@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host acmechief2002.codfw.wmnet |
[production] |
09:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56044 and previous config saved to /var/cache/conftool/dbconfig/20240201-090607-marostegui.json |
[production] |
08:57 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2104 (re)pooling @ 100%: After switchover', diff saved to https://phabricator.wikimedia.org/P56043 and previous config saved to /var/cache/conftool/dbconfig/20240201-085743-root.json |
[production] |
08:52 |
<hashar> |
Restarted primary Gerrit on gerrit1003 |
[production] |
08:44 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test2001.codfw.wmnet |
[production] |
08:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2104 (re)pooling @ 75%: After switchover', diff saved to https://phabricator.wikimedia.org/P56042 and previous config saved to /var/cache/conftool/dbconfig/20240201-084238-root.json |
[production] |
08:42 |
<hashar> |
Restarting Gerrit replica on gerrit2002 |
[production] |
08:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56041 and previous config saved to /var/cache/conftool/dbconfig/20240201-084126-marostegui.json |
[production] |
08:41 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2119.codfw.wmnet with reason: Maintenance |
[production] |
08:41 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db2119.codfw.wmnet with reason: Maintenance |
[production] |
08:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2110 (T355609)', diff saved to https://phabricator.wikimedia.org/P56040 and previous config saved to /var/cache/conftool/dbconfig/20240201-084104-marostegui.json |
[production] |
08:40 |
<vgutierrez@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host acmechief-test2001.codfw.wmnet |
[production] |
08:33 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test1001.eqiad.wmnet |
[production] |
08:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2104 (re)pooling @ 50%: After switchover', diff saved to https://phabricator.wikimedia.org/P56039 and previous config saved to /var/cache/conftool/dbconfig/20240201-082733-root.json |
[production] |
08:26 |
<vgutierrez@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host acmechief-test1001.eqiad.wmnet |
[production] |