401-450 of 10000 results (74ms)
2024-02-01 ยง
11:02 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
11:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56053 and previous config saved to /var/cache/conftool/dbconfig/20240201-110252-marostegui.json [production]
10:54 <phuedx@deploy2002> Finished deploy [analytics/refinery@0d8e976] (hadoop-test): Remove trvwikisource from scoop list (duration: 03m 30s) [production]
10:52 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: analytics_cluster::hadoop::yarn [production]
10:51 <phuedx@deploy2002> Started deploy [analytics/refinery@0d8e976] (hadoop-test): Remove trvwikisource from scoop list [production]
10:50 <phuedx@deploy2002> Finished deploy [analytics/refinery@0d8e976] (thin): Remove trvwikisource from scoop list (duration: 00m 05s) [production]
10:50 <phuedx@deploy2002> Started deploy [analytics/refinery@0d8e976] (thin): Remove trvwikisource from scoop list [production]
10:49 <phuedx@deploy2002> Finished deploy [analytics/refinery@0d8e976]: analytics/refinery: Remove trvwikisource from scoop list (duration: 10m 20s) [production]
10:47 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P56052 and previous config saved to /var/cache/conftool/dbconfig/20240201-104746-marostegui.json [production]
10:39 <phuedx@deploy2002> Started deploy [analytics/refinery@0d8e976]: analytics/refinery: Remove trvwikisource from scoop list [production]
10:32 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P56051 and previous config saved to /var/cache/conftool/dbconfig/20240201-103239-marostegui.json [production]
10:32 <moritzm> installing openjdk-11 security updates [production]
10:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56049 and previous config saved to /var/cache/conftool/dbconfig/20240201-101733-marostegui.json [production]
10:11 <hashar> Restarting CI Jenkins on contint2002 [production]
10:10 <btullis@deploy2002> Finished deploy [analytics/superset/deploy@26c0d49]: (no justification provided) (duration: 00m 59s) [production]
10:09 <btullis@deploy2002> Started deploy [analytics/superset/deploy@26c0d49]: (no justification provided) [production]
10:01 <klausman@cumin2002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:51 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56048 and previous config saved to /var/cache/conftool/dbconfig/20240201-095150-marostegui.json [production]
09:51 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2136.codfw.wmnet with reason: Maintenance [production]
09:51 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db2136.codfw.wmnet with reason: Maintenance [production]
09:51 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56047 and previous config saved to /var/cache/conftool/dbconfig/20240201-095128-marostegui.json [production]
09:49 <joal@deploy2002> Finished deploy [airflow-dags/analytics@6b84b7a]: (no justification provided) (duration: 00m 28s) [production]
09:49 <joal@deploy2002> Started deploy [airflow-dags/analytics@6b84b7a]: (no justification provided) [production]
09:43 <klausman@cumin2002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:43 <klausman@cumin2002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:36 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P56046 and previous config saved to /var/cache/conftool/dbconfig/20240201-093621-marostegui.json [production]
09:30 <vgutierrez@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet [production]
09:26 <vgutierrez@cumin2002> START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet [production]
09:25 <klausman@cumin2002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:24 <vgutierrez@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1002.eqiad.wmnet [production]
09:21 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P56045 and previous config saved to /var/cache/conftool/dbconfig/20240201-092115-marostegui.json [production]
09:20 <vgutierrez@cumin2002> START - Cookbook sre.hosts.reboot-single for host acmechief1002.eqiad.wmnet [production]
09:18 <vgutierrez@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief2001.codfw.wmnet [production]
09:14 <vgutierrez@cumin2002> START - Cookbook sre.hosts.reboot-single for host acmechief2001.codfw.wmnet [production]
09:12 <vgutierrez@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief2002.codfw.wmnet [production]
09:08 <vgutierrez@cumin2002> START - Cookbook sre.hosts.reboot-single for host acmechief2002.codfw.wmnet [production]
09:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56044 and previous config saved to /var/cache/conftool/dbconfig/20240201-090607-marostegui.json [production]
08:57 <marostegui@cumin1002> dbctl commit (dc=all): 'db2104 (re)pooling @ 100%: After switchover', diff saved to https://phabricator.wikimedia.org/P56043 and previous config saved to /var/cache/conftool/dbconfig/20240201-085743-root.json [production]
08:52 <hashar> Restarted primary Gerrit on gerrit1003 [production]
08:44 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test2001.codfw.wmnet [production]
08:42 <marostegui@cumin1002> dbctl commit (dc=all): 'db2104 (re)pooling @ 75%: After switchover', diff saved to https://phabricator.wikimedia.org/P56042 and previous config saved to /var/cache/conftool/dbconfig/20240201-084238-root.json [production]
08:42 <hashar> Restarting Gerrit replica on gerrit2002 [production]
08:41 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56041 and previous config saved to /var/cache/conftool/dbconfig/20240201-084126-marostegui.json [production]
08:41 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2119.codfw.wmnet with reason: Maintenance [production]
08:41 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db2119.codfw.wmnet with reason: Maintenance [production]
08:41 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2110 (T355609)', diff saved to https://phabricator.wikimedia.org/P56040 and previous config saved to /var/cache/conftool/dbconfig/20240201-084104-marostegui.json [production]
08:40 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host acmechief-test2001.codfw.wmnet [production]
08:33 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test1001.eqiad.wmnet [production]
08:27 <marostegui@cumin1002> dbctl commit (dc=all): 'db2104 (re)pooling @ 50%: After switchover', diff saved to https://phabricator.wikimedia.org/P56039 and previous config saved to /var/cache/conftool/dbconfig/20240201-082733-root.json [production]
08:26 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host acmechief-test1001.eqiad.wmnet [production]