1201-1250 of 10000 results (65ms)
2024-02-01 ยง
12:30 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
12:29 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1162.eqiad.wmnet with reason: Maintenance [production]
12:29 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1162.eqiad.wmnet with reason: Maintenance [production]
12:24 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp1001.eqiad.wmnet [production]
12:21 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zookeeper-test1002.eqiad.wmnet [production]
12:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2137:3314', diff saved to https://phabricator.wikimedia.org/P56057 and previous config saved to /var/cache/conftool/dbconfig/20240201-121853-marostegui.json [production]
12:18 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host arclamp1001.eqiad.wmnet [production]
12:17 <btullis@cumin1002> START - Cookbook sre.hosts.reboot-single for host zookeeper-test1002.eqiad.wmnet [production]
12:15 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp2001.codfw.wmnet [production]
12:09 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host arclamp2001.codfw.wmnet [production]
12:04 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2005.codfw.wmnet [production]
12:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2137:3314 (T355609)', diff saved to https://phabricator.wikimedia.org/P56056 and previous config saved to /var/cache/conftool/dbconfig/20240201-120346-marostegui.json [production]
12:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host testvm2005.codfw.wmnet [production]
11:22 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
11:21 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
11:13 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-tool1008.eqiad.wmnet [production]
11:09 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host an-tool1008.eqiad.wmnet [production]
11:07 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: analytics_cluster::hadoop::yarn [production]
11:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2137:3314 (T355609)', diff saved to https://phabricator.wikimedia.org/P56054 and previous config saved to /var/cache/conftool/dbconfig/20240201-110315-marostegui.json [production]
11:03 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
11:02 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
11:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56053 and previous config saved to /var/cache/conftool/dbconfig/20240201-110252-marostegui.json [production]
10:54 <phuedx@deploy2002> Finished deploy [analytics/refinery@0d8e976] (hadoop-test): Remove trvwikisource from scoop list (duration: 03m 30s) [production]
10:52 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: analytics_cluster::hadoop::yarn [production]
10:51 <phuedx@deploy2002> Started deploy [analytics/refinery@0d8e976] (hadoop-test): Remove trvwikisource from scoop list [production]
10:50 <phuedx@deploy2002> Finished deploy [analytics/refinery@0d8e976] (thin): Remove trvwikisource from scoop list (duration: 00m 05s) [production]
10:50 <phuedx@deploy2002> Started deploy [analytics/refinery@0d8e976] (thin): Remove trvwikisource from scoop list [production]
10:49 <phuedx@deploy2002> Finished deploy [analytics/refinery@0d8e976]: analytics/refinery: Remove trvwikisource from scoop list (duration: 10m 20s) [production]
10:47 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P56052 and previous config saved to /var/cache/conftool/dbconfig/20240201-104746-marostegui.json [production]
10:39 <phuedx@deploy2002> Started deploy [analytics/refinery@0d8e976]: analytics/refinery: Remove trvwikisource from scoop list [production]
10:32 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P56051 and previous config saved to /var/cache/conftool/dbconfig/20240201-103239-marostegui.json [production]
10:32 <moritzm> installing openjdk-11 security updates [production]
10:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56049 and previous config saved to /var/cache/conftool/dbconfig/20240201-101733-marostegui.json [production]
10:11 <hashar> Restarting CI Jenkins on contint2002 [production]
10:10 <btullis@deploy2002> Finished deploy [analytics/superset/deploy@26c0d49]: (no justification provided) (duration: 00m 59s) [production]
10:09 <btullis@deploy2002> Started deploy [analytics/superset/deploy@26c0d49]: (no justification provided) [production]
10:01 <klausman@cumin2002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:51 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2136 (T355609)', diff saved to https://phabricator.wikimedia.org/P56048 and previous config saved to /var/cache/conftool/dbconfig/20240201-095150-marostegui.json [production]
09:51 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2136.codfw.wmnet with reason: Maintenance [production]
09:51 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db2136.codfw.wmnet with reason: Maintenance [production]
09:51 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2119 (T355609)', diff saved to https://phabricator.wikimedia.org/P56047 and previous config saved to /var/cache/conftool/dbconfig/20240201-095128-marostegui.json [production]
09:49 <joal@deploy2002> Finished deploy [airflow-dags/analytics@6b84b7a]: (no justification provided) (duration: 00m 28s) [production]
09:49 <joal@deploy2002> Started deploy [airflow-dags/analytics@6b84b7a]: (no justification provided) [production]
09:43 <klausman@cumin2002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:43 <klausman@cumin2002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:36 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P56046 and previous config saved to /var/cache/conftool/dbconfig/20240201-093621-marostegui.json [production]
09:30 <vgutierrez@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet [production]
09:26 <vgutierrez@cumin2002> START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet [production]
09:25 <klausman@cumin2002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: JRE update for DSA 5604 - klausman@cumin2002 [production]
09:24 <vgutierrez@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1002.eqiad.wmnet [production]