2021-01-28
17:35 <crusnov@deploy1001> Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 [production]
17:28 <ebernhardson> ban elastic1063 from production-search-omega-eqiad and production-search-eqiad T265113 [production]
17:11 <urbanecm@deploy1001> Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 01m 06s) [production]
16:56 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy1002.eqiad.wmnet [production]
16:51 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'staging' . [production]
16:51 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' . [production]
16:49 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host deploy1002.eqiad.wmnet [production]
16:49 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . [production]
16:49 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'staging' . [production]
16:49 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy2002.codfw.wmnet [production]
16:48 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . [production]
16:48 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'production' . [production]
16:45 <elukey@cumin1001> END (FAIL) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=99) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:44 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:44 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:44 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:41 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:41 <arturo> running homer on cr*-eqiad* again for reverting latest changes (T271476) [production]
16:39 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host deploy2002.codfw.wmnet [production]
16:28 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' . [production]
16:28 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'staging' . [production]
16:28 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'plain' . [production]
16:26 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' . [production]
16:25 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'plain' . [production]
16:25 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'staging' . [production]
16:24 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:24 <akosiaris> stop scraping apertium from prometheus, it doesn't have a prometheus endpoint. [production]
16:23 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'production' . [production]
16:23 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'plain' . [production]
16:23 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'staging' . [production]
16:19 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:17 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:06 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:03 <arturo> running homer on cr*-eqiad* for T271476 [production]
15:55 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
15:54 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 [production]
15:52 <cdanis> draining traffic from Zayo OGYX/120003 codfw<>eqiad in preparation for decommission 🥃 T272675 [production]
15:49 <elukey@cumin1001> START - Cookbook sre.hadoop.stop-cluster for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 [production]
15:49 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@d0a6933]: align threshold path references across days (duration: 01m 15s) [production]
15:49 <marostegui> Power off clouddb1019 for memory replacement T272125 [production]
15:48 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@d0a6933]: align threshold path references across days [production]
15:25 <otto@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Migrate NavigationTiming schemas to Event Platform on all wikis - T271208 (duration: 01m 11s) [production]
15:06 <jayme@deploy1001> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
15:05 <jayme@deploy1001> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
14:26 <jayme@deploy1001> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
14:14 <jayme@deploy1001> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
14:14 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1148 after kernel upgrade and enablement of report_host', diff saved to https://phabricator.wikimedia.org/P14039 and previous config saved to /var/cache/conftool/dbconfig/20210128-141425-marostegui.json [production]
13:57 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1148 for kernel upgrade and enablement of report_host', diff saved to https://phabricator.wikimedia.org/P14038 and previous config saved to /var/cache/conftool/dbconfig/20210128-135730-marostegui.json [production]
13:56 <marostegui@cumin1001> dbctl commit (dc=all): 'db1146:3314 (re)pooling @ 100%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14037 and previous config saved to /var/cache/conftool/dbconfig/20210128-135612-root.json [production]
13:56 <marostegui@cumin1001> dbctl commit (dc=all): 'db1146:3312 (re)pooling @ 100%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14036 and previous config saved to /var/cache/conftool/dbconfig/20210128-135602-root.json [production]