2022-06-21
13:02 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be2046.codfw.wmnet [production]
13:01 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2045.codfw.wmnet [production]
12:59 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be1049.eqiad.wmnet [production]
12:57 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be1048.eqiad.wmnet [production]
12:56 <moritzm> installing haproxy security updates on stretch [production]
12:53 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be2045.codfw.wmnet [production]
12:52 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2044.codfw.wmnet [production]
12:52 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be1048.eqiad.wmnet [production]
12:50 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be1047.eqiad.wmnet [production]
12:43 <moritzm> installing python-bottle security updates [production]
12:40 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be1047.eqiad.wmnet [production]
12:39 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be2044.codfw.wmnet [production]
12:36 <Rook> T302164 #173 upgrading single user container b687ca6aedc745c63b2659124dca0dab01d38173 [paws]
12:25 <moritzm> reset logster-csp/logster-badpass-priv on mwlog1002, these were removed from Puppet [production]
12:12 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti4004.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
12:12 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4004.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
12:06 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti4004.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
12:05 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4004.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
11:59 <mbsantos> mbsantos@maps2009 imposm-removebackup-import (T305845) [production]
11:48 <Rook> 806504: Show username on 404 page when logged in | https://gerrit.wikimedia.org/r/c/analytics/quarry/web/+/806504 [quarry]
11:44 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti4004.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
11:44 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4004.ulsfo.wmnet to ganeti01.svc.ulsfo.wmnet [production]
11:43 <btullis@cumin1001> END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) restart masters for Hadoop analytics cluster: Restart of jvm daemons. [production]
11:42 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1127 for testing', diff saved to https://phabricator.wikimedia.org/P29936 and previous config saved to /var/cache/conftool/dbconfig/20220621-114232-root.json [production]
11:42 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1143 for testing', diff saved to https://phabricator.wikimedia.org/P29935 and previous config saved to /var/cache/conftool/dbconfig/20220621-114216-root.json [production]
11:41 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1111 for testing', diff saved to https://phabricator.wikimedia.org/P29934 and previous config saved to /var/cache/conftool/dbconfig/20220621-114151-root.json [production]
10:57 <volans> deleting netbox getstats.GetDeviceStats job results - T311048 [production]
10:51 <kart_> Updated cxserver to 2022-06-21-035954-production (T307970) [production]
10:49 <kartik@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
10:48 <kartik@deploy1002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
10:47 <kartik@deploy1002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
10:47 <btullis> proceeding with the hadoop.roll-restart-masters cookbook [analytics]
10:47 <btullis@cumin1001> START - Cookbook sre.hadoop.roll-restart-masters restart masters for Hadoop analytics cluster: Restart of jvm daemons. [production]
10:47 <kartik@deploy1002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
10:45 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
10:44 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
09:31 <urbanecm> 09:29:23 Synchronized wmf-config/throttle.php: 7c9f6a561b2b4b5c5db063bad83bd23e9cbac347: Add a throttle rule for a Czech course (T310885) (duration: 03m 34s) #manually logging in logmsgbot's absence [production]
09:20 <marostegui> dbmaint s8@eqiad T310011 [production]
09:13 <marostegui> dbmaint s8@codfw T310011 [production]
08:29 <marostegui> Reboot db1120 for kernel upgrade [production]
08:14 <moritzm> remove EOLed parsoid debs from releases.wikimedia.org T309765 [production]
05:54 <marostegui> Reboot db1132 and db1181 for kernel upgrade [production]
04:48 <andrewbogott> stopping nova-fullstack agent on cloudcontrol1003; it's going to page us otherwise and we're all AFK tomorrow [admin]
04:02 <andrewbogott> restarting rabbitmq on cloudcontrol100x (one at a time) [admin]
2022-06-20
23:04 <Nettrom> installed libmariadb-dev and python3.9-dev on suggestbot-02 [suggestbot]
18:01 <Nettrom> installed MariaDB on suggestbot-01, created database and user, restricted access to localhost, login test worked [suggestbot]
17:43 <Nettrom> stopped all cron jobs on suggestbot-01 [suggestbot]
16:30 <urbanecm> add sgimeno as a project member (Growth engineer with need for access) [deployment-prep]
16:30 <urbanecm> add sgimeno as a project member (Growth engineer with need for access) [releng]
15:50 <ori> On deployment-cache-{text,upload}06, ran: touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service (T310957) [releng]