401-450 of 10000 results (23ms)
2020-09-18 ยง
18:52 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:46 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
18:44 <ryankemper> `sudo kill 254017 254018 254028 254029` to kill some dangling serdi / gzip processes, all the wikidata cleanup should be complete [production]
18:38 <ryankemper> `sudo kill 126121 126122 126124 126128 249520 249521 254016 254027` on `snapshot1008` to terminate wikidata dump jobs that are in a bad state [production]
18:10 <ryankemper> Removed stale `wikidatardf-dumps` crontab entry from `dumpsgen@snapshot1008`, stored backup of previous state of crontab in the (admittedly verbose) `/tmp/dumpsgen_crontab_before_removing_stale_wikidata_dump_entry_see_gerrit_puppet_patch_622342` [production]
17:15 <mutante> lists1001 - apt-get install pwgen to generate passwords (this was installed on previous list server but apparently not puppetized, puppet patch coming up) [production]
16:23 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
16:21 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime [production]
15:09 <mutante> restarting gerrit service to apply gerrit::628338 to make it dump heap if out of memory (T263008) [production]
14:15 <ladsgroup@deploy1001> Synchronized wmf-config/Wikibase.php: labs: Turn on termbox v2 on desktop for wikidatawiki -- noop for production, sanity sync (T261488) (duration: 00m 56s) [production]
14:13 <ladsgroup@deploy1001> Synchronized wmf-config/InitialiseSettings.php: labs: Turn on termbox v2 on desktop for wikidatawiki -- noop for production, sanity sync (T261488) (duration: 01m 00s) [production]
13:02 <kormat@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:00 <kormat@cumin2001> START - Cookbook sre.hosts.downtime [production]
12:48 <cdanis@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=swift,name=eqiad [production]
12:41 <kormat> reimaging db2125 T263244 [production]
12:39 <kormat@cumin1001> dbctl commit (dc=all): 'db2089:3316 (re)pooling @ 100%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12665 and previous config saved to /var/cache/conftool/dbconfig/20200918-123947-kormat.json [production]
12:24 <kormat@cumin1001> dbctl commit (dc=all): 'db2089:3316 (re)pooling @ 75%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12664 and previous config saved to /var/cache/conftool/dbconfig/20200918-122444-kormat.json [production]
12:09 <kormat@cumin1001> dbctl commit (dc=all): 'db2089:3316 (re)pooling @ 50%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12663 and previous config saved to /var/cache/conftool/dbconfig/20200918-120940-kormat.json [production]
11:54 <kormat@cumin1001> dbctl commit (dc=all): 'db2089:3316 (re)pooling @ 25%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12662 and previous config saved to /var/cache/conftool/dbconfig/20200918-115437-kormat.json [production]
11:35 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2125', diff saved to https://phabricator.wikimedia.org/P12661 and previous config saved to /var/cache/conftool/dbconfig/20200918-113509-marostegui.json [production]
11:15 <kormat@cumin1001> dbctl commit (dc=all): 'db2089:3316 depooling: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12660 and previous config saved to /var/cache/conftool/dbconfig/20200918-111529-kormat.json [production]
10:56 <kormat@cumin1001> dbctl commit (dc=all): 'db2087:3316 (re)pooling @ 100%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12659 and previous config saved to /var/cache/conftool/dbconfig/20200918-105645-kormat.json [production]
10:45 <jiji@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
10:41 <kormat@cumin1001> dbctl commit (dc=all): 'db2087:3316 (re)pooling @ 75%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12658 and previous config saved to /var/cache/conftool/dbconfig/20200918-104141-kormat.json [production]
10:35 <jiji@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
10:34 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
10:31 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
10:28 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
10:26 <kormat@cumin1001> dbctl commit (dc=all): 'db2087:3316 (re)pooling @ 50%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12657 and previous config saved to /var/cache/conftool/dbconfig/20200918-102638-kormat.json [production]
10:11 <kormat@cumin1001> dbctl commit (dc=all): 'db2087:3316 (re)pooling @ 25%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12656 and previous config saved to /var/cache/conftool/dbconfig/20200918-101135-kormat.json [production]
09:55 <kormat@cumin1001> dbctl commit (dc=all): 'db2087:3316 depooling: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12655 and previous config saved to /var/cache/conftool/dbconfig/20200918-095554-kormat.json [production]
09:55 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:55 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:47 <twentyafterfour> deployed hotfix for T263063 to phab1001 [production]
09:47 <jayme> deleting some random pods in kubernetes staging to rebalance load back on kubestage1001 - T262527 [production]
09:46 <jayme> uncordoned kubestage1001 - T262527 [production]
09:46 <kormat@cumin1001> dbctl commit (dc=all): 'db2124 (re)pooling @ 100%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12654 and previous config saved to /var/cache/conftool/dbconfig/20200918-094608-kormat.json [production]
09:31 <kormat@cumin1001> dbctl commit (dc=all): 'db2124 (re)pooling @ 80%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12653 and previous config saved to /var/cache/conftool/dbconfig/20200918-093105-kormat.json [production]
09:24 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:22 <klausman@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:16 <kormat@cumin1001> dbctl commit (dc=all): 'db2124 (re)pooling @ 60%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12652 and previous config saved to /var/cache/conftool/dbconfig/20200918-091601-kormat.json [production]
09:00 <kormat@cumin1001> dbctl commit (dc=all): 'db2124 (re)pooling @ 40%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12651 and previous config saved to /var/cache/conftool/dbconfig/20200918-090058-kormat.json [production]
09:00 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) [production]
08:56 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single [production]
08:56 <jayme> reboot kubestage1001 for clean state - T262527 [production]
08:54 <elukey> change analytics-in4/in6 filters on cr1/cr2 after https://gerrit.wikimedia.org/r/628300 [production]
08:47 <jayme@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) [production]
08:45 <kormat@cumin1001> dbctl commit (dc=all): 'db2124 (re)pooling @ 20%: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12650 and previous config saved to /var/cache/conftool/dbconfig/20200918-084554-kormat.json [production]
08:43 <jayme@cumin1001> START - Cookbook sre.hosts.reboot-single [production]
08:43 <jayme> reboot kubestage1001 for kernel upgrade - T262527 [production]