2021-12-02
|
02:50 |
<andrew@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1028.eqiad.wmnet with OS buster |
[production] |
02:43 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1028.eqiad.wmnet with OS buster |
[production] |
02:40 |
<andrew@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1028.eqiad.wmnet with OS buster |
[production] |
02:15 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1028.eqiad.wmnet with OS buster |
[production] |
02:14 |
<andrew@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1028.eqiad.wmnet with OS buster |
[production] |
01:52 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1028.eqiad.wmnet with OS buster |
[production] |
01:21 |
<ryankemper> |
T280001 Rolling restart of low-traffic pybal hosts complete. All of `wcqs` is pooled and the pybal / ipvs related alerts have cleared |
[production] |
01:16 |
<ryankemper> |
T280001 Pooled `wcqs200[1-3]` (had been left unpooled from when we last removed wcqs from production) |
[production] |
01:12 |
<ryankemper> |
T280001 Restarting pybal on low-traffic primaries `lvs2009` and `lvs1015`: `ryankemper@cumin1001:~$ sudo cumin 'P{lvs2009*,lvs1015*}' 'sudo systemctl restart pybal'` |
[production] |
01:11 |
<ryankemper> |
T280001 Waited 120s and checked https://icinga.wikimedia.org/alerts, proceeding to primary low-traffic hosts `lvs2009` and `lvs1015` |
[production] |
01:08 |
<ryankemper> |
T280001 Sanity check of `sudo ipvsadm -L -n` on backup `lvs2010` and `lvs1016` looks good (for ex `lvs1016` has `TCP 10.2.2.67:443 wrr`) |
[production] |
01:07 |
<ryankemper> |
T280001 Restarting pybal on low-traffic backups: `ryankemper@cumin1001:~$ sudo cumin 'P{lvs2010*,lvs1016*}' 'sudo systemctl restart pybal'` |
[production] |
01:02 |
<ryankemper> |
T280001 `ryankemper@cumin1001:~$ sudo cumin 'O:lvs::balancer' 'sudo run-puppet-agent'` |
[production] |
01:01 |
<ryankemper> |
T280001 Merged https://gerrit.wikimedia.org/r/c/operations/puppet/+/742841 |
[production] |
01:00 |
<ryankemper> |
T280001 About to merge https://gerrit.wikimedia.org/r/c/operations/puppet/+/742841 to bring `wcqs` into state `lvs_setup`, after which I'll perform a rolling restart of pybal |
[production] |
00:24 |
<urbanecm@deploy1002> |
Synchronized php-1.38.0-wmf.9/skins/Vector/: a7586cd4a2559248ea1fd29cf74de535de016501: Update scroll observer to allow event logging (T292586) (duration: 00m 57s) |
[production] |
2021-12-01
|
22:15 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 07s) |
[production] |
22:15 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
22:13 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 07s) |
[production] |
22:13 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
22:12 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 07s) |
[production] |
22:12 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
22:12 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 01m 23s) |
[production] |
22:11 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
22:10 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 03s) |
[production] |
22:10 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
22:10 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 03s) |
[production] |
22:10 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
22:09 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 03s) |
[production] |
22:09 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
21:12 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 03s) |
[production] |
21:12 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
21:11 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 16s) |
[production] |
21:10 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
21:10 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257]: (no justification provided) |
[production] |
21:09 |
<razzi@deploy1002> |
Finished deploy [analytics/refinery@3b1b794]: Regular analytics weekly train [analytics/refinery@3b1b794] (duration: 21m 18s) |
[production] |
21:06 |
<jynus> |
installing python-monotonic on ms-fe2011, ms-fe2012 (breaks swift-proxy) |
[production] |
21:02 |
<jynus> |
installing python-monotonic on ms-fe2010 |
[production] |
20:48 |
<razzi@deploy1002> |
Started deploy [analytics/refinery@3b1b794]: Regular analytics weekly train [analytics/refinery@3b1b794] |
[production] |
20:13 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:09 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
19:46 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) (duration: 00m 03s) |
[production] |
19:46 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@2f59257] (hadoop-test): (no justification provided) |
[production] |
19:30 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@bea2abe] (hadoop-test): (no justification provided) (duration: 00m 22s) |
[production] |
19:30 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@bea2abe] (hadoop-test): (no justification provided) |
[production] |
19:27 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@bea2abe] (hadoop-test): (no justification provided) (duration: 02m 26s) |
[production] |
19:25 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@bea2abe] (hadoop-test): (no justification provided) |
[production] |
19:24 |
<otto@deploy1002> |
Finished deploy [airflow-dags/analytics@bea2abe] (hadoop-test): (no justification provided) (duration: 00m 03s) |
[production] |
19:24 |
<otto@deploy1002> |
Started deploy [airflow-dags/analytics@bea2abe] (hadoop-test): (no justification provided) |
[production] |