2023-01-12
§
|
18:35 |
<mutante> |
stat1007 - systemctl reset-failed - clears Icinga alerts |
[production] |
18:19 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on mc2040.codfw.wmnet with reason: hardware troubleshooting |
[production] |
18:18 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on mc2040.codfw.wmnet with reason: hardware troubleshooting |
[production] |
17:54 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2002.codfw.wmnet with OS bullseye |
[production] |
17:45 |
<mutante> |
powercycling mc2040 via mgmt console |
[production] |
17:34 |
<ejegg> |
civicrm rolled back from 7ecb5038 to 9afd2789 |
[production] |
17:08 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) |
[production] |
17:08 |
<btullis@cumin1001> |
Added views for new wiki: aswikiquote T321294 |
[production] |
17:05 |
<ejegg> |
civicrm upgraded from 9afd2789 to 7ecb5038 |
[production] |
16:57 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest2002.codfw.wmnet with OS bullseye |
[production] |
16:48 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: sync |
[production] |
16:48 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/thumbor: sync |
[production] |
16:47 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/thumbor: sync |
[production] |
16:43 |
<btullis@cumin1001> |
START - Cookbook sre.wikireplicas.add-wiki |
[production] |
16:34 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: sync |
[production] |
16:31 |
<zabe@deploy1002> |
Finished scap: Backport for [[gerrit:879590|Stop writing to cul_user and cul_user_text on a few wikis (T233004)]], [[gerrit:879591|Start writing to rev_comment_id on group1 wikis (T299954)]] (duration: 09m 49s) |
[production] |
16:23 |
<zabe@deploy1002> |
zabe and zabe: Backport for [[gerrit:879590|Stop writing to cul_user and cul_user_text on a few wikis (T233004)]], [[gerrit:879591|Start writing to rev_comment_id on group1 wikis (T299954)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
16:21 |
<zabe@deploy1002> |
Started scap: Backport for [[gerrit:879590|Stop writing to cul_user and cul_user_text on a few wikis (T233004)]], [[gerrit:879591|Start writing to rev_comment_id on group1 wikis (T299954)]] |
[production] |
16:14 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/thumbor: sync |
[production] |
16:08 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) |
[production] |
16:08 |
<btullis@cumin1001> |
Added views for new wiki: bjnwiktionary T312214 |
[production] |
15:47 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: service=thumbor,name=kubernetes1014.eqiad.wmnet |
[production] |
15:46 |
<hnowlan@puppetmaster1001> |
conftool action : set/weight=8; selector: service=thumbor,name=kubernetes1014.eqiad.wmnet |
[production] |
15:44 |
<btullis@cumin1001> |
START - Cookbook sre.wikireplicas.add-wiki |
[production] |
15:36 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) |
[production] |
15:36 |
<btullis@cumin1001> |
Added views for new wiki: shnwikibooks T321256 |
[production] |
15:35 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes1014.eqiad.wmnet |
[production] |
15:34 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1118.eqiad.wmnet with reason: Maintenance |
[production] |
15:34 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1118.eqiad.wmnet with reason: Maintenance |
[production] |
15:28 |
<effie> |
Planet import in codfw (on maps2009) started at 15:26 UTC - T314472 |
[production] |
15:11 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1041.eqiad.wmnet |
[production] |
15:11 |
<btullis@cumin1001> |
START - Cookbook sre.wikireplicas.add-wiki |
[production] |
15:10 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dborch1001.wikimedia.org |
[production] |
15:06 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host dborch1001.wikimedia.org |
[production] |
15:05 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host mc1041.eqiad.wmnet |
[production] |
14:58 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host moss-fe2002.codfw.wmnet |
[production] |
14:54 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
14:54 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
14:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T321391)', diff saved to https://phabricator.wikimedia.org/P43138 and previous config saved to /var/cache/conftool/dbconfig/20230112-145441-marostegui.json |
[production] |
14:51 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host moss-fe2002.codfw.wmnet |
[production] |
14:50 |
<moritzm> |
installing postgresql-11 security updates on puppetdb1002 |
[production] |
14:44 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host moss-fe1002.eqiad.wmnet |
[production] |
14:42 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) |
[production] |
14:42 |
<btullis@cumin1001> |
Added views for new wiki: guwwikiquote T321288 |
[production] |
14:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P43137 and previous config saved to /var/cache/conftool/dbconfig/20230112-143934-marostegui.json |
[production] |
14:38 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host moss-fe1002.eqiad.wmnet |
[production] |
14:37 |
<moritzm> |
installing sqlite3 security updates on buster |
[production] |
14:34 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1040.eqiad.wmnet with OS bullseye |
[production] |
14:33 |
<taavi> |
UTC afternoon backports done |
[production] |
14:28 |
<taavi@deploy1002> |
Finished scap: Backport for [[gerrit:879101|Track callers of parseRevisionParsoidHtml.]] (duration: 09m 34s) |
[production] |