2023-05-03
ยง
|
10:33 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host kubestagemaster2001.codfw.wmnet |
[production] |
10:32 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-fe1004.eqiad.wmnet |
[production] |
10:27 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2117 (T335838)', diff saved to https://phabricator.wikimedia.org/P47335 and previous config saved to /var/cache/conftool/dbconfig/20230503-102719-ladsgroup.json |
[production] |
10:27 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2117.codfw.wmnet with reason: Maintenance |
[production] |
10:27 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2117.codfw.wmnet with reason: Maintenance |
[production] |
10:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2114 (T335838)', diff saved to https://phabricator.wikimedia.org/P47334 and previous config saved to /var/cache/conftool/dbconfig/20230503-102654-ladsgroup.json |
[production] |
10:25 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab2003.wikimedia.org |
[production] |
10:21 |
<akosiaris@deploy1002> |
helmfile [staging] DONE helmfile.d/services/machinetranslation: apply |
[production] |
10:20 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2124 (re)pooling @ 10%: Repooling after migrating', diff saved to https://phabricator.wikimedia.org/P47333 and previous config saved to /var/cache/conftool/dbconfig/20230503-102018-root.json |
[production] |
10:19 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host gitlab2003.wikimedia.org |
[production] |
10:19 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2111', diff saved to https://phabricator.wikimedia.org/P47332 and previous config saved to /var/cache/conftool/dbconfig/20230503-101926-ladsgroup.json |
[production] |
10:18 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling update on A:netbox |
[production] |
10:18 |
<ayounsi@cumin1001> |
START - Cookbook sre.netbox.update-extras rolling update on A:netbox |
[production] |
10:18 |
<lucaswerkmeister-wmde@deploy1002> |
lucaswerkmeister-wmde and migr: Backport for [[gerrit:914297|wblistentityusage: Deprecate wbeu prefix, new output format (T300460 T196962)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
10:18 |
<jelto@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host gitlab2003.wikimedia.org |
[production] |
10:17 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp2001.codfw.wmnet |
[production] |
10:16 |
<eoghan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on aphlict1001.eqiad.wmnet with reason: aphlict1002 is now active |
[production] |
10:16 |
<eoghan@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on aphlict1001.eqiad.wmnet with reason: aphlict1002 is now active |
[production] |
10:13 |
<akosiaris@deploy1002> |
helmfile [staging] START helmfile.d/services/machinetranslation: apply |
[production] |
10:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2114', diff saved to https://phabricator.wikimedia.org/P47331 and previous config saved to /var/cache/conftool/dbconfig/20230503-101147-ladsgroup.json |
[production] |
10:10 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host arclamp2001.codfw.wmnet |
[production] |
10:10 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf2003.codfw.wmnet |
[production] |
10:09 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:wikikube-staging-worker-eqiad |
[production] |
10:09 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp1001.eqiad.wmnet |
[production] |
10:07 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite2004.codfw.wmnet |
[production] |
10:07 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host webperf2003.codfw.wmnet |
[production] |
10:07 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf1003.eqiad.wmnet |
[production] |
10:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2124 (re)pooling @ 5%: Repooling after migrating', diff saved to https://phabricator.wikimedia.org/P47330 and previous config saved to /var/cache/conftool/dbconfig/20230503-100513-root.json |
[production] |
10:04 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2111 (T335838)', diff saved to https://phabricator.wikimedia.org/P47329 and previous config saved to /var/cache/conftool/dbconfig/20230503-100420-ladsgroup.json |
[production] |
10:03 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host webperf1003.eqiad.wmnet |
[production] |
10:02 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host arclamp1001.eqiad.wmnet |
[production] |
10:00 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host graphite2004.codfw.wmnet |
[production] |
10:00 |
<lucaswerkmeister-wmde@deploy1002> |
Started scap: Backport for [[gerrit:914297|wblistentityusage: Deprecate wbeu prefix, new output format (T300460 T196962)]] |
[production] |
09:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2111 (T335838)', diff saved to https://phabricator.wikimedia.org/P47328 and previous config saved to /var/cache/conftool/dbconfig/20230503-095901-ladsgroup.json |
[production] |
09:58 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2111.codfw.wmnet with reason: Maintenance |
[production] |
09:58 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2111.codfw.wmnet with reason: Maintenance |
[production] |
09:56 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2114', diff saved to https://phabricator.wikimedia.org/P47327 and previous config saved to /var/cache/conftool/dbconfig/20230503-095641-ladsgroup.json |
[production] |
09:55 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance |
[production] |
09:55 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance |
[production] |
09:53 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1217.eqiad.wmnet with reason: Cloning db1110 from db1217:3323 T335092 |
[production] |
09:53 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1217.eqiad.wmnet with reason: Cloning db1110 from db1217:3323 T335092 |
[production] |
09:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2124 (re)pooling @ 3%: Repooling after migrating', diff saved to https://phabricator.wikimedia.org/P47325 and previous config saved to /var/cache/conftool/dbconfig/20230503-095008-root.json |
[production] |
09:49 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:914436|Personalized praise: Run convertNumber() before displaying numbers (T322443)]], [[gerrit:914435|Personalized praise: Run convertNumber() before displaying numbers (T322443)]] (duration: 06m 53s) |
[production] |
09:47 |
<cgoubert@cumin1001> |
START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:wikikube-staging-worker-eqiad |
[production] |
09:47 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:wikikube-staging-worker-codfw |
[production] |
09:44 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 4:00:00 on db1110.eqiad.wmnet with reason: Moving to m3 T335092 |
[production] |
09:44 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 4 days, 4:00:00 on db1110.eqiad.wmnet with reason: Moving to m3 T335092 |
[production] |
09:42 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:914436|Personalized praise: Run convertNumber() before displaying numbers (T322443)]], [[gerrit:914435|Personalized praise: Run convertNumber() before displaying numbers (T322443)]] |
[production] |
09:41 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2114 (T335838)', diff saved to https://phabricator.wikimedia.org/P47324 and previous config saved to /var/cache/conftool/dbconfig/20230503-094135-ladsgroup.json |
[production] |
09:36 |
<jelto@cumin1001> |
END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab1004.wikimedia.org with reason: Install software version upgrade |
[production] |