101-150 of 10000 results (39ms)
2022-04-12 ยง
17:03 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2027.codfw.wmnet with OS buster [production]
16:57 <razzi@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on clouddb1013.eqiad.wmnet with reason: host reimage [production]
16:54 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase2027.codfw.wmnet with reason: host reimage [production]
16:53 <razzi@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on clouddb1013.eqiad.wmnet with reason: host reimage [production]
16:49 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase2027.codfw.wmnet with reason: host reimage [production]
16:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24513 and previous config saved to /var/cache/conftool/dbconfig/20220412-164907-ladsgroup.json [production]
16:42 <razzi@cumin1001> START - Cookbook sre.hosts.reimage for host clouddb1013.eqiad.wmnet with OS bullseye [production]
16:33 <mutante> gitlab: pausing runner-1013, then will remove it and create new bullseye runner to replace it [production]
16:30 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host restbase2027.codfw.wmnet with OS buster [production]
16:08 <dzahn@cumin2002> conftool action : set/pooled=no; selector: dc=eqiad,name=mw1308.eqiad.wmnet [production]
16:01 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:56 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
15:51 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24512 and previous config saved to /var/cache/conftool/dbconfig/20220412-155143-ladsgroup.json [production]
15:51 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
15:51 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
15:51 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1106.eqiad.wmnet with reason: Maintenance [production]
15:51 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1106.eqiad.wmnet with reason: Maintenance [production]
15:49 <arturo> aborrero@apt1001:~ $ sudo -i reprepro -C component/prometheus-openstack-exporter includedeb bullseye-wikimedia ${PWD}/prometheus-openstack-exporter_1.5.0-1_amd64.deb (T302178) [production]
15:46 <cmjohnson@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudstore1010.wikimedia.org with OS bullseye [production]
15:44 <arturo> removed a bunch of old src & binary packages for prometheus-openstack-exporter (T302178) [production]
15:36 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase2026.codfw.wmnet [production]
15:36 <razzi@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host clouddb1013.eqiad.wmnet with OS bullseye [production]
15:36 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase2020.codfw.wmnet [production]
15:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase2017.codfw.wmnet [production]
15:34 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase2019.codfw.wmnet [production]
15:27 <cmjohnson@cumin1001> START - Cookbook sre.hosts.reimage for host cloudstore1011.wikimedia.org with OS bullseye [production]
15:23 <cmjohnson@cumin1001> START - Cookbook sre.hosts.reimage for host cloudstore1010.wikimedia.org with OS bullseye [production]
15:22 <cmjohnson@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudstore1010.wikimedia.org with OS bullseye [production]
15:20 <cmjohnson@cumin1001> START - Cookbook sre.hosts.reimage for host cloudstore1010.wikimedia.org with OS bullseye [production]
15:10 <razzi@cumin1001> START - Cookbook sre.hosts.reimage for host clouddb1013.eqiad.wmnet with OS bullseye [production]
15:09 <vgutierrez@puppetmaster1001> conftool action : set/pooled=inactive; selector: name=cp5002.eqsin.wmnet [production]
15:08 <razzi@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on clouddb1013.eqiad.wmnet with reason: Upgrade clouddb1013 to bullseye [production]
15:08 <razzi@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on clouddb1013.eqiad.wmnet with reason: Upgrade clouddb1013 to bullseye [production]
15:07 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
15:07 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
15:07 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
15:07 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
15:06 <dancy@deploy1002> Synchronized php-1.39.0-wmf.6/includes/EditPage.php: Backport: [[gerrit:778641|Temporarily undeprecate EditPage::$textbox2 (T305028)]] (duration: 00m 52s) [production]
15:05 <hnowlan@deploy1002> Finished deploy [restbase/deploy@627f7d7]: add guw.wikipedia.org (duration: 15m 56s) [production]
15:03 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1139.eqiad.wmnet with reason: Maintenance [production]
15:03 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1139.eqiad.wmnet with reason: Maintenance [production]
14:49 <hnowlan@deploy1002> Started deploy [restbase/deploy@627f7d7]: add guw.wikipedia.org [production]
14:47 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudstore1011.mgmt.eqiad.wmnet with reboot policy FORCED [production]
14:47 <hnowlan@deploy1002> Finished deploy [restbase/deploy@31675fb]: add guw.wikipedia.org (duration: 00m 22s) [production]
14:46 <hnowlan@deploy1002> Started deploy [restbase/deploy@31675fb]: add guw.wikipedia.org [production]
14:45 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudstore1010.mgmt.eqiad.wmnet with reboot policy FORCED [production]
14:30 <cmjohnson@cumin1001> START - Cookbook sre.hosts.provision for host cloudstore1011.mgmt.eqiad.wmnet with reboot policy FORCED [production]
14:29 <cmjohnson@cumin1001> START - Cookbook sre.hosts.provision for host cloudstore1010.mgmt.eqiad.wmnet with reboot policy FORCED [production]
14:17 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
14:17 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]