1601-1650 of 10000 results (109ms)
2023-01-19 ยง
11:17 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1054.eqiad.wmnet with reason: host reimage [production]
11:13 <filippo@cumin1001> START - Cookbook sre.dns.netbox [production]
11:09 <filippo@cumin1001> START - Cookbook sre.hosts.decommission for hosts webperf2004.codfw.wmnet [production]
11:08 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts webperf1004.eqiad.wmnet [production]
11:08 <filippo@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:08 <filippo@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: webperf1004.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - filippo@cumin1001" [production]
11:06 <filippo@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: webperf1004.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - filippo@cumin1001" [production]
11:06 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host mc1054.eqiad.wmnet with OS bullseye [production]
11:02 <filippo@cumin1001> START - Cookbook sre.dns.netbox [production]
10:58 <filippo@cumin1001> START - Cookbook sre.hosts.decommission for hosts webperf1004.eqiad.wmnet [production]
10:44 <hnowlan@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
10:44 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=99) [production]
10:44 <hnowlan@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
10:44 <hnowlan> rebooting maps-eqiad for updates [production]
10:27 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
10:27 <cgoubert@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
10:27 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
10:27 <cgoubert@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
10:27 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply [production]
10:27 <cgoubert@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
10:27 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
10:24 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on webperf2004.codfw.wmnet with reason: decom [production]
10:24 <filippo@cumin1001> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on webperf2004.codfw.wmnet with reason: decom [production]
10:19 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "new ping host - jmm@cumin2002" [production]
10:17 <claime> Restarted maintenance scripts on mwmaint1002.eqiad.wmnet [production]
10:17 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "new ping host - jmm@cumin2002" [production]
10:17 <cgoubert@cumin1001> END (PASS) - Cookbook sre.switchdc.mediawiki.08-start-maintenance (exit_code=0) [production]
10:15 <cgoubert@cumin1001> START - Cookbook sre.switchdc.mediawiki.08-start-maintenance [production]
10:13 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwmaint1002.eqiad.wmnet [production]
10:07 <cgoubert@cumin1001> START - Cookbook sre.hosts.reboot-single for host mwmaint1002.eqiad.wmnet [production]
10:06 <cgoubert@cumin1001> END (PASS) - Cookbook sre.switchdc.mediawiki.01-stop-maintenance (exit_code=0) [production]
10:06 <cgoubert@cumin1001> START - Cookbook sre.switchdc.mediawiki.01-stop-maintenance [production]
10:05 <claime> Stopping maintenance scripts on mwmaint1002.eqiad.wmnet for reboot [production]
09:55 <moritzm> installing ping3003 T273509 [production]
09:27 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on ldap-corp[1001,2001].wikimedia.org with reason: Decommissioning [production]
09:27 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on ldap-corp[1001,2001].wikimedia.org with reason: Decommissioning [production]
09:24 <jnuche@deploy1002> rebuilt and synchronized wikiversions files: all wikis to 1.40.0-wmf.19 refs T325582 [production]
09:17 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
09:17 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
09:16 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
09:16 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
08:26 <moritzm> installing sudo security updates [production]
07:45 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
07:45 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]