4451-4500 of 10000 results (44ms)
2022-01-22 §
22:38 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
22:38 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
14:51 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
14:51 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
08:35 <elukey> `apt-get clean` on an-test-coord1001 to free some space [production]
08:25 <elukey> remove the `--debug=true` etcd daemon arg from ml-etcd2002 (only node having it, probably a manual test done in the past) and cleaned up spammy etcd logs to free space [production]
01:30 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
01:30 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
00:27 <dzahn@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=miscweb [production]
2022-01-21 §
22:23 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
22:23 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
21:43 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
21:42 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
21:42 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
21:40 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
21:38 <brennen@deploy1002> Synchronized php-1.38.0-wmf.18/extensions/VisualEditor/modules/ve-mw: Backport: [[gerrit:756066|Revert "Re-duplicate deduplicated TemplateStyles" (T287675 T299251 T299767)]] (duration: 00m 49s) [production]
21:21 <topranks> Running homer against cr1-eqiad and cr2-eqiad to remove entries on analytics-in4/6 filters that refer to decommissioned deb mirror host sodium. [production]
19:14 <ayounsi@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:10 <ayounsi@cumin1001> START - Cookbook sre.dns.netbox [production]
19:05 <ayounsi@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:01 <ayounsi@cumin1001> START - Cookbook sre.dns.netbox [production]
18:46 <herron> restarting pybal on lvs1015,lvs1020,lvs2009,lvs2010 to remove legacy elk5 services T299700 [production]
18:39 <robh@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:36 <robh@cumin1001> START - Cookbook sre.dns.netbox [production]
18:26 <ayounsi@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:15 <ayounsi@cumin1001> START - Cookbook sre.dns.netbox [production]
17:42 <rzl> rzl@apt1001:~$ sudo -i reprepro -C main include buster-wikimedia /home/rzl/python3-imagecatalog/imagecatalog_0.0.4-1_amd64.changes [production]
16:56 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase1021.eqiad.wmnet [production]
16:55 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase1021.eqiad.wmnet with OS buster [production]
16:47 <hnowlan@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1021.eqiad.wmnet with OS buster [production]
16:47 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase1020.eqiad.wmnet [production]
16:46 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase1020.eqiad.wmnet with OS buster [production]
16:26 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts sodium.wikimedia.org [production]
16:20 <hnowlan@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1020.eqiad.wmnet with OS buster [production]
16:18 <aqu@deploy1002> Finished deploy [airflow-dags/analytics-test@3ad07a0]: (no justification provided) (duration: 00m 08s) [production]
16:18 <aqu@deploy1002> Started deploy [airflow-dags/analytics-test@3ad07a0]: (no justification provided) [production]
16:05 <jhathaway@cumin1001> START - Cookbook sre.hosts.decommission for hosts sodium.wikimedia.org [production]
16:04 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=restbase1019.eqiad.wmnet [production]
16:03 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase1019.eqiad.wmnet with OS buster [production]
16:02 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2000 days, 0:00:00 on sodium.wikimedia.org with reason: decom [production]
16:02 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 2000 days, 0:00:00 on sodium.wikimedia.org with reason: decom [production]
15:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on ganeti1013.eqiad.wmnet with reason: Remove from Ganeti cluster for reimage [production]
15:51 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on ganeti1013.eqiad.wmnet with reason: Remove from Ganeti cluster for reimage [production]
15:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on ganeti1018.eqiad.wmnet with reason: Remove from Ganeti cluster for reimage [production]
15:51 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on ganeti1018.eqiad.wmnet with reason: Remove from Ganeti cluster for reimage [production]
15:50 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti1025.eqiad.wmnet to ganeti01.svc.eqiad.wmnet [production]
15:50 <moritzm> added ganeti1025 to Ganeti eqiad cluster T293909 [production]
15:29 <jhathaway@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
15:29 <jhathaway@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on mx1001.wikimedia.org with reason: kernel testing [production]
15:25 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2026.codfw.wmnet with OS buster [production]