3801-3850 of 10000 results (70ms)
2022-07-11 §
07:22 <marostegui@cumin1001> START - Cookbook sre.hosts.decommission for hosts db2080.codfw.wmnet [production]
07:09 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti2027.codfw.wmnet with OS bullseye [production]
07:00 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db2077.codfw.wmnet [production]
06:58 <marostegui@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
06:54 <marostegui@cumin1001> START - Cookbook sre.dns.netbox [production]
06:50 <marostegui@cumin1001> START - Cookbook sre.hosts.decommission for hosts db2077.codfw.wmnet [production]
06:28 <_joe_> repool thumbor1005 [production]
06:28 <_joe_> depooled thumbor1005, downgraded firejail, restarted units [production]
00:23 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
00:19 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
00:19 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
2022-07-10 §
13:48 <godog> silence ProbeDown pages for thumbor:8800 until wed [production]
2022-07-09 §
13:34 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:33 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:33 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:32 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
01:48 <krinkle@deploy1002> Synchronized php-1.39.0-wmf.19/includes/ResourceLoader/: I3e43b10d26858c5b (duration: 03m 37s) [production]
01:44 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
01:43 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
01:43 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
01:42 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
01:37 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
01:36 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
01:36 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
01:35 <krinkle@deploy1002> Synchronized wmf-config/: I1bb97d1d601 (duration: 03m 24s) [production]
01:35 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
2022-07-08 §
21:44 <ryankemper> [Elastic] Reshuffled shards on eqiad to get cluster back into green status (from yellow): https://phabricator.wikimedia.org/P30995#130117 [production]
21:32 <ori> apt1001: reprepro -C main include buster-wikimedia libvmod-querysort_0.2_amd64.changes [production]
19:58 <thcipriani> quick phab downtime for deploy to fix T312614 [production]
19:57 <cdanis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab.wmfusercontent.org with reason: bug fix [production]
19:57 <cdanis@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on phab.wmfusercontent.org with reason: bug fix [production]
19:57 <cdanis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phabricator.wikimedia.org with reason: bug fix [production]
19:56 <cdanis@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on phabricator.wikimedia.org with reason: bug fix [production]
19:56 <cdanis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab1001.eqiad.wmnet with reason: bug fix [production]
19:56 <cdanis@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on phab1001.eqiad.wmnet with reason: bug fix [production]
19:49 <tzatziki> removing 2 files for legal compliance [production]
18:42 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudelastic1001.wikimedia.org with OS bullseye [production]
18:26 <urandom> changing Cassandra superuser password, AQS cluster -- T311652 [production]
18:21 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudelastic1001.wikimedia.org with reason: host reimage [production]
18:18 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudelastic1001.wikimedia.org with reason: host reimage [production]
18:03 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host cloudelastic1001.wikimedia.org with OS bullseye [production]
16:25 <bking@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1005.wikimedia.org with OS bullseye [production]
15:29 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host cloudelastic1005.wikimedia.org with OS bullseye [production]
15:27 <bking@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1005.wikimedia.org with OS bullseye [production]
15:27 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host cloudelastic1005.wikimedia.org with OS bullseye [production]
15:15 <bking@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudelastic1005.wikimedia.org with OS bullseye [production]
15:00 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host cloudelastic1005.wikimedia.org with OS bullseye [production]
14:59 <bking@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudelastic1005.wikimedia.org with OS bullseye [production]
14:49 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host cloudelastic1005.wikimedia.org with OS bullseye [production]
14:46 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudelastic1004.wikimedia.org with OS bullseye [production]