6801-6850 of 10000 results (69ms)
2024-07-12 ยง
23:31 <bd808> Removed self from project members [logging]
23:21 <bd808> Added rule to scap-access security group to let deployment-deploy04.deployment-prep.eqiad1.wikimedia.cloud submit logs (T369962) [logging]
23:20 <thcipriani> un-skipping logstash checks in beta [releng]
23:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2220 (T367856)', diff saved to https://phabricator.wikimedia.org/P66420 and previous config saved to /var/cache/conftool/dbconfig/20240712-231912-marostegui.json [production]
23:16 <bd808> Added self as project member to work on T369962 [logging]
22:59 <thcipriani> skipping logstash checks in beta [releng]
22:34 <tzatziki> removing 1 file for legal compliance [production]
22:32 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1223 (T367856)', diff saved to https://phabricator.wikimedia.org/P66419 and previous config saved to /var/cache/conftool/dbconfig/20240712-223226-marostegui.json [production]
22:32 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1223.eqiad.wmnet with reason: Maintenance [production]
22:32 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1223.eqiad.wmnet with reason: Maintenance [production]
22:32 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1212 (T367856)', diff saved to https://phabricator.wikimedia.org/P66418 and previous config saved to /var/cache/conftool/dbconfig/20240712-223204-marostegui.json [production]
22:21 <tzatziki> removing 1 file for legal compliance [production]
22:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P66417 and previous config saved to /var/cache/conftool/dbconfig/20240712-221656-marostegui.json [production]
22:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P66416 and previous config saved to /var/cache/conftool/dbconfig/20240712-220149-marostegui.json [production]
21:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1212 (T367856)', diff saved to https://phabricator.wikimedia.org/P66415 and previous config saved to /var/cache/conftool/dbconfig/20240712-214642-marostegui.json [production]
21:38 <thcipriani> update beta-* CI jobs, pool deployment-deploy04 in jenkins, offline deployment-deploy03 [releng]
20:12 <thcipriani> disable beta-code-update-eqiad/beta-scap-sync-world until server tinkering concludes [releng]
20:02 <bd808> Restarted Jenkins agent on deployment-deploy03 [releng]
19:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1212 (T367856)', diff saved to https://phabricator.wikimedia.org/P66414 and previous config saved to /var/cache/conftool/dbconfig/20240712-190224-marostegui.json [production]
19:02 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
19:02 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
19:02 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1212.eqiad.wmnet with reason: Maintenance [production]
19:02 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1212.eqiad.wmnet with reason: Maintenance [production]
19:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1198 (T367856)', diff saved to https://phabricator.wikimedia.org/P66413 and previous config saved to /var/cache/conftool/dbconfig/20240712-190154-marostegui.json [production]
18:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P66412 and previous config saved to /var/cache/conftool/dbconfig/20240712-184647-marostegui.json [production]
18:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P66411 and previous config saved to /var/cache/conftool/dbconfig/20240712-183140-marostegui.json [production]
18:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1198 (T367856)', diff saved to https://phabricator.wikimedia.org/P66410 and previous config saved to /var/cache/conftool/dbconfig/20240712-181632-marostegui.json [production]
17:49 <thcipriani> reconfigure beta-code-update-eqiad beta-scap-sync-world beta-update-databases-eqiad pending merge of https://gerrit.wikimedia.org/r/1053956 [releng]
17:10 <hnowlan@cumin1002> conftool action : set/pooled=yes:weight=10; selector: name=(mw1349.eqiad.wmnet|mw1350.eqiad.wmnet|mw1351.eqiad.wmnet) [production]
17:07 <hnowlan@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1349.eqiad.wmnet [production]
17:07 <hnowlan@cumin1002> START - Cookbook sre.hosts.remove-downtime for mw1349.eqiad.wmnet [production]
17:07 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw[1350-1351].eqiad.wmnet [production]
17:07 <cgoubert@cumin1002> START - Cookbook sre.hosts.remove-downtime for mw[1350-1351].eqiad.wmnet [production]
17:06 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1351.eqiad.wmnet with OS buster [production]
17:03 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1350.eqiad.wmnet with OS buster [production]
17:00 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1349.eqiad.wmnet with OS buster [production]
16:32 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1351.eqiad.wmnet with reason: host reimage [production]
16:29 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1350.eqiad.wmnet with reason: host reimage [production]
16:27 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1349.eqiad.wmnet with reason: host reimage [production]
16:24 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1351.eqiad.wmnet with reason: host reimage [production]
16:24 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1350.eqiad.wmnet with reason: host reimage [production]
16:23 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1349.eqiad.wmnet with reason: host reimage [production]
16:17 <dcausse@deploy1002> helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
16:16 <dcausse@deploy1002> helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply [production]
16:10 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1351.eqiad.wmnet with OS buster [production]
16:09 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1350.eqiad.wmnet with OS buster [production]
16:09 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1349.eqiad.wmnet with OS buster [production]
16:05 <cgoubert@cumin1002> conftool action : set/pooled=inactive; selector: name=(mw1349|mw1350|mw1351).eqiad.wmnet,cluster=(jobrunner|videoscaler) [production]
16:05 <cgoubert@cumin1002> conftool action : set/pooled=yes; selector: name=(mw1349|mw1350|mw1351).eqiad.wmnet,cluster=(jobrunner|videoscaler) [production]
16:04 <claime> pooling mw1349, mw1350, mw1351 as jobrunners [production]