201-250 of 10000 results (7ms)
2025-01-23 ยง
14:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P72266 and previous config saved to /var/cache/conftool/dbconfig/20250123-142504-marostegui.json [production]
14:21 <kamila@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1142.eqiad.wmnet wikikube-worker1143.eqiad.wmnet wikikube-worker1144.eqiad.wmnet wikikube-worker1145.eqiad.wmnet wikikube-worker1146.eqiad.wmnet wikikube-worker1147.eqiad.wmnet on all recursors [production]
14:21 <kamila@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker1142.eqiad.wmnet wikikube-worker1143.eqiad.wmnet wikikube-worker1144.eqiad.wmnet wikikube-worker1145.eqiad.wmnet wikikube-worker1146.eqiad.wmnet wikikube-worker1147.eqiad.wmnet on all recursors [production]
14:21 <raymond-ndibe@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
14:21 <samtar@deploy2002> Finished scap sync-world: Backport for [[gerrit:1113463|cirrus: stop writing to wikitech index from the MW JobQueue]], [[gerrit:1113750|cirrus: cleanup unused settings (T374702)]] (duration: 12m 00s) [production]
14:14 <samtar@deploy2002> dcausse, samtar: Continuing with sync [production]
14:13 <samtar@deploy2002> dcausse, samtar: Backport for [[gerrit:1113463|cirrus: stop writing to wikitech index from the MW JobQueue]], [[gerrit:1113750|cirrus: cleanup unused settings (T374702)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:13 <raymond-ndibe@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]
14:12 <marostegui@cumin1002> dbctl commit (dc=all): 'db2166 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72264 and previous config saved to /var/cache/conftool/dbconfig/20250123-141209-root.json [production]
14:10 <dcaro> reboot tools-static-15 due to nginx stuck on nfs [tools]
14:10 <root@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1239.eqiad.wmnet with reason: host reimage [production]
14:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1165 (T384592)', diff saved to https://phabricator.wikimedia.org/P72263 and previous config saved to /var/cache/conftool/dbconfig/20250123-140957-marostegui.json [production]
14:09 <samtar@deploy2002> Started scap sync-world: Backport for [[gerrit:1113463|cirrus: stop writing to wikitech index from the MW JobQueue]], [[gerrit:1113750|cirrus: cleanup unused settings (T374702)]] [production]
14:07 <root@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1239.eqiad.wmnet with reason: host reimage [production]
14:07 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-restart-tcp-mss-clamper (exit_code=0) rolling restart_daemons on A:cp-text_eqiad [production]
14:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1165 (T384592)', diff saved to https://phabricator.wikimedia.org/P72262 and previous config saved to /var/cache/conftool/dbconfig/20250123-140649-marostegui.json [production]
14:06 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 20:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1015,1019].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
14:06 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1165.eqiad.wmnet with reason: Maintenance [production]
14:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Repool db1165', diff saved to https://phabricator.wikimedia.org/P72261 and previous config saved to /var/cache/conftool/dbconfig/20250123-140524-marostegui.json [production]
13:57 <marostegui@cumin1002> dbctl commit (dc=all): 'db2166 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72259 and previous config saved to /var/cache/conftool/dbconfig/20250123-135704-root.json [production]
13:57 <marostegui@cumin1002> dbctl commit (dc=all): 'db1165 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72258 and previous config saved to /var/cache/conftool/dbconfig/20250123-135704-root.json [production]
13:56 <fceratto@cumin1002> dbctl commit (dc=all): 'Depool db2140 T384480', diff saved to https://phabricator.wikimedia.org/P72257 and previous config saved to /var/cache/conftool/dbconfig/20250123-135655-fceratto.json [production]
13:55 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from parse1006 to wikikube-worker1147 [production]
13:54 <kamila@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1147 [production]
13:52 <kamila@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1147 [production]
13:52 <kamila@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:52 <kamila@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming parse1006 to wikikube-worker1147 - kamila@cumin1002" [production]
13:50 <root@cumin1002> START - Cookbook sre.hosts.reimage for host db1239.eqiad.wmnet with OS bookworm [production]
13:50 <kamila@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming parse1006 to wikikube-worker1147 - kamila@cumin1002" [production]
13:49 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-restart-tcp-mss-clamper rolling restart_daemons on A:cp-text_eqiad [production]
13:49 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-restart-tcp-mss-clamper (exit_code=0) rolling restart_daemons on A:cp-text_esams [production]
13:46 <jynus@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1239.eqiad.wmnet with reason: reimage [production]
13:42 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from parse1005 to wikikube-worker1146 [production]
13:42 <kamila@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1146 [production]
13:41 <kamila@cumin1002> START - Cookbook sre.dns.netbox [production]
13:41 <godog> bounce mtail on centrallog2002 - high system cpu usage and perf top reports native_queued_spin_lock_slowpath [production]
13:41 <kamila@cumin1002> START - Cookbook sre.hosts.rename from parse1006 to wikikube-worker1147 [production]
13:38 <kamila@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1146 [production]
13:38 <kamila@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:38 <kamila@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming parse1005 to wikikube-worker1146 - kamila@cumin1002" [production]
13:37 <kamila@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming parse1005 to wikikube-worker1146 - kamila@cumin1002" [production]
13:33 <marostegui@cumin1002> dbctl commit (dc=all): 'db2166 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72256 and previous config saved to /var/cache/conftool/dbconfig/20250123-133311-root.json [production]
13:33 <marostegui@cumin1002> dbctl commit (dc=all): 'db1165 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72255 and previous config saved to /var/cache/conftool/dbconfig/20250123-133304-root.json [production]
13:31 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-restart-tcp-mss-clamper rolling restart_daemons on A:cp-text_esams [production]
13:31 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1113799|file: Add caller to write queries (T384481)]] (duration: 09m 43s) [production]
13:30 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from parse1004 to wikikube-worker1145 [production]
13:30 <kamila@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1145 [production]
13:29 <cmooney@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netflow1002.eqiad.wmnet with reason: disabling alerts as I'm running gnmic manually rather than with systemd [production]
13:28 <kamila@cumin1002> START - Cookbook sre.dns.netbox [production]
13:28 <kamila@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1145 [production]