2024-06-27 §
00:56 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2175 (T367856)', diff saved to https://phabricator.wikimedia.org/P65502 and previous config saved to /var/cache/conftool/dbconfig/20240627-005613-marostegui.json [production]
00:56 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance [production]
00:55 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance [production]
00:55 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367856)', diff saved to https://phabricator.wikimedia.org/P65501 and previous config saved to /var/cache/conftool/dbconfig/20240627-005549-marostegui.json [production]
00:40 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65500 and previous config saved to /var/cache/conftool/dbconfig/20240627-004042-marostegui.json [production]
00:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65499 and previous config saved to /var/cache/conftool/dbconfig/20240627-002535-marostegui.json [production]
00:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367856)', diff saved to https://phabricator.wikimedia.org/P65498 and previous config saved to /var/cache/conftool/dbconfig/20240627-001028-marostegui.json [production]
2024-06-26 §
23:56 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5021.eqsin.wmnet with OS bullseye [production]
23:26 <mutante> people1004 - stopped confd which logs every 3 seconds that it can't find any templates (T356296) [production]
23:23 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage [production]
23:20 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage [production]
23:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65497 and previous config saved to /var/cache/conftool/dbconfig/20240626-231020-marostegui.json [production]
23:10 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance [production]
23:10 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance [production]
23:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1211 (T364069)', diff saved to https://phabricator.wikimedia.org/P65496 and previous config saved to /var/cache/conftool/dbconfig/20240626-230958-marostegui.json [production]
22:54 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1211', diff saved to https://phabricator.wikimedia.org/P65495 and previous config saved to /var/cache/conftool/dbconfig/20240626-225451-marostegui.json [production]
22:47 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS bullseye [production]
22:41 <brett@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5021.eqsin.wmnet [production]
22:39 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1211', diff saved to https://phabricator.wikimedia.org/P65494 and previous config saved to /var/cache/conftool/dbconfig/20240626-223944-marostegui.json [production]
22:26 <brett@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5020.eqsin.wmnet [production]
22:24 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1211 (T364069)', diff saved to https://phabricator.wikimedia.org/P65493 and previous config saved to /var/cache/conftool/dbconfig/20240626-222434-marostegui.json [production]
22:22 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5020.eqsin.wmnet with OS bullseye [production]
21:50 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage [production]
21:46 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage [production]
21:40 <cjming> end of UTC late backport window [production]
21:38 <cjming@deploy1002> Finished scap: Backport for [[gerrit:1050005|Homepage: don't load yesterdays edits on desktop (T368405)]] (duration: 08m 48s) [production]
21:33 <cjming@deploy1002> cjming, migr: Continuing with sync [production]
21:32 <cjming@deploy1002> cjming, migr: Backport for [[gerrit:1050005|Homepage: don't load yesterdays edits on desktop (T368405)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:29 <cjming@deploy1002> Started scap: Backport for [[gerrit:1050005|Homepage: don't load yesterdays edits on desktop (T368405)]] [production]
21:29 <hashar> restarting CI Jenkins [production]
21:13 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5020.eqsin.wmnet with OS bullseye [production]
21:13 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5020.eqsin.wmnet with OS bullseye [production]
21:05 <cjming@deploy1002> Finished scap: Backport for [[gerrit:1050002|Homepage: log rendering time for each module and each wiki (T368405)]] (duration: 14m 01s) [production]
20:59 <eileen> config revision changed from 0b822cd3 to 994e7b81 [production]
20:57 <cjming@deploy1002> cjming, migr: Continuing with sync [production]
20:55 <cjming@deploy1002> cjming, migr: Backport for [[gerrit:1050002|Homepage: log rendering time for each module and each wiki (T368405)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:51 <cjming@deploy1002> Started scap: Backport for [[gerrit:1050002|Homepage: log rendering time for each module and each wiki (T368405)]] [production]
20:50 <jdrewniak@deploy1002> Finished scap: Backport for [[gerrit:1049972|Enable user pages and select special pages in dark mode (1.43.0-wmf.11) (T366364 T366375 T367375 T367581 T367582 T367583)]] (duration: 08m 09s) [production]
20:47 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5020.eqsin.wmnet with OS bullseye [production]
20:45 <jdrewniak@deploy1002> jdlrobson, jdrewniak: Continuing with sync [production]
20:44 <jdrewniak@deploy1002> jdlrobson, jdrewniak: Backport for [[gerrit:1049972|Enable user pages and select special pages in dark mode (1.43.0-wmf.11) (T366364 T366375 T367375 T367581 T367582 T367583)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:42 <jdrewniak@deploy1002> Started scap: Backport for [[gerrit:1049972|Enable user pages and select special pages in dark mode (1.43.0-wmf.11) (T366364 T366375 T367375 T367581 T367582 T367583)]] [production]
20:40 <jdrewniak@deploy1002> Synchronized portals: Wikimedia Portals Update: [[gerrit:1050007| Bumping portals to master (T128546)]] (duration: 06m 58s) [production]
20:33 <jdrewniak@deploy1002> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1050007| Bumping portals to master (T128546)]] (duration: 07m 27s) [production]
20:28 <brett@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5020.eqsin.wmnet [production]
20:21 <cjming@deploy1002> Finished scap: Backport for [[gerrit:1049947|Update QuickSurvey coverage rate for Automoderator patroller workstream survey (T362969)]] (duration: 08m 46s) [production]
20:15 <cjming@deploy1002> cjming, kgraessle: Continuing with sync [production]
20:14 <cjming@deploy1002> cjming, kgraessle: Backport for [[gerrit:1049947|Update QuickSurvey coverage rate for Automoderator patroller workstream survey (T362969)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:12 <cjming@deploy1002> Started scap: Backport for [[gerrit:1049947|Update QuickSurvey coverage rate for Automoderator patroller workstream survey (T362969)]] [production]
20:08 <mutante> lists1001:/lib/systemd/system# rm wmf_auto_restart_apache2.* ; systemctl reset-failed - reaction to monitoring alert "FIRING: SystemdUnitFailed: wmf_auto_restart_apache2.service on lists1001:9100" [production]