2024-05-06 §
05:26 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2165.codfw.wmnet with reason: Maintenance [production]
05:26 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2165.codfw.wmnet with reason: Maintenance [production]
2024-05-05 §
11:09 <brennen@deploy1002> Finished deploy [phabricator/deployment@dd53761]: test deploy phab1004 for T364271 (duration: 00m 32s) [production]
11:08 <brennen@deploy1002> Started deploy [phabricator/deployment@dd53761]: test deploy phab1004 for T364271 [production]
11:08 <brennen@deploy1002> Finished deploy [phabricator/deployment@dd53761]: test deploy phab2002 for T364271 (duration: 00m 32s) [production]
11:07 <brennen@deploy1002> Started deploy [phabricator/deployment@dd53761]: test deploy phab2002 for T364271 [production]
11:04 <taavi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab.wmfusercontent.org with reason: brennen is deploying things [production]
11:03 <taavi@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on phab.wmfusercontent.org with reason: brennen is deploying things [production]
11:03 <taavi@cumin1002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1:00:00 on phabricator.wikimedia.org with reason: brennen is deploying things [production]
11:03 <taavi@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on phabricator.wikimedia.org with reason: brennen is deploying things [production]
11:03 <taavi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab1004.eqiad.wmnet with reason: brennen is deploying things [production]
11:03 <taavi@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on phab1004.eqiad.wmnet with reason: brennen is deploying things [production]
08:42 <taavi> taavi@gerrit1003 ~ $ sudo systemctl restart apache2 [production]
2024-05-04 §
13:41 <jayme> doubled the number of eventgate-main replicas in eqiad to 16 [production]
07:39 <taavi@cumin1002> END (PASS) - Cookbook sre.wikireplicas.update-views (exit_code=0) [production]
07:33 <taavi@cumin1002> START - Cookbook sre.wikireplicas.update-views [production]
03:07 <denisse> Restarting `status curator_actions_cluster_wide.service` to log with DEBUG level on logstash2026 - T364190 [production]
03:06 <denisse> Enable log level DEBUG for curator on logstash2026 - T364190 [production]
01:33 <bblack@cumin1002> conftool action : set/weight=100; selector: name=dns7.* [production]
01:24 <bblack> lvs7001 - restart pybal [production]
01:23 <bblack> lvs7003 - restart pybal [production]
2024-05-03 §
21:38 <ryankemper@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6 days, 0:00:00 on wdqs2023.codfw.wmnet with reason: T362920 [production]
21:38 <ryankemper@cumin2002> START - Cookbook sre.hosts.downtime for 6 days, 0:00:00 on wdqs2023.codfw.wmnet with reason: T362920 [production]
21:27 <ryankemper> T362920 [wdqs] Depooled `wdqs2023` in preparation to switch it to a graph split host [production]
19:02 <sukhe> cleaning up stale confd template files for magru related reimaging [production]
18:44 <brett@cumin2002> conftool action : set/pooled=yes; selector: name=ncredir7002.magru.wmnet,service=nginx [production]
18:43 <brett@cumin2002> conftool action : set/pooled=yes; selector: name=ncredir7001.magru.wmnet,service=nginx [production]
18:38 <brett@cumin2002> conftool action : set/pooled=no; selector: name=ncredir7001.magru.wmnet,service=nginx [production]
18:38 <brett@cumin2002> conftool action : set/pooled=no; selector: name=ncredir7002.magru.wmnet,service=nginx [production]
18:29 <brett@cumin2002> conftool action : set/pooled=yes; selector: name=ncredir7002.magru.wmnet,service=nginx [production]
18:29 <brett@cumin2002> conftool action : set/weight=1; selector: name=ncredir7002.magru.wmnet,service=nginx [production]
18:29 <brett@cumin2002> conftool action : set/pooled=yes; selector: name=ncredir7001.magru.wmnet,service=nginx [production]
18:28 <brett@cumin2002> conftool action : set/weight=1; selector: name=ncredir7001.magru.wmnet,service=nginx [production]
17:45 <dcausse> repooling wdqs1012 [production]
17:27 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
17:27 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
17:14 <brett@cumin2002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host ncredir7002.magru.wmnet [production]
17:14 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir7002.magru.wmnet with OS bookworm [production]
17:13 <denisse> Run `sudo mdadm --add /dev/md1 /dev/sdg` on `centrallog1002` - T363660 [production]
17:01 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
17:00 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
17:00 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195 (T361627)', diff saved to https://phabricator.wikimedia.org/P61862 and previous config saved to /var/cache/conftool/dbconfig/20240503-170054-marostegui.json [production]
16:47 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir7002.magru.wmnet with reason: host reimage [production]
16:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61860 and previous config saved to /var/cache/conftool/dbconfig/20240503-164546-marostegui.json [production]
16:44 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir7002.magru.wmnet with reason: host reimage [production]
16:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61859 and previous config saved to /var/cache/conftool/dbconfig/20240503-163039-marostegui.json [production]
16:18 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host ncredir7002.magru.wmnet with OS bookworm [production]
16:15 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2195 (T361627)', diff saved to https://phabricator.wikimedia.org/P61858 and previous config saved to /var/cache/conftool/dbconfig/20240503-161531-marostegui.json [production]
15:54 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2195 (T361627)', diff saved to https://phabricator.wikimedia.org/P61857 and previous config saved to /var/cache/conftool/dbconfig/20240503-155432-marostegui.json [production]
15:54 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2195.codfw.wmnet with reason: Maintenance [production]