4101-4150 of 10000 results (104ms)
2022-11-10 ยง
17:44 <dcausse@deploy1002> Started deploy [wikimedia/discovery/analytics@84dd7b5]: T320656: image_suggestions: schedule ad hoc dataset to fix improper suggestions [production]
17:37 <sukhe> [done] running sukhe@cumin2002:~$ homer "cr*-ulsfo*" commit "Gerrit 855583: sites.yaml: add lvs4008 (ulsfo hardware refresh)" [production]
17:36 <sukhe> running sukhe@cumin2002:~$ homer "cr*-ulsfo*" commit "Gerrit 855583: sites.yaml: add lvs4008 (ulsfo hardware refresh)" [production]
17:35 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs4008.ulsfo.wmnet with OS buster [production]
17:34 <rzl> rzl@mwmaint1002:~$ sudo systemctl start mediawiki_job_growthexperiments-userImpactDelete.service # test run for T322706 T322541 [production]
17:33 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P39044 and previous config saved to /var/cache/conftool/dbconfig/20221110-173300-marostegui.json [production]
17:31 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2137:3315', diff saved to https://phabricator.wikimedia.org/P39043 and previous config saved to /var/cache/conftool/dbconfig/20221110-173153-marostegui.json [production]
17:28 <urandom> restarting bootstrap of aqs1016-a -- T307802 [production]
17:26 <urandom> increasing stream throughput to 400mbit, aqs1011-{a,b} & aqs1013-{a,b} -- T307802 [production]
17:26 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1194 (T322618)', diff saved to https://phabricator.wikimedia.org/P39042 and previous config saved to /var/cache/conftool/dbconfig/20221110-172611-ladsgroup.json [production]
17:26 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
17:25 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
17:25 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T322618)', diff saved to https://phabricator.wikimedia.org/P39041 and previous config saved to /var/cache/conftool/dbconfig/20221110-172549-ladsgroup.json [production]
17:23 <rzl> rzl@mwmaint1002:~$ sudo systemctl start mediawiki_job_growthexperiments-userImpactUpdateRecentlyEdited.service # test run for T322706 T322541 [production]
17:18 <robh@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['ganeti1033'] [production]
17:18 <rzl> rzl@mwmaint1002:~$ sudo systemctl start mediawiki_job_growthexperiments-userImpactUpdateRecentlyRegistered.service # test run for T322706 T322541 [production]
17:17 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P39040 and previous config saved to /var/cache/conftool/dbconfig/20221110-171753-marostegui.json [production]
17:16 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2137:3315 (T321130)', diff saved to https://phabricator.wikimedia.org/P39039 and previous config saved to /var/cache/conftool/dbconfig/20221110-171646-marostegui.json [production]
17:13 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs4008.ulsfo.wmnet with reason: host reimage [production]
17:13 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db2137:3315 (T321130)', diff saved to https://phabricator.wikimedia.org/P39038 and previous config saved to /var/cache/conftool/dbconfig/20221110-171329-marostegui.json [production]
17:13 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
17:13 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 5:00:00 on db2137.codfw.wmnet with reason: Maintenance [production]
17:13 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2128 (T321130)', diff saved to https://phabricator.wikimedia.org/P39037 and previous config saved to /var/cache/conftool/dbconfig/20221110-171308-marostegui.json [production]
17:10 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P39036 and previous config saved to /var/cache/conftool/dbconfig/20221110-171043-ladsgroup.json [production]
17:10 <robh@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ganeti1033'] [production]
17:09 <robh@cumin1001> END (ERROR) - Cookbook sre.hardware.upgrade-firmware (exit_code=97) upgrade firmware for hosts ['ganeti1033'] [production]
17:09 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs4008.ulsfo.wmnet with reason: host reimage [production]
17:02 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134 (T321123)', diff saved to https://phabricator.wikimedia.org/P39035 and previous config saved to /var/cache/conftool/dbconfig/20221110-170247-marostegui.json [production]
17:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1134 (T321123)', diff saved to https://phabricator.wikimedia.org/P39034 and previous config saved to /var/cache/conftool/dbconfig/20221110-170139-marostegui.json [production]
17:01 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
17:01 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
17:01 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
17:01 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
17:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1132 (T321123)', diff saved to https://phabricator.wikimedia.org/P39033 and previous config saved to /var/cache/conftool/dbconfig/20221110-170102-marostegui.json [production]
16:58 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2128', diff saved to https://phabricator.wikimedia.org/P39032 and previous config saved to /var/cache/conftool/dbconfig/20221110-165802-marostegui.json [production]
16:55 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P39031 and previous config saved to /var/cache/conftool/dbconfig/20221110-165536-ladsgroup.json [production]
16:53 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host lvs4008.ulsfo.wmnet with OS buster [production]
16:45 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1132', diff saved to https://phabricator.wikimedia.org/P39030 and previous config saved to /var/cache/conftool/dbconfig/20221110-164556-marostegui.json [production]
16:44 <sukhe@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host lvs4008.ulsfo.wmnet with OS buster [production]
16:42 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2128', diff saved to https://phabricator.wikimedia.org/P39029 and previous config saved to /var/cache/conftool/dbconfig/20221110-164255-marostegui.json [production]
16:40 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T322618)', diff saved to https://phabricator.wikimedia.org/P39028 and previous config saved to /var/cache/conftool/dbconfig/20221110-164030-ladsgroup.json [production]
16:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1191 (T322618)', diff saved to https://phabricator.wikimedia.org/P39027 and previous config saved to /var/cache/conftool/dbconfig/20221110-163819-ladsgroup.json [production]
16:38 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
16:38 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
16:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1174 (T322618)', diff saved to https://phabricator.wikimedia.org/P39026 and previous config saved to /var/cache/conftool/dbconfig/20221110-163758-ladsgroup.json [production]
16:37 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs4008.ulsfo.wmnet with reason: host reimage [production]
16:34 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs4008.ulsfo.wmnet with reason: host reimage [production]
16:33 <robh@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ganeti1033'] [production]
16:33 <robh@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['ganeti1033'] [production]
16:32 <robh@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ganeti1033'] [production]