5901-5950 of 10000 results (96ms)
2022-12-07 ยง
22:44 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
22:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T322618)', diff saved to https://phabricator.wikimedia.org/P42540 and previous config saved to /var/cache/conftool/dbconfig/20221207-224440-ladsgroup.json [production]
22:41 <ryankemper> T301167 Downtimed `wdqs20[09-12]` for 7 days [production]
22:37 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
22:36 <ryankemper@puppetmaster1001> conftool action : set/weight=10:pooled=no; selector: name=wdqs2009.* [production]
22:36 <ryankemper@puppetmaster1001> conftool action : set/weight=10:pooled=no; selector: name=wdqs2010.* [production]
22:35 <bking@cumin2002> START - Cookbook sre.wdqs.data-reload [production]
22:32 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
22:30 <bking@cumin2002> START - Cookbook sre.wdqs.data-reload [production]
22:29 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
22:29 <bking@cumin2002> START - Cookbook sre.wdqs.data-reload [production]
22:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P42539 and previous config saved to /var/cache/conftool/dbconfig/20221207-222934-ladsgroup.json [production]
22:29 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
22:28 <bking@cumin2002> START - Cookbook sre.wdqs.data-reload [production]
22:26 <bking@cumin2002> END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) [production]
22:25 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer [production]
22:25 <bking@cumin2002> END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) [production]
22:23 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer [production]
22:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P42538 and previous config saved to /var/cache/conftool/dbconfig/20221207-221427-ladsgroup.json [production]
22:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180 (T322618)', diff saved to https://phabricator.wikimedia.org/P42537 and previous config saved to /var/cache/conftool/dbconfig/20221207-220110-ladsgroup.json [production]
21:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T322618)', diff saved to https://phabricator.wikimedia.org/P42536 and previous config saved to /var/cache/conftool/dbconfig/20221207-215921-ladsgroup.json [production]
21:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1187 (T322618)', diff saved to https://phabricator.wikimedia.org/P42535 and previous config saved to /var/cache/conftool/dbconfig/20221207-215712-ladsgroup.json [production]
21:57 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1187.eqiad.wmnet with reason: Maintenance [production]
21:56 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1187.eqiad.wmnet with reason: Maintenance [production]
21:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T322618)', diff saved to https://phabricator.wikimedia.org/P42534 and previous config saved to /var/cache/conftool/dbconfig/20221207-215651-ladsgroup.json [production]
21:56 <TheresNoTime> UTC late backport window done [production]
21:51 <samtar@deploy1002> backport aborted: (duration: 00m 15s) [production]
21:49 <samtar@deploy1002> Sync cancelled. [production]
21:47 <sukhe> homer "cr*-eqsin*" commit "running homer for Gerrit: 865773" [production]
21:46 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P42533 and previous config saved to /var/cache/conftool/dbconfig/20221207-214603-ladsgroup.json [production]
21:44 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts lvs5003.eqsin.wmnet [production]
21:44 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:44 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: lvs5003.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
21:43 <samtar@deploy1002> samtar and stang: Backport for [[gerrit:865766|specieswiki: Install GeoData extension (T324348)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
21:43 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: lvs5003.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
21:41 <samtar@deploy1002> Started scap: Backport for [[gerrit:865766|specieswiki: Install GeoData extension (T324348)]] [production]
21:41 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P42532 and previous config saved to /var/cache/conftool/dbconfig/20221207-214145-ladsgroup.json [production]
21:41 <sukhe@cumin2002> START - Cookbook sre.dns.netbox [production]
21:39 <samtar@deploy1002> Finished scap: Backport for [[gerrit:865737|Remove Research Incentive survey from frwiki (T321930)]] (duration: 09m 04s) [production]
21:36 <sukhe@cumin2002> START - Cookbook sre.hosts.decommission for hosts lvs5003.eqsin.wmnet [production]
21:36 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs5003.eqsin.wmnet with reason: downtimed, in the process of decom [production]
21:36 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 4:00:00 on lvs5003.eqsin.wmnet with reason: downtimed, in the process of decom [production]
21:34 <sukhe> homer "cr*-eqsin*" commit "running homer for Gerrit: 865742" [production]
21:32 <samtar@deploy1002> samtar and dani: Backport for [[gerrit:865737|Remove Research Incentive survey from frwiki (T321930)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet [production]
21:32 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs5006.eqsin.wmnet with OS buster [production]
21:32 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin2002" [production]
21:30 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin2002" [production]
21:30 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P42530 and previous config saved to /var/cache/conftool/dbconfig/20221207-213057-ladsgroup.json [production]
21:30 <samtar@deploy1002> Started scap: Backport for [[gerrit:865737|Remove Research Incentive survey from frwiki (T321930)]] [production]
21:28 <samtar@deploy1002> Finished scap: Backport for [[gerrit:865070|hewiki: enable parser cache writes for parsoid's page/html endpoint. (T322672 T320534 T320529)]], [[gerrit:865071|Page 5% of calls to parsoid's page/html endpoint write to PC (T322672)]] (duration: 20m 35s) [production]