1301-1350 of 10000 results (97ms)
2024-06-11 ยง
18:15 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
18:15 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
18:15 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2159.codfw.wmnet with reason: Maintenance [production]
18:14 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2159.codfw.wmnet with reason: Maintenance [production]
18:14 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150 (T364069)', diff saved to https://phabricator.wikimedia.org/P64641 and previous config saved to /var/cache/conftool/dbconfig/20240611-181448-marostegui.json [production]
18:10 <brennen> 1.43.0-wmf.9 train (T361403): no blockers, rolling to group0 [production]
18:08 <ejegg> stopped fundraising scheduled jobs [production]
17:59 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P64640 and previous config saved to /var/cache/conftool/dbconfig/20240611-175941-marostegui.json [production]
17:59 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
17:58 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
17:56 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
17:56 <taavi@deploy1002> Finished scap: Backport for [[gerrit:1038750|wikitech: Stop loading OpenStackManager (T161553 T338477 T359544)]] (duration: 12m 00s) [production]
17:56 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
17:47 <taavi@deploy1002> taavi: Continuing with sync [production]
17:46 <taavi@deploy1002> taavi: Backport for [[gerrit:1038750|wikitech: Stop loading OpenStackManager (T161553 T338477 T359544)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
17:45 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
17:45 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
17:44 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P64639 and previous config saved to /var/cache/conftool/dbconfig/20240611-174434-marostegui.json [production]
17:44 <taavi@deploy1002> Started scap: Backport for [[gerrit:1038750|wikitech: Stop loading OpenStackManager (T161553 T338477 T359544)]] [production]
17:37 <rzl@deploy1002> Finished scap: (no justification provided) (duration: 11m 40s) [production]
17:33 <rzl> rzl@cumin2002:~$ sudo cumin 'C:profile::mediawiki::webserver' 'enable-puppet T366649' [production]
17:33 <rzl@deploy1002> rzl: Continuing with sync [production]
17:30 <rzl@deploy1002> rzl: (no justification provided) synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
17:29 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150 (T364069)', diff saved to https://phabricator.wikimedia.org/P64638 and previous config saved to /var/cache/conftool/dbconfig/20240611-172928-marostegui.json [production]
17:26 <rzl@deploy1002> Started scap: (no justification provided) [production]
17:14 <rzl> rzl@cumin2002:~$ sudo cumin 'C:profile::mediawiki::webserver' 'disable-puppet T366649' [production]
17:11 <ejegg> fundraising civicrm upgraded from ebfbad86 to 7252b1b9 [production]
17:09 <ebernhardson@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:09 <ebernhardson@deploy1002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:09 <kamila@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye [production]
17:08 <ebernhardson@deploy1002> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:08 <ebernhardson@deploy1002> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:04 <ebernhardson@deploy1002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:04 <ebernhardson@deploy1002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:04 <bking@cumin2002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in search_eqiad [production]
17:04 <bking@cumin2002> START - Cookbook sre.elasticsearch.ban Unbanning all hosts in search_eqiad [production]
16:59 <kamila@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye [production]
16:56 <kamila@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye [production]
16:56 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply [production]
16:56 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply [production]
16:53 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply [production]
16:53 <brouberol@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply [production]
16:51 <kamila@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye [production]
16:47 <ryankemper@cumin2002> END (PASS) - Cookbook sre.hadoop.reboot-workers (exit_code=0) for Hadoop test cluster [production]
16:40 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:restbase-codfw [production]
16:37 <kamila@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye [production]
16:36 <kamila@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye [production]
16:35 <ebernhardson@deploy1002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
16:35 <ebernhardson@deploy1002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
16:33 <kamila@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "updated wikikube-ctrl1002 status - kamila@cumin1002 - T366204" [production]