2024-08-26 §
13:53 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2013.codfw.wmnet with OS bullseye [production]
13:53 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T370903)', diff saved to https://phabricator.wikimedia.org/P67786 and previous config saved to /var/cache/conftool/dbconfig/20240826-135301-ladsgroup.json [production]
13:52 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on rpki2003.codfw.wmnet with reason: host reimage [production]
13:52 <cgoubert@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker2013.codfw.wmnet [production]
13:51 <cgoubert@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker2013.codfw.wmnet [production]
13:50 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1180 (T370903)', diff saved to https://phabricator.wikimedia.org/P67785 and previous config saved to /var/cache/conftool/dbconfig/20240826-135052-ladsgroup.json [production]
13:50 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1180.eqiad.wmnet with reason: Maintenance [production]
13:50 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1180.eqiad.wmnet with reason: Maintenance [production]
13:50 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67784 and previous config saved to /var/cache/conftool/dbconfig/20240826-135031-ladsgroup.json [production]
13:45 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1064390|use shellbox-video globally (adding group2, including commons) (T356241)]] (duration: 08m 04s) [production]
13:45 <Dreamy_Jazz> Started 6hr maximum scan on nowiki - https://wikitech.wikimedia.org/wiki/MediaModeration [production]
13:41 <urbanecm@deploy1003> hnowlan, urbanecm: Continuing with sync [production]
13:40 <urbanecm@deploy1003> hnowlan, urbanecm: Backport for [[gerrit:1064390|use shellbox-video globally (adding group2, including commons) (T356241)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:37 <urbanecm@deploy1003> Started scap sync-world: Backport for [[gerrit:1064390|use shellbox-video globally (adding group2, including commons) (T356241)]] [production]
13:36 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1064795|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394|scripts: add script for running jobs from stdin rather than http (T369048)]] (duration: 26m 53s) [production]
13:35 <ayounsi@cumin1002> START - Cookbook sre.hosts.reimage for host rpki2003.codfw.wmnet with OS bookworm [production]
13:35 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P67783 and previous config saved to /var/cache/conftool/dbconfig/20240826-133524-ladsgroup.json [production]
13:34 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM rpki2003.codfw.wmnet - ayounsi@cumin1002" [production]
13:34 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM rpki2003.codfw.wmnet - ayounsi@cumin1002" [production]
13:34 <sukhe@cumin1002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on A:cp-eqsin and A:cp for 9.2.5-1wm2 [production]
13:34 <ayounsi@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) rpki2003.codfw.wmnet on all recursors [production]
13:34 <ayounsi@cumin1002> START - Cookbook sre.dns.wipe-cache rpki2003.codfw.wmnet on all recursors [production]
13:34 <ayounsi@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:34 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM rpki2003.codfw.wmnet - ayounsi@cumin1002" [production]
13:34 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM rpki2003.codfw.wmnet - ayounsi@cumin1002" [production]
13:30 <ayounsi@cumin1002> START - Cookbook sre.dns.netbox [production]
13:30 <ayounsi@cumin1002> START - Cookbook sre.ganeti.makevm for new host rpki2003.codfw.wmnet [production]
13:29 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host rpki2003.codfw.wmnet [production]
13:29 <ayounsi@cumin1002> START - Cookbook sre.ganeti.makevm for new host rpki2003.codfw.wmnet [production]
13:28 <urbanecm@deploy1003> hnowlan, urbanecm, ihurbain: Continuing with sync [production]
13:27 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1190 (T371742)', diff saved to https://phabricator.wikimedia.org/P67782 and previous config saved to /var/cache/conftool/dbconfig/20240826-132738-ladsgroup.json [production]
13:27 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1190.eqiad.wmnet with reason: Maintenance [production]
13:27 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1190.eqiad.wmnet with reason: Maintenance [production]
13:24 <urbanecm@deploy1003> hnowlan, urbanecm, ihurbain: Backport for [[gerrit:1064795|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394|scripts: add script for running jobs from stdin rather than http (T369048)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:20 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P67781 and previous config saved to /var/cache/conftool/dbconfig/20240826-132016-ladsgroup.json [production]
13:09 <urbanecm@deploy1003> Started scap sync-world: Backport for [[gerrit:1064795|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394|scripts: add script for running jobs from stdin rather than http (T369048)]] [production]
13:07 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply [production]
13:06 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/shellbox-video: apply [production]
13:06 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
13:05 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
13:05 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67780 and previous config saved to /var/cache/conftool/dbconfig/20240826-130510-ladsgroup.json [production]
13:04 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67779 and previous config saved to /var/cache/conftool/dbconfig/20240826-130401-ladsgroup.json [production]
13:03 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1173.eqiad.wmnet with reason: Maintenance [production]
13:03 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1173.eqiad.wmnet with reason: Maintenance [production]
13:03 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67778 and previous config saved to /var/cache/conftool/dbconfig/20240826-130350-ladsgroup.json [production]
12:48 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P67777 and previous config saved to /var/cache/conftool/dbconfig/20240826-124843-ladsgroup.json [production]
12:33 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P67776 and previous config saved to /var/cache/conftool/dbconfig/20240826-123336-ladsgroup.json [production]
12:32 <arnaudb@cumin1002> dbctl commit (dc=all): 'Weight db2214 T373174', diff saved to https://phabricator.wikimedia.org/P67775 and previous config saved to /var/cache/conftool/dbconfig/20240826-123205-arnaudb.json [production]
12:29 <arnaudb@cumin1002> dbctl commit (dc=all): 'Promote db2129 to s6 primary T373174', diff saved to https://phabricator.wikimedia.org/P67774 and previous config saved to /var/cache/conftool/dbconfig/20240826-122925-arnaudb.json [production]
12:28 <arnaudb> Starting s6 codfw failover from db2214 to db2129 - T373174 [production]