2024-08-26
ยง
|
13:34 |
<sukhe@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on A:cp-eqsin and A:cp for 9.2.5-1wm2 |
[production] |
13:34 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) rpki2003.codfw.wmnet on all recursors |
[production] |
13:34 |
<ayounsi@cumin1002> |
START - Cookbook sre.dns.wipe-cache rpki2003.codfw.wmnet on all recursors |
[production] |
13:34 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
13:34 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM rpki2003.codfw.wmnet - ayounsi@cumin1002" |
[production] |
13:34 |
<ayounsi@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM rpki2003.codfw.wmnet - ayounsi@cumin1002" |
[production] |
13:30 |
<ayounsi@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
13:30 |
<ayounsi@cumin1002> |
START - Cookbook sre.ganeti.makevm for new host rpki2003.codfw.wmnet |
[production] |
13:29 |
<ayounsi@cumin1002> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host rpki2003.codfw.wmnet |
[production] |
13:29 |
<ayounsi@cumin1002> |
START - Cookbook sre.ganeti.makevm for new host rpki2003.codfw.wmnet |
[production] |
13:28 |
<urbanecm@deploy1003> |
hnowlan, urbanecm, ihurbain: Continuing with sync |
[production] |
13:27 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1190 (T371742)', diff saved to https://phabricator.wikimedia.org/P67782 and previous config saved to /var/cache/conftool/dbconfig/20240826-132738-ladsgroup.json |
[production] |
13:27 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1190.eqiad.wmnet with reason: Maintenance |
[production] |
13:27 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1190.eqiad.wmnet with reason: Maintenance |
[production] |
13:24 |
<urbanecm@deploy1003> |
hnowlan, urbanecm, ihurbain: Backport for [[gerrit:1064795|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394|scripts: add script for running jobs from stdin rather than http (T369048)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:20 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P67781 and previous config saved to /var/cache/conftool/dbconfig/20240826-132016-ladsgroup.json |
[production] |
13:09 |
<urbanecm@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1064795|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394|scripts: add script for running jobs from stdin rather than http (T369048)]] |
[production] |
13:07 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply |
[production] |
13:06 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/shellbox-video: apply |
[production] |
13:06 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
13:05 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
13:05 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67780 and previous config saved to /var/cache/conftool/dbconfig/20240826-130510-ladsgroup.json |
[production] |
13:04 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67779 and previous config saved to /var/cache/conftool/dbconfig/20240826-130401-ladsgroup.json |
[production] |
13:03 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1173.eqiad.wmnet with reason: Maintenance |
[production] |
13:03 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1173.eqiad.wmnet with reason: Maintenance |
[production] |
13:03 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67778 and previous config saved to /var/cache/conftool/dbconfig/20240826-130350-ladsgroup.json |
[production] |
12:48 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P67777 and previous config saved to /var/cache/conftool/dbconfig/20240826-124843-ladsgroup.json |
[production] |
12:33 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P67776 and previous config saved to /var/cache/conftool/dbconfig/20240826-123336-ladsgroup.json |
[production] |
12:32 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Weight db2214 T373174', diff saved to https://phabricator.wikimedia.org/P67775 and previous config saved to /var/cache/conftool/dbconfig/20240826-123205-arnaudb.json |
[production] |
12:29 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Promote db2129 to s6 primary T373174', diff saved to https://phabricator.wikimedia.org/P67774 and previous config saved to /var/cache/conftool/dbconfig/20240826-122925-arnaudb.json |
[production] |
12:28 |
<arnaudb> |
Starting s6 codfw failover from db2214 to db2129 - T373174 |
[production] |
12:25 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: Testing |
[production] |
12:25 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: Testing |
[production] |
12:21 |
<godog> |
move to /root unused and about to expire cert on puppetmaster1001:/var/lib/puppet/server/ssl/ca/signed/webperf.discovery.wmnet.pem |
[production] |
12:18 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67773 and previous config saved to /var/cache/conftool/dbconfig/20240826-121828-ladsgroup.json |
[production] |
12:18 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 268434 |
[production] |
12:17 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 268434 |
[production] |
12:17 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 263903 |
[production] |
12:17 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 263903 |
[production] |
12:17 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 61754 |
[production] |
12:17 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 61754 |
[production] |
12:16 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 269115 |
[production] |
12:16 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 269115 |
[production] |
12:16 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 274607 |
[production] |
12:16 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 274607 |
[production] |
12:14 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67772 and previous config saved to /var/cache/conftool/dbconfig/20240826-121419-ladsgroup.json |
[production] |
12:14 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
12:14 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
12:14 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1165 (T370903)', diff saved to https://phabricator.wikimedia.org/P67771 and previous config saved to /var/cache/conftool/dbconfig/20240826-121408-ladsgroup.json |
[production] |
12:12 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox |
[production] |