2024-07-11
ยง
|
09:19 |
<jiji@deploy1002> |
Started scap sync-world: Remove mcrouter container and exporter from mediawiki pods |
[production] |
09:18 |
<ayounsi@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
09:18 |
<ayounsi@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2003.codfw.wmnet |
[production] |
09:13 |
<jiji@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply |
[production] |
09:12 |
<jiji@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply |
[production] |
09:11 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.reimage for host netbox1003.eqiad.wmnet with OS bookworm |
[production] |
09:10 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM netbox1003.eqiad.wmnet - ayounsi@cumin1002" |
[production] |
09:09 |
<ayounsi@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM netbox1003.eqiad.wmnet - ayounsi@cumin1002" |
[production] |
09:09 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox1003.eqiad.wmnet on all recursors |
[production] |
09:09 |
<ayounsi@cumin1002> |
START - Cookbook sre.dns.wipe-cache netbox1003.eqiad.wmnet on all recursors |
[production] |
09:09 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
09:09 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM netbox1003.eqiad.wmnet - ayounsi@cumin1002" |
[production] |
09:08 |
<ayounsi@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM netbox1003.eqiad.wmnet - ayounsi@cumin1002" |
[production] |
09:05 |
<ayounsi@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
09:05 |
<ayounsi@cumin1002> |
START - Cookbook sre.ganeti.makevm for new host netbox1003.eqiad.wmnet |
[production] |
09:05 |
<jiji@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply |
[production] |
09:04 |
<jiji@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-api-int: apply |
[production] |
09:02 |
<jiji@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply |
[production] |
09:00 |
<jiji@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-api-int: apply |
[production] |
08:57 |
<jiji@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-debug: apply |
[production] |
08:57 |
<jiji@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-debug: apply |
[production] |
08:55 |
<jiji@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply |
[production] |
08:55 |
<jiji@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-debug: apply |
[production] |
08:46 |
<elukey> |
cd /srv/git/private; git reset --hard HEAD^ on puppetserver1001 to remove my last local commit (test before migration of the private repo to puppetserver1001) - T368023 |
[production] |
08:41 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
08:41 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
08:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T367856)', diff saved to https://phabricator.wikimedia.org/P66280 and previous config saved to /var/cache/conftool/dbconfig/20240711-084151-marostegui.json |
[production] |
08:30 |
<hashar> |
Switched CI Quibble and Phan jobs based on PHP 8.1, 8.2 and 8.3 from Buster to Bullseye - T335766 T366799 T369146 |
[production] |
08:26 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P66279 and previous config saved to /var/cache/conftool/dbconfig/20240711-082644-marostegui.json |
[production] |
08:15 |
<aklapper@deploy1002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.43.0-wmf.13 refs T366958 |
[production] |
08:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P66278 and previous config saved to /var/cache/conftool/dbconfig/20240711-081137-marostegui.json |
[production] |
08:05 |
<jelto@cumin1002> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab to new version |
[production] |
07:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T367856)', diff saved to https://phabricator.wikimedia.org/P66277 and previous config saved to /var/cache/conftool/dbconfig/20240711-075630-marostegui.json |
[production] |
07:50 |
<marostegui> |
Deploy schema change on s3 codfw db2127 dbmaint T367856 |
[production] |
07:48 |
<dcausse> |
closing the backport window |
[production] |
07:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2127.codfw.wmnet with reason: Long schema change |
[production] |
07:48 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2127.codfw.wmnet with reason: Long schema change |
[production] |
07:47 |
<dcausse@deploy1002> |
Finished scap: Backport for [[gerrit:1053533|Fix pool counter metric]] (duration: 09m 56s) |
[production] |
07:46 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2127 T369691', diff saved to https://phabricator.wikimedia.org/P66276 and previous config saved to /var/cache/conftool/dbconfig/20240711-074629-marostegui.json |
[production] |
07:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db2205 to s3 primary T369691', diff saved to https://phabricator.wikimedia.org/P66275 and previous config saved to /var/cache/conftool/dbconfig/20240711-074534-marostegui.json |
[production] |
07:45 |
<marostegui> |
Starting s3 codfw failover from db2127 to db2205 - T369691 |
[production] |
07:42 |
<dcausse@deploy1002> |
dcausse: Continuing with sync |
[production] |
07:41 |
<dcausse@deploy1002> |
dcausse: Backport for [[gerrit:1053533|Fix pool counter metric]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:37 |
<dcausse@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1053533|Fix pool counter metric]] |
[production] |
07:31 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s3 T369691 |
[production] |
07:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db2205 with weight 0 T369691', diff saved to https://phabricator.wikimedia.org/P66274 and previous config saved to /var/cache/conftool/dbconfig/20240711-073101-root.json |
[production] |
07:30 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance |
[production] |
07:30 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 25 hosts with reason: Primary switchover s3 T369691 |
[production] |
07:30 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance |
[production] |
07:28 |
<jgiannelos@deploy1002> |
Finished scap: Backport for [[gerrit:1053006|Linter: trigger parsoid parses on template changes (T361013)]] (duration: 14m 25s) |
[production] |