2024-05-13
ยง
|
09:46 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/thumbor: sync |
[production] |
09:39 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2184.codfw.wmnet with OS bookworm |
[production] |
09:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1200 (T364299)', diff saved to https://phabricator.wikimedia.org/P62338 and previous config saved to /var/cache/conftool/dbconfig/20240513-093200-marostegui.json |
[production] |
09:28 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: sync |
[production] |
09:27 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/thumbor: sync |
[production] |
09:23 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2184.codfw.wmnet with reason: host reimage |
[production] |
09:20 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2184.codfw.wmnet with reason: host reimage |
[production] |
09:05 |
<jynus> |
deploy new stat grants at m1:dbbackups T362509 |
[production] |
09:03 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2184.codfw.wmnet with OS bookworm |
[production] |
09:02 |
<marostegui@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db2184.codfw.wmnet with OS bookworm |
[production] |
09:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1200 (T364299)', diff saved to https://phabricator.wikimedia.org/P62337 and previous config saved to /var/cache/conftool/dbconfig/20240513-090035-marostegui.json |
[production] |
09:00 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1200.eqiad.wmnet with reason: Maintenance |
[production] |
09:00 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1200.eqiad.wmnet with reason: Maintenance |
[production] |
09:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185 (T364299)', diff saved to https://phabricator.wikimedia.org/P62336 and previous config saved to /var/cache/conftool/dbconfig/20240513-090011-marostegui.json |
[production] |
09:00 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts snapshot1009.eqiad.wmnet |
[production] |
09:00 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
09:00 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: snapshot1009.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1002" |
[production] |
08:58 |
<btullis@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: snapshot1009.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1002" |
[production] |
08:56 |
<btullis@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
08:53 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2184.codfw.wmnet with OS bookworm |
[production] |
08:51 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts snapshot1009.eqiad.wmnet |
[production] |
08:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P62335 and previous config saved to /var/cache/conftool/dbconfig/20240513-084503-marostegui.json |
[production] |
08:45 |
<marostegui@deploy1002> |
Finished scap: Backport for [[gerrit:1029109|db-production.php: Enable writes on es6 and es7 (T364446)]] (duration: 44m 00s) |
[production] |
08:32 |
<marostegui@deploy1002> |
marostegui: Continuing with sync |
[production] |
08:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P62334 and previous config saved to /var/cache/conftool/dbconfig/20240513-082956-marostegui.json |
[production] |
08:24 |
<moritzm> |
installing PHP 7.3 security updates |
[production] |
08:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185 (T364299)', diff saved to https://phabricator.wikimedia.org/P62333 and previous config saved to /var/cache/conftool/dbconfig/20240513-081448-marostegui.json |
[production] |
08:03 |
<marostegui@deploy1002> |
marostegui: Backport for [[gerrit:1029109|db-production.php: Enable writes on es6 and es7 (T364446)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
08:01 |
<marostegui@deploy1002> |
Started scap: Backport for [[gerrit:1029109|db-production.php: Enable writes on es6 and es7 (T364446)]] |
[production] |
08:00 |
<moritzm> |
installing python2.7 security updates |
[production] |
07:58 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:1030866|Fix static cache access (T364693)]] (duration: 16m 54s) |
[production] |
07:54 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 17451 |
[production] |
07:53 |
<moritzm> |
installing libgd2 security updates |
[production] |
07:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2213 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P62332 and previous config saved to /var/cache/conftool/dbconfig/20240513-075256-root.json |
[production] |
07:46 |
<ladsgroup@deploy1002> |
ladsgroup: Continuing with sync |
[production] |
07:44 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. |
[production] |
07:44 |
<ladsgroup@deploy1002> |
ladsgroup: Backport for [[gerrit:1030866|Fix static cache access (T364693)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:41 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:1030866|Fix static cache access (T364693)]] |
[production] |
07:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1185 (T364299)', diff saved to https://phabricator.wikimedia.org/P62331 and previous config saved to /var/cache/conftool/dbconfig/20240513-074103-marostegui.json |
[production] |
07:40 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
07:40 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
07:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1183 (T364299)', diff saved to https://phabricator.wikimedia.org/P62330 and previous config saved to /var/cache/conftool/dbconfig/20240513-074041-marostegui.json |
[production] |
07:38 |
<brouberol@cumin2002> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. |
[production] |
07:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2213 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P62329 and previous config saved to /var/cache/conftool/dbconfig/20240513-073750-root.json |
[production] |
07:37 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:1025300|ContentTranslation: Update publishing setting for cswiki (T353049)]] (duration: 32m 03s) |
[production] |
07:35 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'configure' for AS: 17451 |
[production] |
07:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1158 (T352010)', diff saved to https://phabricator.wikimedia.org/P62328 and previous config saved to /var/cache/conftool/dbconfig/20240513-073031-ladsgroup.json |
[production] |
07:30 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
07:30 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
07:30 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1158.eqiad.wmnet with reason: Maintenance |
[production] |