2024-06-17
§
|
22:42 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1041.eqiad.wmnet with OS bookworm |
[production] |
22:40 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on eqiad1, with recreate False, for hosts list: ['cloudvirt1041'] |
[cloudvirt-canary] |
22:40 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1041'] |
[cloudvirt-canary] |
22:38 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=99) on eqiad1, with recreate False, for hosts list: ['cloudvirt1041'] |
[cloudvirt-canary] |
22:38 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1041'] |
[cloudvirt-canary] |
22:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T352010)', diff saved to https://phabricator.wikimedia.org/P65121 and previous config saved to /var/cache/conftool/dbconfig/20240617-223010-ladsgroup.json |
[production] |
22:28 |
<cdobbins@cumin1002> |
START - Cookbook sre.hosts.reimage for host cp4043.ulsfo.wmnet with OS bullseye |
[production] |
22:26 |
<cdobbins@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4043.ulsfo.wmnet with OS bullseye |
[production] |
22:25 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching cassandra-dev200[2-3].codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
22:15 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1041.eqiad.wmnet with reason: host reimage |
[production] |
22:15 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P65120 and previous config saved to /var/cache/conftool/dbconfig/20240617-221503-ladsgroup.json |
[production] |
22:12 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1041.eqiad.wmnet with reason: host reimage |
[production] |
22:11 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching cassandra-dev200[2-3].codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
22:05 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching cassandra-dev2001.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
21:59 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P65119 and previous config saved to /var/cache/conftool/dbconfig/20240617-215956-ladsgroup.json |
[production] |
21:58 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching cassandra-dev2001.codfw.wmnet: Apply Cassandra upgrade to 4.1.5 — T354970 - eevans@cumin1002 |
[production] |
21:55 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt1041.eqiad.wmnet with OS bookworm |
[production] |
21:55 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on eqiad1, with recreate False, for hosts list: ['cloudvirt1040'] |
[cloudvirt-canary] |
21:55 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1040'] |
[cloudvirt-canary] |
21:47 |
<brennen> |
gitlab: set phorge integration as default instance-wide issue tracker (T337570) - this may have knock-on effects |
[releng] |
21:44 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T352010)', diff saved to https://phabricator.wikimedia.org/P65118 and previous config saved to /var/cache/conftool/dbconfig/20240617-214449-ladsgroup.json |
[production] |
21:41 |
<cdobbins@cumin1002> |
START - Cookbook sre.hosts.reimage for host cp4043.ulsfo.wmnet with OS bullseye |
[production] |
21:20 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1040.eqiad.wmnet with OS bookworm |
[production] |
21:09 |
<cdobbins@cumin1002> |
conftool action : set/pooled=no; selector: name=cp4043.ulsfo.wmnet |
[production] |
21:09 |
<cdobbins@cumin1002> |
conftool action : set/pooled=no; selector: name=4043.ulsfo.wmnet |
[production] |
21:05 |
<jforrester@deploy1002> |
Finished scap: Backport for [[gerrit:1046767|Fix styles for new heading HTML (T367468)]] (duration: 18m 57s) |
[production] |
20:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1174 (T364069)', diff saved to https://phabricator.wikimedia.org/P65117 and previous config saved to /var/cache/conftool/dbconfig/20240617-205955-marostegui.json |
[production] |
20:59 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1174.eqiad.wmnet with reason: Maintenance |
[production] |
20:59 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1174.eqiad.wmnet with reason: Maintenance |
[production] |
20:55 |
<jforrester@deploy1002> |
jforrester: Continuing with sync |
[production] |
20:52 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: host reimage |
[production] |
20:50 |
<jforrester@deploy1002> |
jforrester: Backport for [[gerrit:1046767|Fix styles for new heading HTML (T367468)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:50 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: host reimage |
[production] |
20:46 |
<jforrester@deploy1002> |
Started scap: Backport for [[gerrit:1046767|Fix styles for new heading HTML (T367468)]] |
[production] |
20:34 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt1040.eqiad.wmnet with OS bookworm |
[production] |
20:33 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1039.eqiad.wmnet with OS bookworm |
[production] |
20:32 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on eqiad1, with recreate False, for hosts list: ['cloudvirt1039'] |
[cloudvirt-canary] |
20:32 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate False, for hosts list: ['cloudvirt1039'] |
[cloudvirt-canary] |
20:10 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1041.eqiad.wmnet' (T364457) |
[admin] |
20:08 |
<brett@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp4042.ulsfo.wmnet |
[production] |
20:07 |
<jforrester@deploy1002> |
jforrester: Continuing with sync |
[production] |
20:06 |
<jforrester@deploy1002> |
jforrester: Backport for [[gerrit:1041659|[wikifunctionswiki] Remove right to promote/demote sysops and bureaucrats from staff (T365627)]], [[gerrit:1039767|Add a note that you cannot change wgCategoryCollation easily (T362494 T366809)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:06 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1039.eqiad.wmnet with reason: host reimage |
[production] |
20:06 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4042.ulsfo.wmnet with OS bullseye |
[production] |
20:03 |
<andrewbogott> |
repaced ovs hosts in the 'ceph' aggregate |
[admin] |
20:02 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1039.eqiad.wmnet with reason: host reimage |
[production] |
20:01 |
<jforrester@deploy1002> |
Started scap: Backport for [[gerrit:1041659|[wikifunctionswiki] Remove right to promote/demote sysops and bureaucrats from staff (T365627)]], [[gerrit:1039767|Add a note that you cannot change wgCategoryCollation easily (T362494 T366809)]] |
[production] |
19:55 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1041.eqiad.wmnet' (T364457) |
[admin] |
19:55 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1040.eqiad.wmnet' (T364457) |
[admin] |
19:55 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2204 (T352010)', diff saved to https://phabricator.wikimedia.org/P65116 and previous config saved to /var/cache/conftool/dbconfig/20240617-195520-ladsgroup.json |
[production] |