5301-5350 of 10000 results (37ms)
2025-01-30 ยง
14:43 <aokoth@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
14:43 <aokoth@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
14:41 <marostegui@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72877 and previous config saved to /var/cache/conftool/dbconfig/20250130-144115-root.json [production]
14:37 <hashar@deploy2002> Started scap sync-world: Backport for [[gerrit:1115383|SuggestedEditSession: remove incorrect cast to integer (T385117)]], [[gerrit:1115384|SuggestedEditSession: remove incorrect cast to integer (T385117)]] [production]
14:36 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2194', diff saved to https://phabricator.wikimedia.org/P72876 and previous config saved to /var/cache/conftool/dbconfig/20250130-143607-marostegui.json [production]
14:33 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1036.eqiad.wmnet}' (T384946) [admin]
14:33 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=97) on hosts matched by 'D{cloudvirt1036.eqiad.wmnet}' (T384946) [admin]
14:32 <aokoth@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
14:30 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) [admin]
14:26 <marostegui@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72875 and previous config saved to /var/cache/conftool/dbconfig/20250130-142609-root.json [production]
14:25 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack [admin]
14:22 <aokoth@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
14:21 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2194', diff saved to https://phabricator.wikimedia.org/P72873 and previous config saved to /var/cache/conftool/dbconfig/20250130-142100-marostegui.json [production]
14:20 <aokoth@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
14:19 <jayme> stopped puppet on all kubernetes hosts [production]
14:18 <hashar@deploy2002> Finished scap sync-world: Backport for [[gerrit:1115062|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115059|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115336|migrateConfigToCommunity: Include an edit summary (T385024)]], [[gerrit:1115337|migrateConfigToCommunity: Include an edit summary (T385024)]] (duration: 11m 19s) [production]
14:11 <hashar@deploy2002> urbanecm, hashar: Continuing with sync [production]
14:11 <marostegui@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72872 and previous config saved to /var/cache/conftool/dbconfig/20250130-141104-root.json [production]
14:10 <aokoth@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
14:10 <hashar@deploy2002> urbanecm, hashar: Backport for [[gerrit:1115062|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115059|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115336|migrateConfigToCommunity: Include an edit summary (T385024)]], [[gerrit:1115337|migrateConfigToCommunity: Include an edit summary (T385024)]] synced to the testservers (https://wik [production]
14:06 <hashar@deploy2002> Started scap sync-world: Backport for [[gerrit:1115062|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115059|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115336|migrateConfigToCommunity: Include an edit summary (T385024)]], [[gerrit:1115337|migrateConfigToCommunity: Include an edit summary (T385024)]] [production]
14:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2194 (T384592)', diff saved to https://phabricator.wikimedia.org/P72871 and previous config saved to /var/cache/conftool/dbconfig/20250130-140553-marostegui.json [production]
14:04 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2029.codfw.wmnet [production]
13:55 <marostegui@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72870 and previous config saved to /var/cache/conftool/dbconfig/20250130-135559-root.json [production]
13:50 <jayme@cumin1002> START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster staging-codfw: Kubernetes upgrade [production]
13:50 <marostegui@cumin1002> dbctl commit (dc=all): 'db2224 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72869 and previous config saved to /var/cache/conftool/dbconfig/20250130-135024-root.json [production]
13:45 <elukey@deploy2002> helmfile [eqiad] DONE helmfile.d/services/kartotherian: sync [production]
13:44 <elukey@deploy2002> helmfile [eqiad] START helmfile.d/services/kartotherian: sync [production]
13:38 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.set_maintenance (exit_code=0) [admin]
13:38 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.set_maintenance [admin]
13:37 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.set_maintenance (exit_code=0) [admin]
13:37 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.set_maintenance [admin]
13:37 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.set_maintenance (exit_code=0) [admin]
13:36 <hashar@deploy2002> Finished scap sync-world: Backport for [[gerrit:1115344|Fix response error handling in FlickrBlacklist (T385143)]] (duration: 11m 54s) [production]
13:36 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.set_maintenance [admin]
13:36 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.set_maintenance (exit_code=0) [admin]
13:35 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.set_maintenance [admin]
13:35 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.set_maintenance (exit_code=0) [admin]
13:35 <marostegui@cumin1002> dbctl commit (dc=all): 'db2224 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72868 and previous config saved to /var/cache/conftool/dbconfig/20250130-133519-root.json [production]
13:34 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.set_maintenance [admin]
13:30 <hashar@deploy2002> hashar: Continuing with sync [production]
13:27 <hashar@deploy2002> hashar: Backport for [[gerrit:1115344|Fix response error handling in FlickrBlacklist (T385143)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:24 <hashar@deploy2002> Started scap sync-world: Backport for [[gerrit:1115344|Fix response error handling in FlickrBlacklist (T385143)]] [production]
13:20 <marostegui@cumin1002> dbctl commit (dc=all): 'db2224 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72867 and previous config saved to /var/cache/conftool/dbconfig/20250130-132014-root.json [production]
13:15 <aokoth@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
13:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage [production]
13:06 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage [production]
13:05 <marostegui@cumin1002> dbctl commit (dc=all): 'db2224 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72866 and previous config saved to /var/cache/conftool/dbconfig/20250130-130509-root.json [production]
13:05 <aokoth@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
13:03 <jayme@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on 7 hosts with reason: K8s update [production]