8651-8700 of 10000 results (39ms)
2023-02-14 ยง
14:41 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1001.eqiad.wmnet with OS bullseye [production]
14:41 <andrew@cumin2002> END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - andrew@cumin2002" [production]
14:41 <elukey@cumin1001> START - Cookbook sre.k8s.upgrade-cluster Upgrade K8s version: Upgrade ml-staging-codfw cluster to 1.23 [production]
14:40 <andrew@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - andrew@cumin2002" [production]
14:34 <taavi> disable webservice failing to start, left the maintainer a talk page message [tools.pmidtool]
14:30 <jgiannelos@deploy1002> helmfile [codfw] DONE helmfile.d/services/proton: apply [production]
14:28 <jgiannelos@deploy1002> helmfile [codfw] START helmfile.d/services/proton: apply [production]
14:28 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]
14:26 <jgiannelos@deploy1002> helmfile [eqiad] START helmfile.d/services/proton: apply [production]
14:25 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/proton: apply [production]
14:24 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/proton: apply [production]
14:23 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]
14:23 <jgiannelos@deploy1002> helmfile [eqiad] START helmfile.d/services/proton: apply [production]
14:23 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/proton: apply [production]
14:23 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/proton: apply [production]
14:22 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/proton: apply [production]
14:22 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/proton: apply [production]
14:21 <jgiannelos@deploy1002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
14:20 <jgiannelos@deploy1002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
14:19 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
14:18 <jgiannelos@deploy1002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
14:18 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
14:17 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
14:11 <moritzm> installing libde265 security updates [production]
14:04 <zabe> delete deployment-db10 and volume db10 # T329577 [releng]
13:52 <taavi> Disabled cron jobs updating the schedule of a conference in 2019 that were running every minute. Not a great use of shared resources. [tools.germancon-mobile]
13:43 <taavi> disable webservice failing to start due to dependency issues, disabled as maintainer has been inactive and unreachable since 2015 [tools.crosswatch]
13:38 <taavi> disable webservice failing to start due to dependency issues, left the maintainer a talk page message [tools.movestats]
13:29 <taavi> disable webservice failing to start due to dependency issues, left the maintainer a talk page message [tools.facebook-messenger-chatbot]
13:27 <Rook> Bump oauthlib 97f241bacff7af60cebfffa0d8eb945c7545577d [paws]
13:23 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1001.eqiad.wmnet with reason: host reimage [production]
13:20 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1001.eqiad.wmnet with reason: host reimage [production]
13:17 <andrewbogott> restarting all eqiad1 openstack services because that seems to sometimes help things *shrug* [admin]
13:16 <taavi> removed service.manifest, tool is otherwise totally empty so it was failing to start. marked for deletion as it's been inactive since 2017 [tools.hexacore]
13:08 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1001.eqiad.wmnet with OS bullseye [production]
13:07 <marostegui@cumin1001> dbctl commit (dc=all): 'db2181 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P44624 and previous config saved to /var/cache/conftool/dbconfig/20230214-130708-root.json [production]
13:06 <taavi> shut down webservice [tools.my-first-pywikibot-tool]
12:52 <marostegui@cumin1001> dbctl commit (dc=all): 'db2181 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P44623 and previous config saved to /var/cache/conftool/dbconfig/20230214-125203-root.json [production]
12:36 <marostegui@cumin1001> dbctl commit (dc=all): 'db2181 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P44622 and previous config saved to /var/cache/conftool/dbconfig/20230214-123659-root.json [production]
12:21 <marostegui@cumin1001> dbctl commit (dc=all): 'db2181 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P44621 and previous config saved to /var/cache/conftool/dbconfig/20230214-122154-root.json [production]
12:12 <arturo> the fixed webservicemonitor is starting a bunch of grid webservices (T329611) [tools]
12:09 <arturo> included tools-manifests 0.25 in tools-buster aptly repo, deploying it now! (T329611, T329467, T244809) [tools]
12:06 <marostegui@cumin1001> dbctl commit (dc=all): 'db2181 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P44620 and previous config saved to /var/cache/conftool/dbconfig/20230214-120649-root.json [production]
12:02 <arturo> included tools-manifests 0.25 in toolsbeta-buster aptly repo (T329611, T329467, T244809) [toolsbeta]
11:53 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host schema2004.codfw.wmnet [production]
11:51 <marostegui@cumin1001> dbctl commit (dc=all): 'db2181 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P44619 and previous config saved to /var/cache/conftool/dbconfig/20230214-115144-root.json [production]
11:49 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host schema2004.codfw.wmnet [production]
11:47 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host schema2003.codfw.wmnet [production]
11:44 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host schema2003.codfw.wmnet [production]
11:38 <zabe> deployment-db11: start replicating from deployment-db09 # T329577 [releng]