401-450 of 10000 results (95ms)
2024-08-01 ยง
13:00 <urbanecm@deploy1003> helmfile [staging] DONE helmfile.d/services/linkrecommendation: sync [production]
12:59 <urbanecm@deploy1003> helmfile [staging] START helmfile.d/services/linkrecommendation: sync [production]
12:59 <urbanecm@deploy1003> helmfile [codfw] DONE helmfile.d/services/linkrecommendation: sync [production]
12:58 <urbanecm@deploy1003> helmfile [codfw] START helmfile.d/services/linkrecommendation: sync [production]
12:55 <urbanecm@deploy1003> helmfile [eqiad] DONE helmfile.d/services/linkrecommendation: sync [production]
12:55 <urbanecm@deploy1003> helmfile [eqiad] START helmfile.d/services/linkrecommendation: sync [production]
12:55 <urbanecm@deploy1003> helmfile [eqiad] DONE helmfile.d/services/linkrecommendation: apply [production]
12:55 <urbanecm@deploy1003> helmfile [eqiad] START helmfile.d/services/linkrecommendation: apply [production]
12:52 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host gerrit2003.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
12:40 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on alert2002.wikimedia.org with reason: host reimage [production]
12:39 <urbanecm> Decommission Add Link models for akwiki, nawiki (T371598) [production]
12:37 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on alert2002.wikimedia.org with reason: host reimage [production]
12:26 <isaranto@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:19 <urbanecm> [urbanecm@mwmaint1002 ~]$ mwscript extensions/GrowthExperiments/maintenance/revalidateLinkRecommendations.php --wiki=dewiki --olderThan=1721045915 --verbose # T371597 [production]
12:18 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host alert2002.wikimedia.org with OS bookworm [production]
12:10 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['alert2002'] [production]
12:10 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['vrts2002'] [production]
12:10 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['alert2002'] [production]
12:10 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['vrts2002'] [production]
12:09 <cgoubert@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) for host kubestage1003.eqiad.wmnet [production]
12:09 <cgoubert@cumin1002> START - Cookbook sre.k8s.pool-depool-node for host kubestage1003.eqiad.wmnet [production]
12:09 <cgoubert@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) for host kubestage1003.eqiad.wmnet [production]
12:06 <cgoubert@cumin1002> START - Cookbook sre.k8s.pool-depool-node for host kubestage1003.eqiad.wmnet [production]
11:49 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host vrts2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
11:49 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host alert2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
11:48 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host vrts2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
11:48 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host alert2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
11:48 <kevinbazira@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
11:31 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P67192 and previous config saved to /var/cache/conftool/dbconfig/20240801-113108-root.json [production]
11:16 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P67191 and previous config saved to /var/cache/conftool/dbconfig/20240801-111602-root.json [production]
11:00 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P67190 and previous config saved to /var/cache/conftool/dbconfig/20240801-110057-root.json [production]
10:45 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P67189 and previous config saved to /var/cache/conftool/dbconfig/20240801-104551-root.json [production]
10:30 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P67188 and previous config saved to /var/cache/conftool/dbconfig/20240801-103046-root.json [production]
10:15 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P67187 and previous config saved to /var/cache/conftool/dbconfig/20240801-101541-root.json [production]
10:00 <marostegui@cumin1002> dbctl commit (dc=all): 'db1233 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P67186 and previous config saved to /var/cache/conftool/dbconfig/20240801-100035-root.json [production]
09:54 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephosd1035.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:44 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1035.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:36 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephmon1006.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:31 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1233.eqiad.wmnet with reason: Maintenance [production]
09:31 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on db1233.eqiad.wmnet with reason: Maintenance [production]
09:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1233', diff saved to https://phabricator.wikimedia.org/P67185 and previous config saved to /var/cache/conftool/dbconfig/20240801-093123-marostegui.json [production]
09:27 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephmon1006.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:24 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephmon1005.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:16 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephmon1005.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:08 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephmon1004.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:00 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephmon1004.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
08:57 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2230.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
08:55 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host db2230.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
08:49 <ayounsi@cumin1002> END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) [production]
08:48 <ayounsi@cumin1002> START - Cookbook sre.postgresql.postgres-init [production]