2024-09-23
18:47 <jgleeson> SmashPig upgraded from 02ba8a7e to 697344d7 [production]
17:06 <ebernhardson@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:05 <ebernhardson@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
16:53 <jgleeson> SmashPig upgraded from ac85ad1d to 02ba8a7e [production]
16:49 <ebernhardson@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
16:49 <ebernhardson@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
16:43 <ebernhardson@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
16:43 <ebernhardson@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
16:43 <pt1979@cumin1002> START - Cookbook sre.hosts.dhcp for host db1246.eqiad.wmnet [production]
16:38 <ryankemper@cumin2002> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
16:38 <pt1979@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1246.eqiad.wmnet with OS bookworm [production]
16:08 <elukey@puppetserver1001> conftool action : set/pooled=no; selector: name=registry1003.eqiad.wmnet,service=docker-registry,dc=eqiad [production]
16:08 <elukey@puppetserver1001> conftool action : set/weight=10; selector: name=registry1005.eqiad.wmnet,service=docker-registry,dc=eqiad [production]
16:08 <elukey@puppetserver1001> conftool action : set/pooled=yes; selector: name=registry1005.eqiad.wmnet,service=docker-registry,dc=eqiad [production]
16:05 <dcausse@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
16:05 <dcausse@deploy1003> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
16:04 <elukey@puppetserver1001> conftool action : set/pooled=true,weight=10; selector: name=registry1005.eqiad.wmnet,service=docker-registry,dc=eqiad [production]
16:03 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
16:03 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
15:59 <dcausse@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
15:59 <dcausse@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
15:46 <pt1979@cumin1002> START - Cookbook sre.hosts.reimage for host db1246.eqiad.wmnet with OS bookworm [production]
15:35 <ryankemper@cumin2002> START - Cookbook sre.wdqs.restart [production]
15:20 <stevemunene@deploy1003> Finished deploy [wdqs/wdqs@316bf7f]: allow 3 new endpoints T364233 T368085 T374195 (duration: 05m 51s) [production]
15:14 <stevemunene@deploy1003> Started deploy [wdqs/wdqs@316bf7f]: allow 3 new endpoints T364233 T368085 T374195 [production]
15:03 <volans@cumin1002> dbctl commit (dc=all): 'emergency failover pc3 to pc1015', diff saved to https://phabricator.wikimedia.org/P69396 and previous config saved to /var/cache/conftool/dbconfig/20240923-150320-volans.json [production]
14:51 <moritzm> powercycle pc1013 (DIMM error in DIMM_A9) [production]
14:50 <elukey@puppetserver1001> conftool action : set/pooled=true,weight=10; selector: name=registry1005.eqiad.wmnet,service=docker-registry,dc=eqiad [production]
14:49 <elukey@puppetserver1001> conftool action : set/pooled=true; selector: name=registry1005.eqiad.wmnet [production]
14:49 <elukey@puppetserver1001> conftool action : set/weight=10; selector: name=registry1005.eqiad.wmnet [production]
14:48 <elukey@puppetserver1001> conftool action : set/pooled=true; selector: name=registry1005.eqiad.wmnet [production]
14:38 <stevemunene@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1176.eqiad.wmnet with OS bullseye [production]
14:31 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host registry1005.eqiad.wmnet with OS bookworm [production]
14:30 <sukhe> sudo cumin 'O:wikidough' 'run-puppet-agent' [production]
14:30 <jynus> restarting and moving replication source of pc1015 T375382 [production]
14:16 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-codfw [production]
14:16 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr2-codfw [production]
14:16 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on registry1005.eqiad.wmnet with reason: host reimage [production]
14:15 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr1-codfw [production]
14:15 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr1-codfw [production]
14:14 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-esams [production]
14:14 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr2-esams [production]
14:14 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device asw1-by27-esams [production]
14:13 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device asw1-by27-esams [production]
14:12 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on registry1005.eqiad.wmnet with reason: host reimage [production]
14:04 <xcollazo@deploy1003> Finished deploy [airflow-dags/analytics@3e2d3b8]: Deploy latest DAGs to analytics Airflow instance. T369868. (duration: 00m 48s) [production]
14:03 <xcollazo@deploy1003> Started deploy [airflow-dags/analytics@3e2d3b8]: Deploy latest DAGs to analytics Airflow instance. T369868. [production]
14:02 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device asw1-bw27-esams [production]
14:02 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device asw1-bw27-esams [production]
14:00 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr1-esams [production]