101-150 of 10000 results (101ms)
2024-09-23 ยง
16:03 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
16:03 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
15:59 <dcausse@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
15:59 <dcausse@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
15:46 <pt1979@cumin1002> START - Cookbook sre.hosts.reimage for host db1246.eqiad.wmnet with OS bookworm [production]
15:35 <ryankemper@cumin2002> START - Cookbook sre.wdqs.restart [production]
15:20 <stevemunene@deploy1003> Finished deploy [wdqs/wdqs@316bf7f]: allow 3 new endpoints T364233 T368085 T374195 (duration: 05m 51s) [production]
15:14 <stevemunene@deploy1003> Started deploy [wdqs/wdqs@316bf7f]: allow 3 new endpoints T364233 T368085 T374195 [production]
15:03 <volans@cumin1002> dbctl commit (dc=all): 'emergency failover pc3 to pc1015', diff saved to https://phabricator.wikimedia.org/P69396 and previous config saved to /var/cache/conftool/dbconfig/20240923-150320-volans.json [production]
14:51 <moritzm> powercycle pc1013 (DIMM error in DIMM_A9) [production]
14:50 <elukey@puppetserver1001> conftool action : set/pooled=true,weight=10; selector: name=registry1005.eqiad.wmnet,service=docker-registry,dc=eqiad [production]
14:49 <elukey@puppetserver1001> conftool action : set/pooled=true; selector: name=registry1005.eqiad.wmnet [production]
14:49 <elukey@puppetserver1001> conftool action : set/weight=10; selector: name=registry1005.eqiad.wmnet [production]
14:48 <elukey@puppetserver1001> conftool action : set/pooled=true; selector: name=registry1005.eqiad.wmnet [production]
14:38 <stevemunene@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1176.eqiad.wmnet with OS bullseye [production]
14:31 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host registry1005.eqiad.wmnet with OS bookworm [production]
14:30 <sukhe> sudo cumin 'O:wikidough' 'run-puppet-agent' [production]
14:30 <jynus> restarting and moving replication source of pc1015 T375382 [production]
14:16 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-codfw [production]
14:16 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr2-codfw [production]
14:16 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on registry1005.eqiad.wmnet with reason: host reimage [production]
14:15 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr1-codfw [production]
14:15 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr1-codfw [production]
14:14 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-esams [production]
14:14 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr2-esams [production]
14:14 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device asw1-by27-esams [production]
14:13 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device asw1-by27-esams [production]
14:12 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on registry1005.eqiad.wmnet with reason: host reimage [production]
14:04 <xcollazo@deploy1003> Finished deploy [airflow-dags/analytics@3e2d3b8]: Deploy latest DAGs to analytics Airflow instance. T369868. (duration: 00m 48s) [production]
14:03 <xcollazo@deploy1003> Started deploy [airflow-dags/analytics@3e2d3b8]: Deploy latest DAGs to analytics Airflow instance. T369868. [production]
14:02 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device asw1-bw27-esams [production]
14:02 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device asw1-bw27-esams [production]
14:00 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr1-esams [production]
14:00 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr1-esams [production]
13:59 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host registry1005.eqiad.wmnet with OS bookworm [production]
13:59 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device asw1-b13-drmrs [production]
13:59 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device asw1-b13-drmrs [production]
13:58 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device asw1-b12-drmrs [production]
13:58 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device asw1-b12-drmrs [production]
13:57 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-drmrs [production]
13:57 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr2-drmrs [production]
13:56 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr1-drmrs [production]
13:56 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr1-drmrs [production]
13:56 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr2-eqsin [production]
13:55 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr2-eqsin [production]
13:52 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr3-eqsin [production]
13:52 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr3-eqsin [production]
13:51 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr4-ulsfo [production]
13:51 <ayounsi@cumin1002> START - Cookbook sre.network.tls for network device cr4-ulsfo [production]
13:50 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device cr3-ulsfo [production]