2024-08-07 §
20:35 <jclark@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1004.wikimedia.org with reason: host reimage [production]
20:21 <cjming> end of UTC late backport window [production]
20:17 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host gerrit1004.wikimedia.org with OS bookworm [production]
20:15 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host gerrit1004.wikimedia.org with OS bookworm [production]
20:11 <milimetric@deploy1003> Finished deploy [airflow-dags/analytics@049c09e]: (no justification provided) (duration: 00m 03s) [production]
20:11 <milimetric@deploy1003> Started deploy [airflow-dags/analytics@049c09e]: (no justification provided) [production]
20:08 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host vrts1003.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:04 <jclark@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:04 <jclark@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt vrts1003 - jclark@cumin1002" [production]
20:04 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt vrts1003 - jclark@cumin1002" [production]
20:01 <jclark@cumin1002> START - Cookbook sre.dns.netbox [production]
19:59 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1004.wikimedia.org with reason: host reimage [production]
19:55 <jclark@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1004.wikimedia.org with reason: host reimage [production]
19:53 <milimetric@deploy1003> Finished deploy [airflow-dags/analytics@049c09e]: (no justification provided) (duration: 00m 59s) [production]
19:52 <milimetric@deploy1003> Started deploy [airflow-dags/analytics@049c09e]: (no justification provided) [production]
19:52 <milimetric@deploy1003> Finished deploy [airflow-dags/analytics@216348d]: (no justification provided) (duration: 00m 47s) [production]
19:51 <milimetric@deploy1003> Started deploy [airflow-dags/analytics@216348d]: (no justification provided) [production]
19:47 <milimetric@deploy1003> Finished deploy [airflow-dags/analytics@049c09e]: Deploying new Browser General job (duration: 00m 02s) [production]
19:47 <milimetric@deploy1003> Started deploy [airflow-dags/analytics@049c09e]: Deploying new Browser General job [production]
19:46 <milimetric@deploy1003> Finished deploy [airflow-dags/analytics@049c09e]: Deploying new Browser General job (duration: 00m 41s) [production]
19:45 <milimetric@deploy1003> Started deploy [airflow-dags/analytics@049c09e]: Deploying new Browser General job [production]
19:39 <ebernhardson@deploy1003> Finished deploy [airflow-dags/search@049c09e]: workaround process_sparql_query oom issues (duration: 00m 20s) [production]
19:39 <ebernhardson@deploy1003> Started deploy [airflow-dags/search@049c09e]: workaround process_sparql_query oom issues [production]
19:38 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host gerrit1004.wikimedia.org with OS bookworm [production]
19:37 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host gerrit1004.mgmt.eqiad.wmnet with reboot policy FORCED [production]
19:33 <brett> start pybal on lvs1017 [production]
19:32 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1017.eqiad.wmnet [production]
19:29 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs1017.eqiad.wmnet [production]
19:18 <brennen@deploy1003> Finished scap: Backport for [[gerrit:1060489|Fix TypeError in PendingChanges by handling null subPage (T371986)]] (duration: 08m 23s) [production]
19:14 <brennen@deploy1003> brennen: Continuing with sync [production]
19:12 <brennen@deploy1003> brennen: Backport for [[gerrit:1060489|Fix TypeError in PendingChanges by handling null subPage (T371986)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
19:11 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host gerrit1004.mgmt.eqiad.wmnet with reboot policy FORCED [production]
19:11 <jclark@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:11 <jclark@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt gerrit1004 - jclark@cumin1002" [production]
19:11 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt gerrit1004 - jclark@cumin1002" [production]
19:10 <brennen@deploy1003> Started scap sync-world: Backport for [[gerrit:1060489|Fix TypeError in PendingChanges by handling null subPage (T371986)]] [production]
19:08 <jclark@cumin1002> START - Cookbook sre.dns.netbox [production]
19:04 <brett> stop pybal on lvs1017 for server reboot [production]
19:00 <brett> start pybal on lvs1018 [production]
18:59 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1018.eqiad.wmnet [production]
18:56 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs1018.eqiad.wmnet [production]
18:45 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1296.eqiad.wmnet with OS bullseye [production]
18:40 <brett> stop pybal on lvs1018 for server reboot [production]
18:39 <milimetric@deploy1003> Finished deploy [analytics/refinery@fe20690]: Syncing browser general script hive version (duration: 16m 05s) [production]
18:35 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1296.eqiad.wmnet with reason: host reimage [production]
18:33 <brett> start pybal on lvs1019 [production]
18:32 <jclark@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1296.eqiad.wmnet with reason: host reimage [production]
18:32 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1038.eqiad.wmnet with OS bullseye [production]
18:30 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1296.eqiad.wmnet with OS bullseye [production]
18:28 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1296.eqiad.wmnet with OS bullseye [production]