201-250 of 10000 results (55ms)
2022-08-18 ยง
22:33 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
22:32 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
22:32 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
22:32 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
22:31 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
22:31 <dancy@deploy1002> rebuilt and synchronized wikiversions files: group2 wikis to 1.39.0-wmf.23 refs T314186 [production]
22:25 <dancy> Rolling the train back to group1 due to T315620 [production]
22:25 <xcollazo@deploy1002> Finished deploy [airflow-dags/platform_eng@ff0a0e2]: (no justification provided) (duration: 00m 19s) [production]
22:24 <xcollazo@deploy1002> Started deploy [airflow-dags/platform_eng@ff0a0e2]: (no justification provided) [production]
22:16 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kubernetes2024.mgmt.codfw.wmnet with reboot policy FORCED [production]
22:09 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2024.mgmt.codfw.wmnet with reboot policy FORCED [production]
22:05 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:02 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
22:02 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:50 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
21:48 <pt1979@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host kubernetes2024 [production]
21:47 <pt1979@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host kubernetes2024 [production]
21:20 <brennen> end of UTC late backport and config window [production]
21:20 <brennen@deploy1002> Finished scap: [[gerrit:824433|Set initial-zoom via JavaScript to avoid font-scaling issue in iPad (T311795)]] (duration: 10m 16s) [production]
21:15 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
21:14 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
21:14 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
21:14 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on relforge[1003-1004].eqiad.wmnet with reason: elastic 7 upgrade [production]
21:14 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on relforge[1003-1004].eqiad.wmnet with reason: elastic 7 upgrade [production]
21:13 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
21:09 <brennen@deploy1002> Started scap: [[gerrit:824433|Set initial-zoom via JavaScript to avoid font-scaling issue in iPad (T311795)]] [production]
21:03 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-stretch2002.codfw.wmnet with OS bullseye [production]
20:40 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:40 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:40 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host kafka-stretch2002.codfw.wmnet with OS bullseye [production]
20:39 <brennen@deploy1002> Finished scap: [[gerrit:816239|Allow admin to grant/revoke "transwiki" group on zh(wikt|wb|wq|ws) (T313657)]] (duration: 07m 09s) [production]
20:39 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:37 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-stretch2002.codfw.wmnet with OS bullseye [production]
20:32 <brennen@deploy1002> Started scap: [[gerrit:816239|Allow admin to grant/revoke "transwiki" group on zh(wikt|wb|wq|ws) (T313657)]] [production]
20:29 <brennen@deploy1002> Finished scap: [[gerrit:824395|Deploy partial action blocks to cswiki (T315525)]] (duration: 19m 16s) [production]
20:20 <robh@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host dumpsdata1006.eqiad.wmnet with OS bullseye [production]
20:19 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:18 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:18 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:17 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:10 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host kafka-stretch2002.codfw.wmnet with OS bullseye [production]
20:09 <brennen@deploy1002> Started scap: [[gerrit:824395|Deploy partial action blocks to cswiki (T315525)]] [production]
20:00 <robh@cumin1001> START - Cookbook sre.hosts.reimage for host dumpsdata1006.eqiad.wmnet with OS bullseye [production]
19:57 <ottomata> renable puppet on an-master* [production]
19:47 <ottomata> temporarily disable puppet on an-master100* while applying change in test cluster - T312858 [production]
19:34 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dumpsdata1007.eqiad.wmnet with OS bullseye [production]
19:19 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dumpsdata1007.eqiad.wmnet with reason: host reimage [production]
19:16 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on dumpsdata1007.eqiad.wmnet with reason: host reimage [production]
19:10 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
19:00 <robh@cumin1001> START - Cookbook sre.hosts.reimage for host dumpsdata1007.eqiad.wmnet with OS bullseye [production]