5301-5350 of 10000 results (102ms)
2024-05-02 ยง
18:10 <sukhe@cumin1002> START - Cookbook sre.dns.netbox [production]
18:10 <sukhe@cumin1002> START - Cookbook sre.ganeti.makevm for new host doh7001.wikimedia.org [production]
18:09 <sukhe@cumin1002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host doh7001.wikimedia.org [production]
18:09 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh7001.wikimedia.org on all recursors [production]
18:09 <sukhe@cumin1002> START - Cookbook sre.dns.wipe-cache doh7001.wikimedia.org on all recursors [production]
18:09 <sukhe@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM doh7001.wikimedia.org - sukhe@cumin1002" [production]
18:09 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:08 <sukhe@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM doh7001.wikimedia.org - sukhe@cumin1002" [production]
18:05 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:restbase-codfw: Apply updated JDK 8 - eevans@cumin1002 [production]
18:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1241 (T361627)', diff saved to https://phabricator.wikimedia.org/P61756 and previous config saved to /var/cache/conftool/dbconfig/20240502-180136-marostegui.json [production]
17:58 <sfaci@deploy1002> helmfile [staging] START helmfile.d/services/editor-analytics: apply [production]
17:55 <sukhe@cumin1002> START - Cookbook sre.dns.netbox [production]
17:55 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh7001.wikimedia.org on all recursors [production]
17:55 <sukhe@cumin1002> START - Cookbook sre.dns.wipe-cache doh7001.wikimedia.org on all recursors [production]
17:55 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:53 <sukhe@cumin1002> START - Cookbook sre.dns.netbox [production]
17:53 <sfaci@deploy1002> helmfile [staging] START helmfile.d/services/editor-analytics: apply [production]
17:52 <sukhe@cumin1002> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
17:50 <sfaci@deploy1002> helmfile [staging] START helmfile.d/services/editor-analytics: apply [production]
17:49 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1241 (T361627)', diff saved to https://phabricator.wikimedia.org/P61755 and previous config saved to /var/cache/conftool/dbconfig/20240502-174920-marostegui.json [production]
17:49 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1241.eqiad.wmnet with reason: Maintenance [production]
17:49 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1241.eqiad.wmnet with reason: Maintenance [production]
17:48 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1221 (T361627)', diff saved to https://phabricator.wikimedia.org/P61754 and previous config saved to /var/cache/conftool/dbconfig/20240502-174856-marostegui.json [production]
17:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P61753 and previous config saved to /var/cache/conftool/dbconfig/20240502-173349-marostegui.json [production]
17:24 <brett@cumin2002> START - Cookbook sre.dns.netbox [production]
17:24 <brett@cumin2002> START - Cookbook sre.ganeti.makevm for new host ncredir7001.magru.wmnet [production]
17:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P61752 and previous config saved to /var/cache/conftool/dbconfig/20240502-171840-marostegui.json [production]
17:15 <sfaci@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
17:15 <sfaci@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
17:05 <sukhe@cumin1002> START - Cookbook sre.dns.netbox [production]
17:05 <sukhe@cumin1002> START - Cookbook sre.ganeti.makevm for new host doh7001.wikimedia.org [production]
17:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1221 (T361627)', diff saved to https://phabricator.wikimedia.org/P61751 and previous config saved to /var/cache/conftool/dbconfig/20240502-170332-marostegui.json [production]
16:53 <sfaci@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
16:52 <sfaci@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
16:52 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1221 (T361627)', diff saved to https://phabricator.wikimedia.org/P61750 and previous config saved to /var/cache/conftool/dbconfig/20240502-165211-marostegui.json [production]
16:52 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
16:51 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
16:51 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1221.eqiad.wmnet with reason: Maintenance [production]
16:51 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1221.eqiad.wmnet with reason: Maintenance [production]
16:51 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1199 (T361627)', diff saved to https://phabricator.wikimedia.org/P61749 and previous config saved to /var/cache/conftool/dbconfig/20240502-165129-marostegui.json [production]
16:40 <sukhe@cumin1002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum7001.magru.wmnet [production]
16:40 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum7001.magru.wmnet with OS bookworm [production]
16:39 <sfaci@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
16:38 <sfaci@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
16:36 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P61748 and previous config saved to /var/cache/conftool/dbconfig/20240502-163622-marostegui.json [production]
16:21 <amastilovic@deploy1002> Finished deploy [airflow-dags/analytics@7513bfa]: (no justification provided) (duration: 00m 44s) [production]
16:21 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P61747 and previous config saved to /var/cache/conftool/dbconfig/20240502-162114-marostegui.json [production]
16:20 <amastilovic@deploy1002> Started deploy [airflow-dags/analytics@7513bfa]: (no justification provided) [production]
16:16 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum7001.magru.wmnet with reason: host reimage [production]
16:15 <sukhe> running authdns-update once again to confirm state of dns700[12] [production]