2023-11-01
10:10 <jelto@cumin1001> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: GitLab version upgrade [production]
10:03 <jelto@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: GitLab version upgrade [production]
10:02 <jelto@cumin1001> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: GitLab version upgrade [production]
09:57 <moritzm> installing yajl security updates [production]
09:46 <moritzm> installing ncurses security updates [production]
09:28 <moritzm> installing RT security updates [production]
09:11 <moritzm> installing curl security updates [production]
09:06 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.grid.cleanup_queue_errors (exit_code=0) [tools]
09:06 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.grid.cleanup_queue_errors [tools]
09:06 <taavi@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.grid.cleanup_queue_errors (exit_code=99) [toolsbeta]
09:06 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.grid.cleanup_queue_errors [toolsbeta]
09:04 <taavi> reset local cookbook changes on cloudcumin1001 which were causing issues with puppet runs [admin]
09:02 <taavi> restart nova-fullstack which had had some issues after yesterday's cloudcontrol1007 reimage [admin]
08:47 <taavi> restart puppetdb [tools]
08:34 <jelto@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: GitLab version upgrade [production]
06:00 <kart_> Updated MinT to 2023-10-31-044726-production (T333969, T349991, T349079, T340507) [production]
05:57 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply [production]
05:51 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/machinetranslation: apply [production]
05:46 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply [production]
05:40 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/machinetranslation: apply [production]
05:32 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
05:29 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
02:17 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.drain_node (exit_code=0) (348643) [admin]
00:58 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (348643) [admin]
00:52 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.drain_node (exit_code=0) (348643) [admin]
00:51 <eileen> civicrm upgraded from 31d53b57 to 6ae3d3fc [production]
00:01 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1112.eqiad.wmnet with OS bullseye [production]
2023-10-31
23:59 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1111.eqiad.wmnet with OS bullseye [production]
23:51 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1110.eqiad.wmnet with OS bullseye [production]
23:46 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (348643) [admin]
23:43 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1112.eqiad.wmnet with reason: host reimage [production]
23:41 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1111.eqiad.wmnet with reason: host reimage [production]
23:38 <fabfur@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1112.eqiad.wmnet with reason: host reimage [production]
23:38 <fabfur@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1111.eqiad.wmnet with reason: host reimage [production]
23:33 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1110.eqiad.wmnet with reason: host reimage [production]
23:30 <fabfur@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1110.eqiad.wmnet with reason: host reimage [production]
23:23 <fabfur@cumin1001> START - Cookbook sre.hosts.reimage for host cp1112.eqiad.wmnet with OS bullseye [production]
23:23 <fabfur@cumin1001> START - Cookbook sre.hosts.reimage for host cp1111.eqiad.wmnet with OS bullseye [production]
23:23 <fabfur@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1112.eqiad.wmnet with OS bullseye [production]
23:22 <fabfur@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1111.eqiad.wmnet with OS bullseye [production]
23:15 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1107.eqiad.wmnet with OS bullseye [production]
23:15 <fabfur@cumin1001> START - Cookbook sre.hosts.reimage for host cp1112.eqiad.wmnet with OS bullseye [production]
23:15 <fabfur@cumin1001> START - Cookbook sre.hosts.reimage for host cp1111.eqiad.wmnet with OS bullseye [production]
23:15 <fabfur@cumin1001> START - Cookbook sre.hosts.reimage for host cp1110.eqiad.wmnet with OS bullseye [production]
23:15 <fabfur@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1111.eqiad.wmnet with OS bullseye [production]
23:14 <fabfur@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1110.eqiad.wmnet with OS bullseye [production]
23:14 <fabfur@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1112.eqiad.wmnet with OS bullseye [production]
23:12 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1109.eqiad.wmnet with OS bullseye [production]
23:09 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1108.eqiad.wmnet with OS bullseye [production]
23:08 <fabfur@cumin1001> START - Cookbook sre.hosts.reimage for host cp1112.eqiad.wmnet with OS bullseye [production]