2401-2450 of 10000 results (86ms)
2023-06-01 ยง
16:00 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudswift1001.eqiad.wmnet with OS bullseye [production]
15:59 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host cloudswift1001.eqiad.wmnet with OS bullseye [production]
15:57 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudswift1001.eqiad.wmnet with OS bullseye [production]
15:45 <aborrero@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
15:44 <aborrero@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
15:33 <aborrero@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
15:33 <aborrero@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
15:21 <fabfur> running run-puppet-agent on cp6010.drmrs.wmnet to fix icinga check from cookbook [production]
15:15 <bblack> lvs400[89]: upgrade pybal to 1.15.13 - T334703 [production]
15:11 <sukhe> reprepro -C component/pybal bullseye-wikimedia pybal_1.15.13_source.changes [production]
15:00 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mwlog1002.eqiad.wmnet with OS bullseye [production]
14:59 <moritzm> installing python-sqlparse security updates [production]
14:56 <ayounsi@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
14:56 <aborrero@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
14:55 <aborrero@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
14:55 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host cloudswift1001.eqiad.wmnet with OS bullseye [production]
14:53 <moritzm> installing jackson-databind security updates [production]
14:49 <ayounsi@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
14:45 <fabfur> running run-puppet-agent on cp6009.drmrs.wmnet to fix icinga check from cookbook [production]
14:44 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mwlog1002.eqiad.wmnet with reason: host reimage [production]
14:41 <herron@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mwlog1002.eqiad.wmnet with reason: host reimage [production]
14:40 <fabfur@cumin1001> START - Cookbook sre.cdn.run-puppet-restart-varnish rolling custom on A:cp-upload_drmrs and A:cp [production]
14:39 <ayounsi@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary [production]
14:39 <ayounsi@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary [production]
14:36 <fabfur@cumin1001> START - Cookbook sre.cdn.run-puppet-restart-varnish rolling custom on A:cp-text_drmrs and A:cp [production]
14:34 <aborrero@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol2004-dev.codfw.wmnet with OS bullseye [production]
14:29 <moritzm> installing imagemagick security updates on buster [production]
14:16 <herron@cumin1001> START - Cookbook sre.hosts.reimage for host mwlog1002.eqiad.wmnet with OS bullseye [production]
14:14 <fabfur> Disabled puppet on A:cp-drmrs for T323557 [production]
14:13 <mforns@deploy1002> Finished deploy [airflow-dags/analytics@3c9cc85]: (no justification provided) (duration: 00m 11s) [production]
14:13 <mforns@deploy1002> Started deploy [airflow-dags/analytics@3c9cc85]: (no justification provided) [production]
14:13 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158 (T336886)', diff saved to https://phabricator.wikimedia.org/P48700 and previous config saved to /var/cache/conftool/dbconfig/20230601-141317-ladsgroup.json [production]
14:11 <claime> Removing obsolete mediawiki-services-function-evaluator from registry - T337505 [production]
13:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158', diff saved to https://phabricator.wikimedia.org/P48699 and previous config saved to /var/cache/conftool/dbconfig/20230601-135811-ladsgroup.json [production]
13:52 <moritzm> installing sysstat security updates [production]
13:52 <jelto@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
13:51 <jelto@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
13:50 <jelto@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
13:50 <jelto@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
13:49 <jelto@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
13:49 <jelto@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
13:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158', diff saved to https://phabricator.wikimedia.org/P48698 and previous config saved to /var/cache/conftool/dbconfig/20230601-134304-ladsgroup.json [production]
13:29 <moritzm> installing openssl security updates on bullseye [production]
13:27 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158 (T336886)', diff saved to https://phabricator.wikimedia.org/P48697 and previous config saved to /var/cache/conftool/dbconfig/20230601-132758-ladsgroup.json [production]
13:23 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2158 (T336886)', diff saved to https://phabricator.wikimedia.org/P48695 and previous config saved to /var/cache/conftool/dbconfig/20230601-132319-ladsgroup.json [production]
13:23 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
13:23 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
13:22 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2158.codfw.wmnet with reason: Maintenance [production]
13:22 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2158.codfw.wmnet with reason: Maintenance [production]
13:22 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2151 (T336886)', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20230601-132238-ladsgroup.json [production]