2022-02-18
09:35 <moritzm> draining instances off ganeti1009 [production]
09:33 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1022.eqiad.wmnet with OS buster [production]
09:02 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti1022.eqiad.wmnet with reason: host reimage [production]
09:01 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM testvm2001.codfw.wmnet [production]
08:58 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1022.eqiad.wmnet with reason: host reimage [production]
08:57 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM testvm2002.codfw.wmnet [production]
08:54 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM testvm2002.codfw.wmnet [production]
08:53 <kart_> Updated cxserver to 2022-02-15-050044-production (T301443) [production]
08:52 <kartik@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
08:50 <kartik@deploy1002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
08:47 <kartik@deploy1002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
08:45 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1022.eqiad.wmnet with OS buster [production]
08:45 <kartik@deploy1002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
08:39 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
08:39 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
08:19 <kevinbazira@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . [production]
08:19 <kevinbazira@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . [production]
07:57 <elukey@deploy1002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
07:57 <elukey@deploy1002> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
07:57 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
07:57 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
07:42 <elukey@deploy1002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
07:42 <elukey@deploy1002> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
07:41 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
07:41 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
02:15 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
02:14 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
02:14 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
02:12 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
02:12 <cdanis@deploy1002> Synchronized wmf-config/InitialiseSettings.php: enable wmgEmergencyCaptcha for enwiki ff2f7ef64 T302047 (duration: 00m 49s) [production]
02:09 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
02:07 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
02:06 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
02:06 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
02:05 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
02:03 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
02:03 <cdanis@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Disable AbuseFilter throttling on enwiki 6692b4642 T302047 (duration: 00m 49s) [production]
2022-02-17
22:28 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:25 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
21:19 <razzi@cumin1001> END (ERROR) - Cookbook sre.ganeti.makevm (exit_code=93) for new host datahubsearch1002.eqiad.wmnet [production]
20:04 <dcausse@deploy1002> Finished deploy [wikimedia/discovery/analytics@66350a9]: (no justification provided) (duration: 02m 02s) [production]
20:02 <dcausse@deploy1002> Started deploy [wikimedia/discovery/analytics@66350a9]: (no justification provided) [production]
19:54 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase-dev2003.codfw.wmnet with OS buster [production]
19:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3312 (T300510)', diff saved to https://phabricator.wikimedia.org/P21009 and previous config saved to /var/cache/conftool/dbconfig/20220217-195302-ladsgroup.json [production]
19:45 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase-dev2003.codfw.wmnet with reason: host reimage [production]
19:41 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase-dev2003.codfw.wmnet with reason: host reimage [production]
19:37 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3312', diff saved to https://phabricator.wikimedia.org/P21008 and previous config saved to /var/cache/conftool/dbconfig/20220217-193757-ladsgroup.json [production]
19:35 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase-dev2002.codfw.wmnet with OS buster [production]
19:26 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase-dev2002.codfw.wmnet with reason: host reimage [production]
19:24 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host restbase-dev2003.codfw.wmnet with OS buster [production]