production SAL

4201-4250 of 10000 results (54ms)

2022-02-02 §
07:52	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1129 (T300402)', diff saved to https://phabricator.wikimedia.org/P19892 and previous config saved to /var/cache/conftool/dbconfig/20220202-075244-marostegui.json	[production]
07:52	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1129.eqiad.wmnet with reason: Maintenance	[production]
07:52	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1129.eqiad.wmnet with reason: Maintenance	[production]
07:52	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1156 (T300402)', diff saved to https://phabricator.wikimedia.org/P19891 and previous config saved to /var/cache/conftool/dbconfig/20220202-075236-marostegui.json	[production]
07:51	<taavi@deploy1002>	Finished deploy [horizon/deploy@9d02cd6]: update wmf-proxy-dashboard (eqiad1) (duration: 04m 09s)	[production]
07:47	<taavi@deploy1002>	Started deploy [horizon/deploy@9d02cd6]: update wmf-proxy-dashboard (eqiad1)	[production]
07:46	<elukey@deploy1002>	helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' .	[production]
07:45	<elukey@deploy1002>	helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' .	[production]
07:44	<taavi@deploy1002>	Finished deploy [horizon/deploy@9d02cd6]: update wmf-proxy-dashboard (duration: 02m 19s)	[production]
07:42	<taavi@deploy1002>	Started deploy [horizon/deploy@9d02cd6]: update wmf-proxy-dashboard	[production]
07:39	<marostegui@cumin1001>	dbctl commit (dc=all): 'Set es1020 with weight 10 T300127', diff saved to https://phabricator.wikimedia.org/P19890 and previous config saved to /var/cache/conftool/dbconfig/20220202-073918-root.json	[production]
07:38	<root@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 6 hosts with reason: Switchover es4 T300127	[production]
07:38	<root@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on 6 hosts with reason: Switchover es4 T300127	[production]
07:37	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
07:37	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P19889 and previous config saved to /var/cache/conftool/dbconfig/20220202-073731-marostegui.json	[production]
07:36	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
07:36	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
07:36	<marostegui@deploy1002>	Synchronized wmf-config/db-production.php: Disable writes on es4 T300127 (duration: 00m 50s)	[production]
07:35	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
07:30	<marostegui@deploy1002>	Synchronized wmf-config/ProductionServices.php: Disable writes on es4 T300127 (duration: 00m 51s)	[production]
07:22	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P19888 and previous config saved to /var/cache/conftool/dbconfig/20220202-072227-marostegui.json	[production]
07:07	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1156 (T300402)', diff saved to https://phabricator.wikimedia.org/P19887 and previous config saved to /var/cache/conftool/dbconfig/20220202-070722-marostegui.json	[production]
07:00	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1156 (T300402)', diff saved to https://phabricator.wikimedia.org/P19886 and previous config saved to /var/cache/conftool/dbconfig/20220202-070012-marostegui.json	[production]
07:00	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
07:00	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
07:00	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance	[production]
07:00	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance	[production]
06:59	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on 8 hosts with reason: Maintenance	[production]
06:59	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 12:00:00 on 8 hosts with reason: Maintenance	[production]
06:59	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2104.codfw.wmnet with reason: Maintenance	[production]
06:58	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db2104.codfw.wmnet with reason: Maintenance	[production]
02:54	<pt1979@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
02:48	<pt1979@cumin2002>	START - Cookbook sre.dns.netbox	[production]
02:29	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve2008.codfw.wmnet with OS buster	[production]
02:19	<ejegg>	updated CiviCRM from 0513f1b7 to 3d379e25	[production]
01:57	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host ml-serve2008.codfw.wmnet with OS buster	[production]
01:40	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve2007.codfw.wmnet with OS buster	[production]
01:22	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host ml-serve2007.codfw.wmnet with OS buster	[production]
01:13	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ml-serve2007.codfw.wmnet with OS buster	[production]
01:12	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host ml-serve2007.codfw.wmnet with OS buster	[production]
01:12	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ml-serve2007.codfw.wmnet with OS buster	[production]
01:06	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
01:05	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
01:05	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
01:04	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
01:03	<ebernhardson@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:753788\|rdf-streaming-updater: add the reconciliation stream (T279541)]] (duration: 00m 49s)	[production]
00:53	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
00:53	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host ml-serve2007.codfw.wmnet with OS buster	[production]
00:52	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
00:52	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]