production SAL

1051-1100 of 10000 results (68ms)

2022-11-23 §
16:18	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1096:3316', diff saved to https://phabricator.wikimedia.org/P40794 and previous config saved to /var/cache/conftool/dbconfig/20221123-161844-ladsgroup.json	[production]
16:17	<eevans@cumin1001>	START - Cookbook sre.cassandra.roll-restart for nodes matching aqs[2001-2004].codfw.wmnet,aqs[1010-1015].eqiad.wmnet: T314309 restarting to pick up new JRE - eevans@cumin1001	[production]
16:16	<pt1979@cumin1001>	START - Cookbook sre.hosts.provision for host contint1002.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
16:16	<pt1979@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host contint1002.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
16:15	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1170:3317', diff saved to https://phabricator.wikimedia.org/P40793 and previous config saved to /var/cache/conftool/dbconfig/20221123-161512-marostegui.json	[production]
16:10	<hnowlan@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/thumbor: sync	[production]
16:09	<hnowlan@deploy1002>	helmfile [eqiad] START helmfile.d/services/thumbor: sync	[production]
16:08	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2117', diff saved to https://phabricator.wikimedia.org/P40792 and previous config saved to /var/cache/conftool/dbconfig/20221123-160837-ladsgroup.json	[production]
16:08	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/thumbor: sync	[production]
16:07	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/thumbor: sync	[production]
16:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1096:3316', diff saved to https://phabricator.wikimedia.org/P40791 and previous config saved to /var/cache/conftool/dbconfig/20221123-160338-ladsgroup.json	[production]
16:03	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/thumbor: sync	[production]
16:00	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'db1132 (re)pooling @ 100%: Maint done', diff saved to https://phabricator.wikimedia.org/P40790 and previous config saved to /var/cache/conftool/dbconfig/20221123-160022-ladsgroup.json	[production]
16:00	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1170:3317', diff saved to https://phabricator.wikimedia.org/P40789 and previous config saved to /var/cache/conftool/dbconfig/20221123-160005-marostegui.json	[production]
15:53	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2117', diff saved to https://phabricator.wikimedia.org/P40788 and previous config saved to /var/cache/conftool/dbconfig/20221123-155330-ladsgroup.json	[production]
15:53	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/thumbor: sync	[production]
15:52	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/thumbor: sync	[production]
15:52	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/thumbor: sync	[production]
15:51	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/thumbor: sync	[production]
15:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1096:3316 (T323214)', diff saved to https://phabricator.wikimedia.org/P40787 and previous config saved to /var/cache/conftool/dbconfig/20221123-154831-ladsgroup.json	[production]
15:45	<sukhe@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
15:45	<sukhe@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Updating for lvs4009 and lvs4010 - sukhe@cumin2002"	[production]
15:45	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'db1132 (re)pooling @ 75%: Maint done', diff saved to https://phabricator.wikimedia.org/P40786 and previous config saved to /var/cache/conftool/dbconfig/20221123-154517-ladsgroup.json	[production]
15:44	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1170:3317 (T321126)', diff saved to https://phabricator.wikimedia.org/P40785 and previous config saved to /var/cache/conftool/dbconfig/20221123-154459-marostegui.json	[production]
15:44	<sukhe@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Updating for lvs4009 and lvs4010 - sukhe@cumin2002"	[production]
15:42	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1170:3317 (T321126)', diff saved to https://phabricator.wikimedia.org/P40784 and previous config saved to /var/cache/conftool/dbconfig/20221123-154242-marostegui.json	[production]
15:42	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1170.eqiad.wmnet with reason: Maintenance	[production]
15:42	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 5:00:00 on db1170.eqiad.wmnet with reason: Maintenance	[production]
15:42	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158 (T321126)', diff saved to https://phabricator.wikimedia.org/P40783 and previous config saved to /var/cache/conftool/dbconfig/20221123-154220-marostegui.json	[production]
15:42	<sukhe@cumin2002>	START - Cookbook sre.dns.netbox	[production]
15:41	<btullis@cumin2002>	START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons.	[production]
15:41	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/thumbor: sync	[production]
15:38	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2117 (T323214)', diff saved to https://phabricator.wikimedia.org/P40782 and previous config saved to /var/cache/conftool/dbconfig/20221123-153824-ladsgroup.json	[production]
15:35	<pt1979@cumin1001>	START - Cookbook sre.hosts.provision for host contint1002.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:31	<oblivian@deploy1002>	helmfile [staging] DONE helmfile.d/services/image-suggestion: apply	[production]
15:30	<oblivian@deploy1002>	helmfile [staging] START helmfile.d/services/image-suggestion: apply	[production]
15:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'db1132 (re)pooling @ 10%: Maint done', diff saved to https://phabricator.wikimedia.org/P40780 and previous config saved to /var/cache/conftool/dbconfig/20221123-153012-ladsgroup.json	[production]
15:29	<pt1979@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host contint1002.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:29	<jforrester@deploy1002>	Finished deploy [integration/docroot@52e4a00]: Deploying 52e4a00 for T311097 pointing Codex docs to latest (duration: 00m 14s)	[production]
15:28	<jforrester@deploy1002>	Started deploy [integration/docroot@52e4a00]: Deploying 52e4a00 for T311097 pointing Codex docs to latest	[production]
15:27	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P40779 and previous config saved to /var/cache/conftool/dbconfig/20221123-152714-marostegui.json	[production]
15:15	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.dns.netbox (exit_code=99)	[production]
15:15	<moritzm>	updating snapshot* hosts to PHP 7.4.33-1+0~20221108.73+debian10~1.gbpa00350a+wmf10u1 T323358	[production]
15:15	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'db1132 (re)pooling @ 25%: Maint done', diff saved to https://phabricator.wikimedia.org/P40778 and previous config saved to /var/cache/conftool/dbconfig/20221123-151507-ladsgroup.json	[production]
15:13	<pt1979@cumin2002>	START - Cookbook sre.dns.netbox	[production]
15:12	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P40777 and previous config saved to /var/cache/conftool/dbconfig/20221123-151207-marostegui.json	[production]
15:11	<pt1979@cumin1001>	START - Cookbook sre.hosts.provision for host contint1002.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:10	<claime>	deploying change 859575 on mw-* wikikube deployments	[production]
15:10	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1132.eqiad.wmnet with reason: Maintenance	[production]
15:10	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1132.eqiad.wmnet with reason: Maintenance	[production]