production SAL

451-500 of 10000 results (28ms)

2020-12-23 §
09:59	<hashar>	gerrit: removed old gerrit directory /srv/var-lib-gerrit2-cobalt.wikimedia.org/.gerritcodereview/ (was some tmp dirs for Gerrit jars )	[production]
09:54	<volans>	upgraded python3-wmflib to 0.0.5 on cumin1001	[production]
05:54	<ladsgroup@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: [[gerrit:651682\|Fix typo in autoreview right of eliminators in fawiki]] (duration: 00m 57s)	[production]
2020-12-22 §
21:57	<mutante>	apt1001 - sudo systemctl status rsync-aptrepo-apt2001.wikimedia.org.service - confirmed timer job is working like the cron before	[production]
21:31	<mutante>	deploy1002/deploy2002 - apt-get remove --purge php-readline and let puppet reinstall it (7.2 vs 7.3 after gerrit 651158) T265963	[production]
21:26	<andrewbogott>	upgrading wikitech-static: mediawiki to 1.35.1 and general apt upgrade	[production]
20:26	<eileen>	civicrm revision changed from e86e756807 to 6150267979, config revision is 52f1cbc5dd	[production]
19:32	<mutante>	restarting gerrit to pick up config change in gitiles for T269300	[production]
18:29	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on labstore1004.eqiad.wmnet with reason: REIMAGE	[production]
18:27	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on labstore1004.eqiad.wmnet with reason: REIMAGE	[production]
17:27	<andrewbogott>	shutting down labstore1004 in preparation for move and reimage	[production]
16:51	<mforns@deploy1001>	Finished deploy [analytics/refinery@21c0c89] (thin): Regular analytics weekly train THIN [analytics/refinery@Ie7bce02179547ee4c6756d52f9956f492c5b4df6] (duration: 00m 08s)	[production]
16:51	<mforns@deploy1001>	Started deploy [analytics/refinery@21c0c89] (thin): Regular analytics weekly train THIN [analytics/refinery@Ie7bce02179547ee4c6756d52f9956f492c5b4df6]	[production]
16:48	<volans>	restarted ferm on ms-be1026 (failed with DNS query for 'ms-be1055.eqiad.wmnet' failed: query timed out )	[production]
16:15	<bstorm>	downtimed and stopped puppet on labstore1004 and labstore1005 for failover T266202	[production]
15:23	<jgiannelos@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'wikifeeds' for release 'production' .	[production]
15:12	<jgiannelos@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'wikifeeds' for release 'production' .	[production]
15:08	<jgiannelos@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'wikifeeds' for release 'staging' .	[production]
11:52	<marostegui>	Set db1151 to writable T269324	[production]
11:10	<jbond42>	upload puppet 5.5.22 to jessie-wikimedia	[production]
11:02	<jbond42>	upload puppet 5.5.22 to stretch-wikimedia	[production]
10:51	<volans@cumin2001>	test SAL message from wmflib, please ignore	[production]
10:06	<volans>	upgraded python3-wmflib to 0.0.5 on cumin2001	[production]
08:52	<hashar>	gerrit: running jhat heap analyzer on gerrit2001 # T263008	[production]
07:27	<elukey>	reboot stat100[4-8] (analytics hadoop clients) for kernel upgrades	[production]
00:20	<crusnov@deploy1001>	Finished deploy [netbox/deploy@b17db99]: Redeploy of 2.9.10 to netbox-dev for dep test (duration: 00m 54s)	[production]
00:19	<crusnov@deploy1001>	Started deploy [netbox/deploy@b17db99]: Redeploy of 2.9.10 to netbox-dev for dep test	[production]
2020-12-21 §
23:20	<legoktm@deploy1001>	Synchronized docroot/noc/conf/index.php: noc: Fix "Currently active MediaWiki versions" (T235338) (duration: 00m 54s)	[production]
22:26	<crusnov@deploy1001>	Finished deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing p2 (duration: 00m 05s)	[production]
22:26	<crusnov@deploy1001>	Started deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing p2	[production]
22:26	<crusnov@deploy1001>	Finished deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing (duration: 01m 01s)	[production]
22:25	<sbassett>	Deployed security patch T270453	[production]
22:25	<crusnov@deploy1001>	Started deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing	[production]
22:18	<chaomodus>	Re-enabling puppet on Netbox production instances after havintg tested netbox2001 with new puppet code T266487	[production]
21:42	<legoktm>	manually imported debs to buster-wikimedia thirdparty/pyall component (T241195)	[production]
21:09	<chaomodus>	merging change 643354 for Netbox 2.9 support, puppet disabled on production machines until testing completed T266487	[production]
19:47	<dcausse>	repool wdqs1011	[production]
18:30	<dancy@deploy1001>	Finished scap: Backport of l10n changes for T270619 (duration: 21m 12s)	[production]
18:28	<robh@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1011.eqiad.wmnet with reason: REIMAGE	[production]
18:26	<robh@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1011.eqiad.wmnet with reason: REIMAGE	[production]
18:18	<robh@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE	[production]
18:16	<robh@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE	[production]
18:14	<volans>	uploaded python3-wmflib_0.0.5 to apt.wikimedia.org buster-wikimedia	[production]
18:09	<dancy@deploy1001>	Started scap: Backport of l10n changes for T270619	[production]
17:58	<jiji@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2024.codfw.wmnet with reason: REIMAGE	[production]
17:56	<legoktm@deploy1001>	Synchronized /srv/mediawiki-staging/php-1.36.0-wmf.22/extensions/FeaturedFeeds/includes/FeaturedFeeds.php: Don't load entire feed just to output the link to it (T266900) (duration: 01m 01s)	[production]
17:56	<jiji@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1024.eqiad.wmnet with reason: REIMAGE	[production]
17:56	<jiji@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc2024.codfw.wmnet with reason: REIMAGE	[production]
17:54	<jiji@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc1024.eqiad.wmnet with reason: REIMAGE	[production]
17:33	<robh@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE	[production]