3551-3600 of 10000 results (26ms)
2021-04-08 ยง
19:55 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw240[3-9].codfw.wmnet [production]
19:54 <dzahn@cumin1001> conftool action : set/weight=30; selector: name=mw240[3-9].codfw.wmnet [production]
19:50 <mutante> mw2403 through mw2411 - scap pull - new hardware [production]
19:35 <dduvall@deploy1002> rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.38 [production]
18:52 <phuedx> phuedx@deploy1002 Synchronized private/PrivateSettings.php: PrivateSettings: Add value for $wgWMEVectorPrefDiffSalt (T261842) [production]
18:51 <phuedx@deploy1002> Synchronized private/PrivateSettings.php: PrivateSettings: Add value for (T261842) (duration: 01m 06s) [production]
18:37 <mutante> mw2403 through mw2411 - serial rebooting [production]
18:31 <tgr@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . [production]
18:31 <tgr@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . [production]
18:29 <urbanecm@deploy1002> Synchronized php-1.36.0-wmf.38/extensions/VisualEditor/modules/ve-mw/ui/tools/ve.ui.MWBackTool.js: e0f3735f6a31d2914bae6c9daac1267707a2d108: Revert incorrect changes to ve.ui.MWBackCommand that made it stop working (T279613) (duration: 01m 07s) [production]
18:27 <bstorm> cleaned up the deprecated entries in /data/project/.system_sge/gridengine/etc/submithosts for toolsbeta-sgegrid-master and toolsbeta-sgegrid-shadow using the old fqdns T277653 [toolsbeta]
18:25 <bstorm> cleaned up the deprecated entries in /data/project/.system_sge/gridengine/etc/submithosts for tools-sgegrid-master and tools-sgegrid-shadow using the old fqdns T277653 [tools]
18:25 <tgr@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . [production]
18:25 <tgr@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . [production]
18:23 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw[2410-2411].codfw.wmnet with reason: new_install [production]
18:23 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw[2410-2411].codfw.wmnet with reason: new_install [production]
18:22 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 7 hosts with reason: new_install [production]
18:22 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 7 hosts with reason: new_install [production]
18:03 <mutante> mw2403 through mw2411 - new hardware moving into production, not pooled yet, initial puppet run, being added to icinga etc, creating mcrouter certs for them (T279599) [production]
18:02 <mutante> mw2403 through mw2401 - new hardwere moving into production, not pooled yet, initial puppet run, being added to icinga etc, creating mcrouter certs for them (T279599) [production]
17:59 <tgr@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . [production]
17:52 <ryankemper@cumin2001> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
17:29 <jgiannelos@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
17:23 <jgiannelos@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
17:18 <jgiannelos@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
17:16 <dancy> Scap 3.17.0 deployed to beta cluster [production]
16:51 <dancy> testing Scap 3.17.0 release on deployment-deploy01 [production]
16:33 <elukey> reboot an-worker1100 again to check if all the disks come up correctly [production]
16:33 <elukey> reboot an-worker1100 again to check if all the disks come up correctly [analytics]
16:16 <cmjohnson1> update bios cp1087, already deposed for h/w issues T278729 [production]
16:15 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1025.eqiad.wmnet with reason: REIMAGE [production]
16:13 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wtp1025.eqiad.wmnet with reason: REIMAGE [production]
16:10 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:05 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
15:57 <James_F> Zuul: [mediawiki/extensions/WikiToLDAP] Add quibble and phan job [releng]
15:51 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:44 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
15:43 <razzi> rebalance kafka partitions for webrequest_text partitions 17, 18 [analytics]
15:36 <elukey> reboot an-worker1100 to see if it helps with the strange BBU behavior [production]
15:35 <elukey> reboot an-worker1100 to see if it helps with the strange BBU behavior in T279475 [analytics]
14:07 <elukey> drop /var/spool/rsyslog from stat1008 - corrupted files due to root partition filled up caused a SEGV for rsyslog [analytics]
13:55 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cloudcephmon2001-dev.codfw.wmnet [production]
13:44 <andrew@cumin1001> START - Cookbook sre.hosts.decommission for hosts cloudcephmon2001-dev.codfw.wmnet [production]
13:41 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse2001.codfw.wmnet with reason: REIMAGE [production]
13:39 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on parse2001.codfw.wmnet with reason: REIMAGE [production]
13:24 <moritzm> installing groff bugfix updates from Buster point release [production]
12:49 <ema> cp5001: varnish-frontend-restart to test exp policy settings starting from a empty cache T275809 [production]
12:44 <moritzm> installing libbsd security updates for Buster [production]
12:39 <moritzm> installing xcftools security updates [production]
12:31 <marostegui@cumin1001> dbctl commit (dc=all): 'db1157 (re)pooling @ 100%: Repool after schema change', diff saved to https://phabricator.wikimedia.org/P15264 and previous config saved to /var/cache/conftool/dbconfig/20210408-123137-root.json [production]