2301-2350 of 10000 results (38ms)
2020-11-30 ยง
21:51 <mutante> parse2001 - scap pull [production]
21:51 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=parse2001.codfw.wmnet [production]
21:46 <andrewbogott> deleting project as per https://wikitech.wikimedia.org/wiki/News/Cloud_VPS_2020_Purge#techblog [techblog]
21:45 <razzi@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
21:44 <andrewbogott> deleting project as per https://wikitech.wikimedia.org/wiki/News/Cloud_VPS_2020_Purge#grantreview [grantreview]
21:42 <andrewbogott> deleting project as per https://wikitech.wikimedia.org/wiki/News/Cloud_VPS_2020_Purge#DELETE_sccache [sccache]
21:40 <andrewbogott> deleting project as per communication here with legoktm [butterfly]
21:38 <razzi@cumin1001> START - Cookbook sre.hosts.decommission [production]
21:27 <legoktm> building php8.0 images https://gerrit.wikimedia.org/r/643777 [releng]
21:00 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
20:58 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime [production]
20:58 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
20:55 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime [production]
20:47 <razzi@cumin1001> START - Cookbook sre.ganeti.makevm [production]
20:47 <pt1979@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
20:45 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
20:43 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime [production]
20:43 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime [production]
20:42 <mutante> reimaging deploy2002 with buster (not active, deploy1001/2001 are) T265963 [production]
20:39 <mutante> reimaging parse2001 (parsoid canary) with buster (T268524) [production]
20:36 <dzahn@cumin1001> conftool action : set/pooled=inactive; selector: name=parse2001.codfw.wmnet [production]
20:33 <mutante> depooling parse2001 to prepare for reimage T268524 [production]
20:32 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=parse2001.codfw.wmnet [production]
20:28 <mutante> reimaging deploy1002 with buster - not the active deployment server, deploy1001 still is (T265963) [production]
20:10 <ariel@deploy1001> Finished deploy [dumps/dumps@2f4d931]: per job batches for page content. step one. (duration: 00m 04s) [production]
20:10 <ariel@deploy1001> Started deploy [dumps/dumps@2f4d931]: per job batches for page content. step one. [production]
19:52 <papaul> power down ms-be2059 for RAID re-configuration [production]
19:47 <mutante> added Sukhbir to Ops vendor maintenance calendar permissions to make changes and share like all of SRE (T229860) [production]
19:23 <ppchelko@deploy1001> Synchronized wmf-config/CommonSettings.php: gerrit:644236 Decrease OAuth token expiration (duration: 00m 56s) [production]
19:17 <ppchelko@deploy1001> Synchronized wmf-config/InitialiseSettings.php: gerrit:644243 group2: switch ParserCache to JSON (duration: 00m 58s) [production]
19:14 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:11 <James_F> Zuul: [mediawiki/extensions/WikimediaApiPortalOAuth] Mark as in-prod and enable coverage reports T262495 [releng]
19:09 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
18:22 <bstorm> 1.17 upgrade for kubernetes complete T268669 [paws]
18:12 <andrewbogott> removing all osds from cloudcephosd1015 in order to investigate T268746 [admin]
17:51 <joal> Deploy refinery onto hdfs [analytics]
17:49 <joal> Kill-restart mediawiki-history-load job after refactor (1 coordinator per table) and tables addition [analytics]
17:47 <joal@deploy1001> Finished deploy [analytics/refinery@9db742d] (thin): Analytics special deploy before first of month - Hotfix -- THIN [analytics/refinery@9db742d] (duration: 00m 08s) [production]
17:47 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:47 <joal@deploy1001> Started deploy [analytics/refinery@9db742d] (thin): Analytics special deploy before first of month - Hotfix -- THIN [analytics/refinery@9db742d] [production]
17:43 <joal@deploy1001> Finished deploy [analytics/refinery@9db742d]: Analytics special deploy before first of month - Hotfix [analytics/refinery@9db742d] (duration: 11m 32s) [production]
17:37 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
17:32 <joal> Kill-restart mediawiki-history-reduced job for druid-public datasource number of shards update [analytics]
17:32 <joal> Deploy refinery using scap for naming hotfix [analytics]
17:31 <joal@deploy1001> Started deploy [analytics/refinery@9db742d]: Analytics special deploy before first of month - Hotfix [analytics/refinery@9db742d] [production]
17:25 <bstorm> upgrading the worker nodes (this will likely kill services briefly when some pods are rescheduled) T268669 [paws]
17:14 <bstorm> updated the calico-kube-controllers deployment to use our internal registry to deal with docker-hub rate-limiting T268669 T269016 [paws]
17:08 <chicocvenancio> delete orphaned jupyter server pod `kubectl -n prod delete pod jupyter--45volutionoftheuniverse`. Respective server not running in jupyter admin UI. [paws]
17:07 <moritzm> reset failed (now obsolete idp-u2f-sync/stunnel4 services on idp1001 [production]
16:50 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes:weight=10; selector: dc=eqiad,cluster=maps,service=kartotherian,name=maps1008.eqiad.wmnet [production]