2021-09-03
19:33 <krinkle@deploy1002> Started deploy [integration/docroot@6492b3d]: I48480e89e5f6 [production]
19:26 <bd808@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'toolhub' for release 'main' . [production]
19:19 <bstorm> adding config group validation rules for postgresql and mysql T290349 [trove]
19:14 <bstorm> adding config group validation rules for mariadb 10.5.10 T290349 [trove]
19:04 <ryankemper> T290330 `ryankemper@cumin1001:~$ sudo -E cumin 'P{wdqs2*}' 'sudo rm -fv /etc/cron.hourly/restart-blazegraph'` (Cleaned up manually created crons now that we have [somewhat hacky] systemd timers doing the same job) [production]
17:42 <dduvall@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
17:42 <dduvall> deploying blubberoid:2021-09-03-160524-production to eqiad/codfw (https://gerrit.wikimedia.org/r/c/blubber/+/716519) (T289367) [releng]
17:40 <dduvall@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'blubberoid' for release 'production' . [production]
17:39 <andrewbogott> restarting celery workers and reloading web UI to pick up timeout changes [quarry]
17:36 <dduvall> staging blubberoid to deploy https://gerrit.wikimedia.org/r/c/blubber/+/716519 [releng]
17:35 <dduvall@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . [production]
17:17 <ryankemper> T290330 Deployed https://gerrit.wikimedia.org/r/c/operations/puppet/+/717508 across `wdqs` fleet; codfw wdqs hosts will restart on average once per hour now to address ongoing availability issues for wdqs codfw [production]
16:45 <bstorm> set live wait_timeout variable to 28800 (the default) on the trove instance T290291 [quarry]
16:32 <bd808@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'toolhub' for release 'main' . [production]
16:10 <gehel> blazegraph (public codfw cluster) will now restart every hour - T290330 [production]
16:05 <wm-bot> <lucaswerkmeister> updated venv (includes mwparserfromhell 0.6.3) [tools.quickcategories]
15:54 <wm-bot> <lucaswerkmeister> deployed 3698f0b79c (add passive forms to Norwegian Bokmal verbs) [tools.lexeme-forms]
15:53 <jbond> enable puppet fleet wide to post puppetdb database maintenance - T263578 [production]
15:35 <wm-bot> <lucaswerkmeister> deployed 8051248b60 (l10n updates) [tools.lexeme-forms]
15:34 <bstorm> rebooting labstore1005 to disconnect the drives from labstore1004 T290318 [admin]
15:24 <bstorm> stopping puppet and disabling backup syncs to labstore1005 on cloudbackup2002 T290318 [admin]
15:21 <jbond> create lvm snapshot puppetdb2002_data_snapshot on ganeti2023 - T263578 [production]
15:20 <bstorm> stopping puppet and disabling backup syncs to labstore1005 on cloudbackup2001 T290318 [admin]
15:17 <jbond> create lvm snapshot puppetdb1002_data_snapshot on ganeti1012 - T263578 [production]
15:00 <jbond> disable puppet fleet wide to perform puppetdb database maintenance - T263578 [production]
14:58 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
14:58 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
14:35 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:29 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
14:20 <mutante> mw2264 - scap pull [production]
14:18 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
14:18 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
13:11 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts mc1027.eqiad.wmnet [production]
13:10 <dcausse> installing openjdk-8-dbg on wdqs2007 [production]
13:04 <jiji@cumin1001> START - Cookbook sre.hosts.decommission for hosts mc1027.eqiad.wmnet [production]
13:02 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts mc1023.eqiad.wmnet [production]
12:49 <majavah> deploying new tools-manifest version [tools]
12:48 <jiji@cumin1001> START - Cookbook sre.hosts.decommission for hosts mc1023.eqiad.wmnet [production]
12:46 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts mc[1035-1036].eqiad.wmnet [production]
12:32 <jiji@cumin1001> START - Cookbook sre.hosts.decommission for hosts mc[1035-1036].eqiad.wmnet [production]
12:12 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts mc[1028-1032].eqiad.wmnet [production]
12:03 <joal@deploy1002> Finished deploy [analytics/refinery@7208d3d] (thin): Analytics hotfix deploy (bis) THIN [analytics/refinery@7208d3d] (duration: 00m 06s) [production]
12:03 <joal@deploy1002> Started deploy [analytics/refinery@7208d3d] (thin): Analytics hotfix deploy (bis) THIN [analytics/refinery@7208d3d] [production]
12:03 <joal@deploy1002> Finished deploy [analytics/refinery@7208d3d]: Analytics hotfix deploy (bis)[analytics/refinery@7208d3d] (duration: 19m 16s) [production]
11:56 <dcausse@deploy1002> Finished deploy [wdqs/wdqs@8361ac9]: ban queries from a generic UA (duration: 19m 21s) [production]
11:44 <joal@deploy1002> Started deploy [analytics/refinery@7208d3d]: Analytics hotfix deploy (bis)[analytics/refinery@7208d3d] [production]
11:43 <joal> Deploying refinery to hotfix mediarequest cassandra3 loading jobs (second) [analytics]
11:42 <marostegui> Remove flaggedrevs_stats2 and flaggedrevs_stats from enwiki - T289050 [production]
11:37 <dcausse@deploy1002> Started deploy [wdqs/wdqs@8361ac9]: ban queries from a generic UA [production]
11:36 <dcausse@deploy1002> Finished deploy [wdqs/wdqs@8361ac9]: ban queries from a generic UA (duration: 01m 07s) [production]