2021-01-09
§
|
21:55 |
<wm-bot> |
<lucaswerkmeister> deployed 9a604413d3 (German toponym) |
[tools.lexeme-forms] |
15:11 |
<elukey> |
restart timers 'analytics-*' on labstore100[6,7] to apply new permission settings |
[analytics] |
08:31 |
<elukey> |
restart the failed hdfs rsync timers on labstore100[6,7] to kick off the remaining jobs |
[analytics] |
08:30 |
<elukey> |
execute hdfs chmod o+x of /wmf/data/archive/projectview /wmf/data/archive/projectview/legacy /wmf/data/archive/pageview/legacy to unblock hdfs rsyncs |
[analytics] |
08:24 |
<elukey> |
execute "sudo -u hdfs kerberos-run-command hdfs hdfs dfs -chmod o+rx /wmf/data/archive/pageview" to unblock labstore hdfs rsyncs |
[analytics] |
08:21 |
<elukey> |
execute "sudo -u hdfs kerberos-run-command hdfs hdfs dfs -chmod o+rx /wmf/data/archive/geoeditors" to unblock labstore hdfs rsync |
[analytics] |
04:30 |
<James_F> |
Zuul: [mediawiki/libs/RemexHtml] Enable PHP 8.0 jobs, now passing T271575 |
[releng] |
04:30 |
<James_F> |
Zuul: [mediawiki/libs/Equivset] Enable PHP 8.0 jobs, now passing T271575 |
[releng] |
00:52 |
<legoktm> |
bunch of restarts, repos with non-master default branches should be properly supported |
[codesearch] |
00:11 |
<mutante> |
puppetmaster2003 - restarted apache after spweing 500s |
[production] |
2021-01-08
§
|
23:47 |
<mutante> |
- shutting down and removing instance wikistats-dancing-goat - backup stored on -wild-tiger |
[wikistats] |
23:35 |
<mutante> |
deleting web proxy wikistats-old |
[wikistats] |
19:48 |
<andrew@deploy1001> |
Finished deploy [striker/deploy@e4db843]: Striker deploy for T269004 (duration: 02m 11s) |
[production] |
19:45 |
<andrew@deploy1001> |
Started deploy [striker/deploy@e4db843]: Striker deploy for T269004 |
[production] |
19:28 |
<andrew@deploy1001> |
Finished deploy [horizon/deploy@7466703]: Horizon with a bunch of Buster patches (duration: 02m 35s) |
[production] |
19:26 |
<andrew@deploy1001> |
Started deploy [horizon/deploy@7466703]: Horizon with a bunch of Buster patches |
[production] |
18:54 |
<joal> |
Restart jobs for permissions-fix (clickstream, mediacounts-archive, geoeditors-public_monthly, geoeditors-yearly, mobile_app-uniques-[daily|monthly], pageview-daily_dump, pageview-hourly, projectview-geo, unique_devices-[per_domain|per_project_family]-[daily|monthly]) |
[analytics] |
18:14 |
<joal> |
Restart projectview-hourly job (permissions test) |
[analytics] |
18:03 |
<joal> |
Deploy refinery onto HDFS |
[analytics] |
18:02 |
<joal@deploy1001> |
Finished deploy [analytics/refinery@db9da3c] (thin): Hotfix analytics deployment - THIN [analytics/refinery@db9da3c] (duration: 00m 07s) |
[production] |
18:02 |
<joal@deploy1001> |
Started deploy [analytics/refinery@db9da3c] (thin): Hotfix analytics deployment - THIN [analytics/refinery@db9da3c] |
[production] |
18:01 |
<joal@deploy1001> |
Finished deploy [analytics/refinery@db9da3c]: Hotfix analytics deployment [analytics/refinery@db9da3c] (duration: 11m 27s) |
[production] |
17:50 |
<joal> |
deploy refinery with scap |
[analytics] |
17:50 |
<joal@deploy1001> |
Started deploy [analytics/refinery@db9da3c]: Hotfix analytics deployment [analytics/refinery@db9da3c] |
[production] |
17:33 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on maps2007.codfw.wmnet with reason: Downtiming while not pooled |
[production] |
17:33 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on maps2007.codfw.wmnet with reason: Downtiming while not pooled |
[production] |
17:15 |
<hnowlan@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2 days, 12:00:00 on maps2007.codfw.wmnet with reason: Downtiming while not pooled |
[production] |
17:15 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on maps2007.codfw.wmnet with reason: Downtiming while not pooled |
[production] |
17:15 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on maps1009.eqiad.wmnet with reason: Downtiming while not pooled |
[production] |
17:15 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on maps1009.eqiad.wmnet with reason: Downtiming while not pooled |
[production] |
17:10 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on labweb1001.wikimedia.org with reason: REIMAGE |
[production] |
17:08 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on labweb1001.wikimedia.org with reason: REIMAGE |
[production] |
16:58 |
<wm-bot> |
<lucaswerkmeister> deployed 4feee0dd9c (finish toolforge.org migration) |
[tools.pagepile-visual-filter] |
16:50 |
<razzi@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:43 |
<razzi@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
16:42 |
<andrewbogott> |
shutting down labweb1001 so I can really believe that all traffic is being served by 1002 |
[production] |
16:35 |
<andrew@deploy1001> |
Finished deploy [horizon/deploy@7466703]: selective disable of problematic compression block (duration: 01m 42s) |
[production] |
16:33 |
<andrew@deploy1001> |
Started deploy [horizon/deploy@7466703]: selective disable of problematic compression block |
[production] |
16:32 |
<andrew@deploy1001> |
Finished deploy [horizon/deploy@7466703]: selective disable of problematic compression block (duration: 01m 52s) |
[production] |
16:30 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
16:30 |
<andrew@deploy1001> |
Started deploy [horizon/deploy@7466703]: selective disable of problematic compression block |
[production] |
16:24 |
<razzi@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
15:59 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
15:58 |
<andrew@deploy1001> |
Finished deploy [horizon/deploy@ecaad83]: minor django package upgrades -> labweb1002 (duration: 04m 25s) |
[production] |