2022-02-21
§
|
02:13 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2152.codfw.wmnet with OS bullseye |
[production] |
02:04 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P21062 and previous config saved to /var/cache/conftool/dbconfig/20220221-020438-ladsgroup.json |
[production] |
01:57 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2152.codfw.wmnet with reason: host reimage |
[production] |
01:54 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2152.codfw.wmnet with reason: host reimage |
[production] |
01:49 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P21061 and previous config saved to /var/cache/conftool/dbconfig/20220221-014934-ladsgroup.json |
[production] |
01:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.reimage for host db2152.codfw.wmnet with OS bullseye |
[production] |
01:38 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2152 (T302185)', diff saved to https://phabricator.wikimedia.org/P21060 and previous config saved to /var/cache/conftool/dbconfig/20220221-013811-ladsgroup.json |
[production] |
01:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2152.codfw.wmnet with reason: Maintenance |
[production] |
01:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2152.codfw.wmnet with reason: Maintenance |
[production] |
01:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1168 (T298554)', diff saved to https://phabricator.wikimedia.org/P21059 and previous config saved to /var/cache/conftool/dbconfig/20220221-013429-ladsgroup.json |
[production] |
01:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1168 (T298554)', diff saved to https://phabricator.wikimedia.org/P21058 and previous config saved to /var/cache/conftool/dbconfig/20220221-012649-ladsgroup.json |
[production] |
01:26 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
01:26 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
01:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T298554)', diff saved to https://phabricator.wikimedia.org/P21057 and previous config saved to /var/cache/conftool/dbconfig/20220221-012642-ladsgroup.json |
[production] |
01:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P21056 and previous config saved to /var/cache/conftool/dbconfig/20220221-011137-ladsgroup.json |
[production] |
00:56 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P21055 and previous config saved to /var/cache/conftool/dbconfig/20220221-005632-ladsgroup.json |
[production] |
00:41 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T298554)', diff saved to https://phabricator.wikimedia.org/P21054 and previous config saved to /var/cache/conftool/dbconfig/20220221-004128-ladsgroup.json |
[production] |
00:16 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1113:3316 (T298554)', diff saved to https://phabricator.wikimedia.org/P21053 and previous config saved to /var/cache/conftool/dbconfig/20220221-001641-ladsgroup.json |
[production] |
00:16 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1113.eqiad.wmnet with reason: Maintenance |
[production] |
00:16 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1113.eqiad.wmnet with reason: Maintenance |
[production] |
2022-02-20
§
|
23:53 |
<wm-bot> |
<bd808> Update to 922ce7c (T294142) |
[tools.toolinfo-scraper] |
23:03 |
<wm-bot> |
<bd808> Bridge #wikimedia-ve to Telegram (T299326) |
[tools.bridgebot] |
19:49 |
<andrewbogott> |
moving nfs service from quarry-nfs-1 (bullseye) to quarry-nfs-2 (buster), testing to see if T302154 is a kernal or nfs-version issue |
[quarry] |
19:23 |
<taavi> |
hard rebooted quarry-nfs-1 again T302154 |
[quarry] |
12:55 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
12:51 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
12:51 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
12:47 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
12:27 |
<taavi@deploy1002> |
Synchronized private/PrivateSettings.php: T302047 (duration: 00m 49s) |
[production] |
10:32 |
<qchris> |
Manually triggering replication run of Gerrit's analytics/datahub to populate newly created analytics-datahub GitHub repo |
[releng] |
2022-02-19
§
|
18:39 |
<wm-bot> |
<bd808> Moved csp-report-prune to kubernetes |
[tools.csp-report] |
17:52 |
<wm-bot> |
<bd808> Move IgnoreNicks restrictions to per-gateway configuration (T296093) |
[tools.bridgebot] |
17:35 |
<wm-bot> |
<bd808> Joining #wikipedia-es-wikiproyectos (T301216) |
[tools.bridgebot] |
17:03 |
<wm-bot> |
<bd808> Updated to matterbridge v1.24.0 for new /chatid telegram command |
[tools.bridgebot] |
16:56 |
<wm-bot> |
<bd808> Move static-cleaner cron job to buster grid |
[tools.bridgebot] |
16:50 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:49 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
16:49 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:48 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
16:40 |
<ladsgroup@deploy1002> |
Synchronized private/PrivateSettings.php: (no justification provided) (duration: 00m 48s) |
[production] |
16:38 |
<ladsgroup@deploy1002> |
Synchronized private/PrivateSettings.php: (no justification provided) (duration: 00m 48s) |
[production] |
16:38 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:37 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
16:37 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:36 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
16:19 |
<wm-bot> |
<bd808> Moved updatetools job from stretch grid to toolforge-jobs |
[tools.admin] |
14:04 |
<taavi> |
reboot quarry-nfs-1 T302154 |
[quarry] |
12:24 |
<_joe_> |
restarted php-fpm on wtp1027 |
[production] |
12:21 |
<elukey> |
stop puppet on an-launcher1002, stop timers for eventlogging_to_druid_network_flows_internal_{hourly,daily} since no data is coming to the Kafka topic (expected due to some work for the Marseille DC) and it keeps alarming |
[analytics] |
12:19 |
<taavi> |
restart trafficserver-tls on deployment-cache-text06 |
[releng] |