1701-1750 of 10000 results (89ms)
2024-02-21 §
21:51 <urandom> boostrapping Cassandra, restbase1034-{a,b,c} — T354560 [production]
21:46 <jhuneidi@deploy2002> anzx and jhuneidi: Continuing with sync [production]
21:45 <jhuneidi@deploy2002> anzx and jhuneidi: Backport for [[gerrit:1005476|cswiki, commonswiki, enwiki: fix IP cap date and IP for WikiGap Editathon (T357978)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:44 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncmonitor1001.eqiad.wmnet with OS bookworm [production]
21:43 <jhuneidi@deploy2002> Started scap: Backport for [[gerrit:1005476|cswiki, commonswiki, enwiki: fix IP cap date and IP for WikiGap Editathon (T357978)]] [production]
21:42 <jhuneidi@deploy2002> Finished scap: Backport for [[gerrit:1005569|Remove Japanese Wikipedia from projects sharing user scripts (T301212)]], [[gerrit:1005570|Enable night mode on beta cluster (T357759)]] (duration: 15m 25s) [production]
21:40 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P57638 and previous config saved to /var/cache/conftool/dbconfig/20240221-214052-arnaudb.json [production]
21:34 <jhuneidi@deploy2002> jdlrobson and jhuneidi: Continuing with sync [production]
21:32 <rzl@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
21:31 <rzl@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
21:31 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncmonitor1001.eqiad.wmnet with reason: host reimage [production]
21:29 <jhuneidi@deploy2002> jdlrobson and jhuneidi: Backport for [[gerrit:1005569|Remove Japanese Wikipedia from projects sharing user scripts (T301212)]], [[gerrit:1005570|Enable night mode on beta cluster (T357759)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:27 <jhuneidi@deploy2002> Started scap: Backport for [[gerrit:1005569|Remove Japanese Wikipedia from projects sharing user scripts (T301212)]], [[gerrit:1005570|Enable night mode on beta cluster (T357759)]] [production]
21:27 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncmonitor1001.eqiad.wmnet with reason: host reimage [production]
21:26 <jhuneidi@deploy2002> Finished scap: Backport for [[gerrit:999062|Turn on Parsoid read views by default on officewiki (T355566)]] (duration: 15m 19s) [production]
21:25 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P57637 and previous config saved to /var/cache/conftool/dbconfig/20240221-212546-arnaudb.json [production]
21:24 <rzl@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
21:24 <rzl@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
21:19 <rzl@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
21:18 <rzl@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
21:18 <jhuneidi@deploy2002> cscott and jhuneidi: Continuing with sync [production]
21:17 <rzl@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
21:17 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host ncmonitor1001.eqiad.wmnet with OS bookworm [production]
21:17 <rzl@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
21:12 <jhuneidi@deploy2002> cscott and jhuneidi: Backport for [[gerrit:999062|Turn on Parsoid read views by default on officewiki (T355566)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:11 <jhuneidi@deploy2002> Started scap: Backport for [[gerrit:999062|Turn on Parsoid read views by default on officewiki (T355566)]] [production]
21:10 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2159 (T357189)', diff saved to https://phabricator.wikimedia.org/P57636 and previous config saved to /var/cache/conftool/dbconfig/20240221-211039-arnaudb.json [production]
21:00 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db2159 (T357189)', diff saved to https://phabricator.wikimedia.org/P57635 and previous config saved to /var/cache/conftool/dbconfig/20240221-210001-arnaudb.json [production]
20:59 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
20:59 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
20:59 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2159.codfw.wmnet with reason: Maintenance [production]
20:59 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2159.codfw.wmnet with reason: Maintenance [production]
20:59 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150 (T357189)', diff saved to https://phabricator.wikimedia.org/P57634 and previous config saved to /var/cache/conftool/dbconfig/20240221-205922-arnaudb.json [production]
20:54 <jhuneidi@deploy2002> Synchronized php: group1 wikis to 1.42.0-wmf.19 refs T354437 (duration: 08m 35s) [production]
20:46 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: group1 wikis to 1.42.0-wmf.19 refs T354437 [production]
20:44 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P57633 and previous config saved to /var/cache/conftool/dbconfig/20240221-204415-arnaudb.json [production]
20:39 <ebernhardson@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
20:39 <ebernhardson@deploy2002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
20:33 <ejegg> turned off nightly recurring charge job for Autorescue deployment [production]
20:29 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P57632 and previous config saved to /var/cache/conftool/dbconfig/20240221-202906-arnaudb.json [production]
20:16 <jhuneidi@deploy2002> scap failed: average error rate on 4/4 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org for details) [production]
20:14 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2150 (T357189)', diff saved to https://phabricator.wikimedia.org/P57631 and previous config saved to /var/cache/conftool/dbconfig/20240221-201400-arnaudb.json [production]
20:11 <jhuneidi@deploy2002> Finished scap: Backport for [[gerrit:1005481|CentralAuthHooks::onGetUserBlock: Only run for reg. users (T358112)]] (duration: 14m 09s) [production]
20:03 <jhuneidi@deploy2002> jhuneidi and matmarex: Continuing with sync [production]
20:02 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db2150 (T357189)', diff saved to https://phabricator.wikimedia.org/P57630 and previous config saved to /var/cache/conftool/dbconfig/20240221-200209-arnaudb.json [production]
20:02 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance [production]
20:02 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance [production]
20:01 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2122 (T357189)', diff saved to https://phabricator.wikimedia.org/P57629 and previous config saved to /var/cache/conftool/dbconfig/20240221-200148-arnaudb.json [production]
19:58 <jhuneidi@deploy2002> jhuneidi and matmarex: Backport for [[gerrit:1005481|CentralAuthHooks::onGetUserBlock: Only run for reg. users (T358112)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
19:57 <jhuneidi@deploy2002> Started scap: Backport for [[gerrit:1005481|CentralAuthHooks::onGetUserBlock: Only run for reg. users (T358112)]] [production]