2024-09-25
|
15:48 |
<swfrench@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1073895|debug.json: order codfw (primary) DC backends first (T370962)]] |
[production] |
15:24 |
<swfrench@deploy1003> |
Unlocked for deployment [ALL REPOSITORIES]: Datacenter Switchover - T370962 (duration: 58m 14s) |
[production] |
15:23 |
<cwhite> |
reinstall phatality on logstash1023 and repool logstash1023 and logstash1032 T374880 |
[production] |
15:21 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.09-run-puppet-on-db-masters (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:21 |
<Dreamy_Jazz> |
Running `foreachwikiindblist group2.dblist extensions/CheckUser/maintenance/populateCentralCheckUserIndexTables.php` on a tmux session for T375203 |
[production] |
15:12 |
<Dreamy_Jazz> |
Started MediaModeration script after datacenter switchover - https://wikitech.wikimedia.org/wiki/MediaModeration |
[production] |
15:12 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.09-run-puppet-on-db-masters for datacenter switchover from eqiad to codfw |
[production] |
15:09 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.09-restore-ttl (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:08 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.09-restore-ttl for datacenter switchover from eqiad to codfw |
[production] |
15:07 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.08-start-maintenance (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:05 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.08-start-maintenance for datacenter switchover from eqiad to codfw |
[production] |
15:04 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.08-restart-mw-jobrunner (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:04 |
<root@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-jobrunner: sync |
[production] |
15:03 |
<root@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-jobrunner: sync |
[production] |
15:03 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.08-restart-mw-jobrunner for datacenter switchover from eqiad to codfw |
[production] |
15:01 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.07-set-readwrite (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:01 |
<swfrench@cumin1002> |
MediaWiki read-only period ends at: 2024-09-25 15:01:28.078892 |
[production] |
15:01 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.07-set-readwrite for datacenter switchover from eqiad to codfw |
[production] |
15:01 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.06-set-db-readwrite (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:00 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.06-set-db-readwrite for datacenter switchover from eqiad to codfw |
[production] |
15:00 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.04-switch-mediawiki (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
15:00 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.04-switch-mediawiki for datacenter switchover from eqiad to codfw |
[production] |
14:59 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.03-set-db-readonly (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
14:59 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.03-set-db-readonly for datacenter switchover from eqiad to codfw |
[production] |
14:59 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.02-set-readonly (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
14:58 |
<swfrench@cumin1002> |
MediaWiki read-only period starts at: 2024-09-25 14:58:42.440378 |
[production] |
14:58 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.02-set-readonly for datacenter switchover from eqiad to codfw |
[production] |
14:54 |
<swfrench-wmf> |
reset failed units on mwmaint1002 to unblock 01-stop-maintenance - T370962 |
[production] |
14:52 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.01-stop-maintenance (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
14:52 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.01-stop-maintenance for datacenter switchover from eqiad to codfw |
[production] |
14:46 |
<swfrench@cumin1002> |
END (FAIL) - Cookbook sre.switchdc.mediawiki.01-stop-maintenance (exit_code=99) for datacenter switchover from eqiad to codfw |
[production] |
14:45 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.01-stop-maintenance for datacenter switchover from eqiad to codfw |
[production] |
14:41 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.00-reduce-ttl (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
14:35 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.00-reduce-ttl for datacenter switchover from eqiad to codfw |
[production] |
14:34 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.00-downtime-db-readonly-checks (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
14:34 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.00-downtime-db-readonly-checks for datacenter switchover from eqiad to codfw |
[production] |
14:33 |
<swfrench@cumin1002> |
END (PASS) - Cookbook sre.switchdc.mediawiki.00-disable-puppet (exit_code=0) for datacenter switchover from eqiad to codfw |
[production] |
14:33 |
<swfrench@cumin1002> |
START - Cookbook sre.switchdc.mediawiki.00-disable-puppet for datacenter switchover from eqiad to codfw |
[production] |
14:26 |
<swfrench@deploy1003> |
Locking from deployment [ALL REPOSITORIES]: Datacenter Switchover - T370962 |
[production] |
14:24 |
<jynus> |
run delete on cawiki (s7) db2181 (row format) T375507 |
[production] |
14:22 |
<kartik@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1075567|CX3 Build 0.2.0+20240925 (T374387 T370746 T368422 T374567 T355780 T374559 T374886 T375410)]] (duration: 14m 06s) |
[production] |
14:18 |
<kartik@deploy1003> |
kartik, sbisson: Continuing with sync |
[production] |
14:10 |
<kartik@deploy1003> |
kartik, sbisson: Backport for [[gerrit:1075567|CX3 Build 0.2.0+20240925 (T374387 T370746 T368422 T374567 T355780 T374559 T374886 T375410)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:08 |
<kartik@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1075567|CX3 Build 0.2.0+20240925 (T374387 T370746 T368422 T374567 T355780 T374559 T374886 T375410)]] |
[production] |
13:50 |
<swfrench-wmf> |
kartotherian repooled in eqiad due to load issues - T370962 |
[production] |
13:47 |
<_joe_> |
repooling kartotherian in eqiad, plus a further rolling restart in codfw |
[production] |
13:31 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
13:31 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
13:31 |
<_joe_> |
rolling restart of kartotherian in codfw |
[production] |
13:29 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
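The length of the MediaWiki read-only window during the switchover can be worked out from the two timestamps logged by the 02-set-readonly and 07-set-readwrite entries above (14:58 and 15:01). The sketch below is only an illustration of that arithmetic: the timestamp strings are copied verbatim from the log, while the function name `read_only_duration` and the format constant are hypothetical helpers and not part of any cookbook.

```python
from datetime import datetime

# Timestamps copied verbatim from the 14:58 and 15:01 log entries.
READ_ONLY_START = "2024-09-25 14:58:42.440378"
READ_ONLY_END = "2024-09-25 15:01:28.078892"

# Assumed format of the logged timestamps (date, time, microseconds).
FMT = "%Y-%m-%d %H:%M:%S.%f"


def read_only_duration(start: str, end: str) -> float:
    """Return the elapsed time between two log timestamps, in seconds."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return delta.total_seconds()


seconds = read_only_duration(READ_ONLY_START, READ_ONLY_END)
minutes, rest = divmod(seconds, 60)
print(f"read-only for {int(minutes)}m {rest:.3f}s")
```

Both timestamps come from the same host (cumin1002), so no time zone handling is attempted here.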