2024-03-08
Β§
|
20:46 |
<mutante> |
planet1003/2003: apt-get remove prometheus-apache-exporter - T359596 |
[production] |
20:28 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
20:28 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
18:51 |
<taavi> |
arm cumin_master keyholder key on cumin1002 after ganeti1033 froze and rebooted |
[production] |
18:38 |
<cdanis> |
β cdanis@ganeti1027.eqiad.wmnet ~ πβ sudo gnt-node migrate -f ganeti1033.eqiad.wmnet |
[production] |
18:20 |
<cdanis> |
βcdanis@ganeti1027.eqiad.wmnet ~ πβ sudo gnt-node failover -f ganeti1033.eqiad.wmnet |
[production] |
18:17 |
<cdanis> |
forcibly rebooting ganeti1033 |
[production] |
18:13 |
<cdanis> |
β cdanis@ganeti1027.eqiad.wmnet ~ πβ sudo gnt-node migrate -f ganeti1033.eqiad.wmnet |
[production] |
18:04 |
<Dreamy_Jazz> |
Stopped scan on group 2 wiki (test complete) |
[production] |
17:55 |
<Dreamy_Jazz> |
Running `foreachwikiindblist group2.dblist extensions/MediaModeration/maintenance/scanFilesInScanTable.php --use-jobqueue --sleep 1 --verbose 2>&1 | tee ~/scan-files-in-scan-table-group2-sleep-1-no-render-now.txt` on a tmux session |
[production] |
15:32 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp4037.ulsfo.wmnet |
[production] |
15:32 |
<fabfur> |
repooling cp4037 for this weekend, all log-format changes are reverted (T351117) |
[production] |
15:28 |
<fabfur@cumin2002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp4037.ulsfo.wmnet |
[production] |
15:28 |
<fabfur@cumin2002> |
START - Cookbook sre.hosts.remove-downtime for cp4037.ulsfo.wmnet |
[production] |
14:33 |
<isaranto@deploy2002> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . |
[production] |
14:20 |
<eevans@deploy2002> |
Finished deploy [cassandra/logstash-logback-encoder@910b77d]: Deploying to updated target list β T354560 (duration: 00m 35s) |
[production] |
14:20 |
<eevans@deploy2002> |
Started deploy [cassandra/logstash-logback-encoder@910b77d]: Deploying to updated target list β T354560 |
[production] |
14:17 |
<eevans@deploy2002> |
Finished deploy [cassandra/logstash-logback-encoder@c200e79]: Deploying to updated target list β T354560 (duration: 00m 36s) |
[production] |
14:16 |
<eevans@deploy2002> |
Started deploy [cassandra/logstash-logback-encoder@c200e79]: Deploying to updated target list β T354560 |
[production] |
14:11 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase1041.eqiad.wmnet with reason: Bootstrapping β T354560 |
[production] |
14:11 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase1041.eqiad.wmnet with reason: Bootstrapping β T354560 |
[production] |
14:01 |
<arturo> |
update deb packages on bookworm thirdparty/kubeadm-k8s-1-24 for T359619 (apt1002) |
[production] |
10:30 |
<fabfur@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp4037.ulsfo.wmnet with reason: T358109 |
[production] |
10:30 |
<jnuche@deploy2002> |
Installation of scap version "4.70.1" completed for 374 hosts |
[production] |
10:30 |
<fabfur@cumin2002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on cp4037.ulsfo.wmnet with reason: T358109 |
[production] |
10:29 |
<jnuche@deploy2002> |
Installing scap version "4.70.1" for 374 hosts |
[production] |
10:08 |
<taavi@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2001-dev.codfw.wmnet with OS bookworm |
[production] |
09:49 |
<fabfur@cumin1002> |
conftool action : set/pooled=no; selector: name=cp4037.ulsfo.wmnet |
[production] |
09:49 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp4037.ulsfo.wmnet |
[production] |
09:40 |
<taavi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2001-dev.codfw.wmnet with reason: host reimage |
[production] |
09:39 |
<jnuche@deploy2002> |
Finished deploy [releng/jenkins-deploy@9bf7445] (releasing): (no justification provided) (duration: 00m 40s) |
[production] |
09:38 |
<jnuche@deploy2002> |
Started deploy [releng/jenkins-deploy@9bf7445] (releasing): (no justification provided) |
[production] |
09:38 |
<taavi@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2001-dev.codfw.wmnet with reason: host reimage |
[production] |
09:27 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2108 (re)pooling @ 100%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58687 and previous config saved to /var/cache/conftool/dbconfig/20240308-092705-arnaudb.json |
[production] |
09:26 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2106 (re)pooling @ 100%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58686 and previous config saved to /var/cache/conftool/dbconfig/20240308-092621-arnaudb.json |
[production] |
09:25 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2105 (re)pooling @ 100%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58685 and previous config saved to /var/cache/conftool/dbconfig/20240308-092546-arnaudb.json |
[production] |
09:17 |
<taavi@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt2001-dev.codfw.wmnet with OS bookworm |
[production] |
09:12 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2108 (re)pooling @ 75%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58684 and previous config saved to /var/cache/conftool/dbconfig/20240308-091159-arnaudb.json |
[production] |
09:11 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2106 (re)pooling @ 75%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58683 and previous config saved to /var/cache/conftool/dbconfig/20240308-091115-arnaudb.json |
[production] |
09:10 |
<kart_> |
Updated cxserver to 2024-03-08-084626-production (T359525) |
[production] |
09:10 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2105 (re)pooling @ 75%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58682 and previous config saved to /var/cache/conftool/dbconfig/20240308-091041-arnaudb.json |
[production] |
09:09 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
09:09 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
09:08 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
09:07 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
08:56 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2108 (re)pooling @ 50%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58681 and previous config saved to /var/cache/conftool/dbconfig/20240308-085654-arnaudb.json |
[production] |
08:56 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2106 (re)pooling @ 50%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58680 and previous config saved to /var/cache/conftool/dbconfig/20240308-085610-arnaudb.json |
[production] |
08:55 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2105 (re)pooling @ 50%: Temporary repool for the weekend', diff saved to https://phabricator.wikimedia.org/P58679 and previous config saved to /var/cache/conftool/dbconfig/20240308-085536-arnaudb.json |
[production] |
08:53 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
08:53 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |