2023-05-08
§
|
07:11 |
<jmm@cumin2002> |
END (ERROR) - Cookbook sre.ganeti.reimage (exit_code=97) for host netflow2003.codfw.wmnet with OS bookworm |
[production] |
07:05 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage |
[production] |
07:02 |
<volans@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage |
[production] |
06:50 |
<moritzm> |
bounce ferm on aux-k8s-ctrl1001 |
[production] |
06:49 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reimage for host netflow2003.codfw.wmnet with OS bookworm |
[production] |
06:48 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.reimage (exit_code=99) for host netflow2003.codfw.wmnet with OS bookworm |
[production] |
06:48 |
<volans@cumin1001> |
START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS bullseye |
[production] |
06:48 |
<kart_> |
Deployed MinT to the production (T331505) |
[production] |
06:47 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reimage for host netflow2003.codfw.wmnet with OS bookworm |
[production] |
06:47 |
<kartik@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply |
[production] |
06:44 |
<kartik@deploy1002> |
helmfile [eqiad] START helmfile.d/services/machinetranslation: apply |
[production] |
06:43 |
<kartik@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply |
[production] |
06:40 |
<kartik@deploy1002> |
helmfile [codfw] START helmfile.d/services/machinetranslation: apply |
[production] |
06:37 |
<wm-bot> |
<legoktm> Updated from ba74081 to ce22a2e |
[tools.masto-collab] |
05:55 |
<marostegui@deploy1002> |
Finished scap: Backport for [[gerrit:915725|Revert "ProductionServices.php: Promote pc2014 to pc1 master"]] (duration: 27m 46s) |
[production] |
05:46 |
<phedenskog@deploy1002> |
Finished deploy [performance/navtiming@9b22d3b]: Measure largest contentful paint element type (duration: 00m 05s) |
[production] |
05:46 |
<phedenskog@deploy1002> |
Started deploy [performance/navtiming@9b22d3b]: Measure largest contentful paint element type |
[production] |
05:42 |
<marostegui@deploy1002> |
marostegui: Backport for [[gerrit:915725|Revert "ProductionServices.php: Promote pc2014 to pc1 master"]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
05:28 |
<marostegui@deploy1002> |
Started scap: Backport for [[gerrit:915725|Revert "ProductionServices.php: Promote pc2014 to pc1 master"]] |
[production] |
05:18 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on pc1014.eqiad.wmnet with reason: Maintenance |
[production] |
05:18 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on pc1014.eqiad.wmnet with reason: Maintenance |
[production] |
05:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1113 (s5,s6) T336029', diff saved to https://phabricator.wikimedia.org/P47783 and previous config saved to /var/cache/conftool/dbconfig/20230508-051036-root.json |
[production] |
05:06 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on dbproxy1013.eqiad.wmnet with reason: Maintenance |
[production] |
05:05 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on dbproxy1013.eqiad.wmnet with reason: Maintenance |
[production] |
04:54 |
<marostegui> |
Deploy schema change on x1 eqiad wikishared with replication dbmaint T335834 |
[production] |
2023-05-06
§
|
23:19 |
<wm-bot> |
<legoktm> Updated from ae62c97 to 5e4814f |
[tools.masto-collab] |
12:44 |
<wm-bot> |
<lucaswerkmeister> restarted webservice, 500s reported and strange php socket errors in error.log |
[tools.sal] |
08:51 |
<jelto@cumin1001> |
END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab2002.wikimedia.org with reason: Install software version upgrade |
[production] |
08:03 |
<jelto@cumin1001> |
END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab1004.wikimedia.org with reason: Install software version upgrade |
[production] |
07:50 |
<jelto@cumin1001> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Install software version upgrade |
[production] |
07:44 |
<jelto@cumin1001> |
END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab1003.wikimedia.org with reason: Install software version upgrade |
[production] |
07:07 |
<jelto@cumin1001> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Install software version upgrade |
[production] |
06:50 |
<jelto@cumin1001> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Install software version upgrade |
[production] |
2023-05-05
§
|
23:39 |
<Krinkle> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/916634 |
[releng] |
23:24 |
<tzatziki> |
removing emails from 230 users per self-requests |
[production] |
22:21 |
<bd808> |
Added "RepoLookoutBot" to hiera key "dynamicproxy::blocked_user_agent_regex" to stop unnecessary scans by https://www.repo-lookout.org/ |
[tools] |
22:20 |
<bd808> |
Added |
[tools] |
18:57 |
<brennen@deploy1002> |
Finished scap: Backport for [[gerrit:915719|Revert "api: Use RevisionStore::newRevisionsFromBatch to fetch revision records" (T336008 T336022)]] (duration: 14m 21s) |
[production] |
18:44 |
<brennen@deploy1002> |
umherirrender and brennen: Backport for [[gerrit:915719|Revert "api: Use RevisionStore::newRevisionsFromBatch to fetch revision records" (T336008 T336022)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet |
[production] |
18:42 |
<brennen@deploy1002> |
Started scap: Backport for [[gerrit:915719|Revert "api: Use RevisionStore::newRevisionsFromBatch to fetch revision records" (T336008 T336022)]] |
[production] |
18:25 |
<brennen> |
train 1.41.0-wmf.7 (T330213): trying revert for T336008, T336022 |
[production] |
17:45 |
<Krinkle> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/916527 |
[releng] |
17:42 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs2011.codfw.wmnet with OS bullseye |
[production] |
17:27 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs2011.codfw.wmnet with reason: host reimage |
[production] |