2023-03-21
§
|
09:39 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 3:00:00 on kafka-main1005.eqiad.wmnet with reason: Stop kafka, attempt to reimage |
[production] |
09:25 |
<phedenskog@deploy2002> |
Finished deploy [performance/navtiming@d2b97ad]: (no justification provided) (duration: 00m 06s) |
[production] |
09:25 |
<phedenskog@deploy2002> |
Started deploy [performance/navtiming@d2b97ad]: (no justification provided) |
[production] |
09:06 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on cephosd[1001-1005].eqiad.wmnet with reason: Systemd units failing, pupper tries to bring them up periodically, spam on IRC |
[production] |
09:05 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on cephosd[1001-1005].eqiad.wmnet with reason: Systemd units failing, pupper tries to bring them up periodically, spam on IRC |
[production] |
08:31 |
<elukey> |
move purged daemons on cp nodes to a new CA bundle (to allow accepting kafka clients using PKI tls certs) - T319372 |
[production] |
06:50 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 13150 |
[production] |
06:49 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 13150 |
[production] |
03:57 |
<mwpresync@deploy2002> |
Pruned MediaWiki: 1.40.0-wmf.26 (duration: 02m 18s) |
[production] |
03:55 |
<mwpresync@deploy2002> |
Finished scap: testwikis wikis to 1.41.0-wmf.1 refs T330207 (duration: 52m 38s) |
[production] |
03:02 |
<mwpresync@deploy2002> |
Started scap: testwikis wikis to 1.41.0-wmf.1 refs T330207 |
[production] |
2023-03-20
§
|
22:00 |
<samtar@deploy2002> |
Finished scap: Backport for [[gerrit:901275|Add languages to Minerva HTML (T331905)]] (duration: 09m 45s) |
[production] |
21:52 |
<samtar@deploy2002> |
jdlrobson and samtar: Backport for [[gerrit:901275|Add languages to Minerva HTML (T331905)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
21:50 |
<samtar@deploy2002> |
Started scap: Backport for [[gerrit:901275|Add languages to Minerva HTML (T331905)]] |
[production] |
21:34 |
<TheresNoTime> |
`[samtar@mwmaint2002 ~]$ mwscript maintenance/namespaceDupes.php --wiki shwiki --fix` T332614 |
[production] |
21:25 |
<TheresNoTime> |
closing UTC late backport window, extended |
[production] |
21:22 |
<samtar@deploy2002> |
Finished scap: Backport for [[gerrit:901276|Rename project and project talk namespace for shwiki (T332614)]] (duration: 12m 22s) |
[production] |
21:11 |
<samtar@deploy2002> |
samtar and aleksandar: Backport for [[gerrit:901276|Rename project and project talk namespace for shwiki (T332614)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
21:10 |
<samtar@deploy2002> |
Started scap: Backport for [[gerrit:901276|Rename project and project talk namespace for shwiki (T332614)]] |
[production] |
21:09 |
<ebernhardson@deploy2002> |
Finished deploy [airflow-dags/search@1302ca2]: ensure swift_upload delete_after is an integer (duration: 00m 13s) |
[production] |
21:09 |
<ebernhardson@deploy2002> |
Started deploy [airflow-dags/search@1302ca2]: ensure swift_upload delete_after is an integer |
[production] |
21:09 |
<samtar@deploy2002> |
Finished scap: Backport for [[gerrit:898845|Enable new Vector (2022) "Add topic" button at arwiki (T331313)]], [[gerrit:898846|Enable DiscussionTools usability improvements at arwiki (T329407)]] (duration: 08m 34s) |
[production] |
21:02 |
<samtar@deploy2002> |
matmarex and samtar: Backport for [[gerrit:898845|Enable new Vector (2022) "Add topic" button at arwiki (T331313)]], [[gerrit:898846|Enable DiscussionTools usability improvements at arwiki (T329407)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet |
[production] |
21:00 |
<TheresNoTime> |
extending UTC late backport window |
[production] |
21:00 |
<samtar@deploy2002> |
Started scap: Backport for [[gerrit:898845|Enable new Vector (2022) "Add topic" button at arwiki (T331313)]], [[gerrit:898846|Enable DiscussionTools usability improvements at arwiki (T329407)]] |
[production] |
20:58 |
<kharlan@deploy2002> |
Finished scap: Backport for [[gerrit:901146|TryNewTask: Set an array fallback if TryNewTaskOptOuts is null]], [[gerrit:900685|PostEdit: Increment the edit-count-for-task-type count (T332319)]], [[gerrit:900684|LevelingUpManager: Handle links/link-recommendation collision (T332309)]] (duration: 10m 28s) |
[production] |
20:49 |
<kharlan@deploy2002> |
kharlan: Backport for [[gerrit:901146|TryNewTask: Set an array fallback if TryNewTaskOptOuts is null]], [[gerrit:900685|PostEdit: Increment the edit-count-for-task-type count (T332319)]], [[gerrit:900684|LevelingUpManager: Handle links/link-recommendation collision (T332309)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmn |
[production] |
20:47 |
<kharlan@deploy2002> |
Started scap: Backport for [[gerrit:901146|TryNewTask: Set an array fallback if TryNewTaskOptOuts is null]], [[gerrit:900685|PostEdit: Increment the edit-count-for-task-type count (T332319)]], [[gerrit:900684|LevelingUpManager: Handle links/link-recommendation collision (T332309)]] |
[production] |
19:48 |
<mutante> |
miscweb1003 - manually edit /srv/deployment/iegreview/iegreview-cache/.config and replace tin.eqiad.wmnet with deployment.eqiad.wmnet (which is an alias for deploy2002.codfw.wmnet) T257317 T332623 T331896 |
[production] |
19:13 |
<ebernhardson@deploy2002> |
Finished deploy [airflow-dags/search@b16917e]: fix templating in SimpleSkeinOperator (duration: 00m 13s) |
[production] |
19:13 |
<ebernhardson@deploy2002> |
Started deploy [airflow-dags/search@b16917e]: fix templating in SimpleSkeinOperator |
[production] |
18:56 |
<ejegg> |
switched back to new PayPal pending transaction resolver |
[production] |
18:48 |
<akosiaris@deploy2002> |
Synchronized private/PrivateSettings.php: (no justification provided) (duration: 06m 28s) |
[production] |
18:47 |
<akosiaris> |
emergency rollover of redis password complete |
[production] |
18:45 |
<akosiaris> |
re-enable puppet on rdb*, netbox*, ores*, registry* |
[production] |
18:42 |
<ebernhardson@deploy2002> |
Finished deploy [airflow-dags/search@3aaecb7]: safely quote spark args in skein script (duration: 00m 13s) |
[production] |
18:42 |
<ebernhardson@deploy2002> |
Started deploy [airflow-dags/search@3aaecb7]: safely quote spark args in skein script |
[production] |
18:42 |
<ejegg> |
civicrm upgraded from 3d3606f1 to 09373b9d |
[production] |
18:32 |
<akosiaris@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:32 |
<akosiaris@deploy2002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:32 |
<akosiaris@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:32 |
<akosiaris@deploy2002> |
helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:31 |
<akosiaris@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:30 |
<akosiaris@deploy2002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:30 |
<akosiaris@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:30 |
<akosiaris@deploy2002> |
helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:30 |
<akosiaris@deploy2002> |
helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:30 |
<akosiaris@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:28 |
<akosiaris@deploy2002> |
helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync |
[production] |
18:28 |
<akosiaris@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync |
[production] |