651-700 of 10000 results (80ms)
2023-03-21 §
09:39 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on kafka-main1005.eqiad.wmnet with reason: Stop kafka, attempt to reimage [production]
09:25 <phedenskog@deploy2002> Finished deploy [performance/navtiming@d2b97ad]: (no justification provided) (duration: 00m 06s) [production]
09:25 <phedenskog@deploy2002> Started deploy [performance/navtiming@d2b97ad]: (no justification provided) [production]
09:06 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on cephosd[1001-1005].eqiad.wmnet with reason: Systemd units failing, pupper tries to bring them up periodically, spam on IRC [production]
09:05 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on cephosd[1001-1005].eqiad.wmnet with reason: Systemd units failing, pupper tries to bring them up periodically, spam on IRC [production]
08:31 <elukey> move purged daemons on cp nodes to a new CA bundle (to allow accepting kafka clients using PKI tls certs) - T319372 [production]
06:50 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 13150 [production]
06:49 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 13150 [production]
03:57 <mwpresync@deploy2002> Pruned MediaWiki: 1.40.0-wmf.26 (duration: 02m 18s) [production]
03:55 <mwpresync@deploy2002> Finished scap: testwikis wikis to 1.41.0-wmf.1 refs T330207 (duration: 52m 38s) [production]
03:02 <mwpresync@deploy2002> Started scap: testwikis wikis to 1.41.0-wmf.1 refs T330207 [production]
2023-03-20 §
22:00 <samtar@deploy2002> Finished scap: Backport for [[gerrit:901275|Add languages to Minerva HTML (T331905)]] (duration: 09m 45s) [production]
21:52 <samtar@deploy2002> jdlrobson and samtar: Backport for [[gerrit:901275|Add languages to Minerva HTML (T331905)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
21:50 <samtar@deploy2002> Started scap: Backport for [[gerrit:901275|Add languages to Minerva HTML (T331905)]] [production]
21:34 <TheresNoTime> `[samtar@mwmaint2002 ~]$ mwscript maintenance/namespaceDupes.php --wiki shwiki --fix` T332614 [production]
21:25 <TheresNoTime> closing UTC late backport window, extended [production]
21:22 <samtar@deploy2002> Finished scap: Backport for [[gerrit:901276|Rename project and project talk namespace for shwiki (T332614)]] (duration: 12m 22s) [production]
21:11 <samtar@deploy2002> samtar and aleksandar: Backport for [[gerrit:901276|Rename project and project talk namespace for shwiki (T332614)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet [production]
21:10 <samtar@deploy2002> Started scap: Backport for [[gerrit:901276|Rename project and project talk namespace for shwiki (T332614)]] [production]
21:09 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@1302ca2]: ensure swift_upload delete_after is an integer (duration: 00m 13s) [production]
21:09 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@1302ca2]: ensure swift_upload delete_after is an integer [production]
21:09 <samtar@deploy2002> Finished scap: Backport for [[gerrit:898845|Enable new Vector (2022) "Add topic" button at arwiki (T331313)]], [[gerrit:898846|Enable DiscussionTools usability improvements at arwiki (T329407)]] (duration: 08m 34s) [production]
21:02 <samtar@deploy2002> matmarex and samtar: Backport for [[gerrit:898845|Enable new Vector (2022) "Add topic" button at arwiki (T331313)]], [[gerrit:898846|Enable DiscussionTools usability improvements at arwiki (T329407)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
21:00 <TheresNoTime> extending UTC late backport window [production]
21:00 <samtar@deploy2002> Started scap: Backport for [[gerrit:898845|Enable new Vector (2022) "Add topic" button at arwiki (T331313)]], [[gerrit:898846|Enable DiscussionTools usability improvements at arwiki (T329407)]] [production]
20:58 <kharlan@deploy2002> Finished scap: Backport for [[gerrit:901146|TryNewTask: Set an array fallback if TryNewTaskOptOuts is null]], [[gerrit:900685|PostEdit: Increment the edit-count-for-task-type count (T332319)]], [[gerrit:900684|LevelingUpManager: Handle links/link-recommendation collision (T332309)]] (duration: 10m 28s) [production]
20:49 <kharlan@deploy2002> kharlan: Backport for [[gerrit:901146|TryNewTask: Set an array fallback if TryNewTaskOptOuts is null]], [[gerrit:900685|PostEdit: Increment the edit-count-for-task-type count (T332319)]], [[gerrit:900684|LevelingUpManager: Handle links/link-recommendation collision (T332309)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmn [production]
20:47 <kharlan@deploy2002> Started scap: Backport for [[gerrit:901146|TryNewTask: Set an array fallback if TryNewTaskOptOuts is null]], [[gerrit:900685|PostEdit: Increment the edit-count-for-task-type count (T332319)]], [[gerrit:900684|LevelingUpManager: Handle links/link-recommendation collision (T332309)]] [production]
19:48 <mutante> miscweb1003 - manually edit /srv/deployment/iegreview/iegreview-cache/.config and replace tin.eqiad.wmnet with deployment.eqiad.wmnet (which is an alias for deploy2002.codfw.wmnet) T257317 T332623 T331896 [production]
19:13 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@b16917e]: fix templating in SimpleSkeinOperator (duration: 00m 13s) [production]
19:13 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@b16917e]: fix templating in SimpleSkeinOperator [production]
18:56 <ejegg> switched back to new PayPal pending transaction resolver [production]
18:48 <akosiaris@deploy2002> Synchronized private/PrivateSettings.php: (no justification provided) (duration: 06m 28s) [production]
18:47 <akosiaris> emergency rollover of redis password complete [production]
18:45 <akosiaris> re-enable puppet on rdb*, netbox*, ores*, registry* [production]
18:42 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@3aaecb7]: safely quote spark args in skein script (duration: 00m 13s) [production]
18:42 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@3aaecb7]: safely quote spark args in skein script [production]
18:42 <ejegg> civicrm upgraded from 3d3606f1 to 09373b9d [production]
18:32 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
18:32 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
18:32 <akosiaris@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
18:32 <akosiaris@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: sync [production]
18:31 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
18:30 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
18:30 <akosiaris@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
18:30 <akosiaris@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: sync [production]
18:30 <akosiaris@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
18:30 <akosiaris@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: sync [production]
18:28 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
18:28 <akosiaris@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync [production]