3801-3850 of 10000 results (59ms)
2024-01-10 §
06:50 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2143.codfw.wmnet with reason: host reimage [production]
06:32 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db2143.codfw.wmnet with OS bookworm [production]
2024-01-09 §
23:37 <andrewbogott> restarting harbor-db in an attempt to reform harbor -- T354714 [tools]
23:30 <andrewbogott> rebooting tools-harbor-1 in a feeble attempt to get it to work (docker-compose can't restart it) [tools]
23:12 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds-builder [tools]
23:12 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-builder [tools]
23:11 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component builds.builder [tools]
23:11 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds.builder [tools]
21:33 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
21:28 <aqu> airflow-dags/analytics(_test) are both deployed [analytics]
21:23 <aqu@deploy2002> Finished deploy [airflow-dags/analytics@ea53374]: Regular airflow-dags/analytics weekly train [airflow-dags@ea53374f] (duration: 00m 28s) [production]
21:22 <aqu@deploy2002> Started deploy [airflow-dags/analytics@ea53374]: Regular airflow-dags/analytics weekly train [airflow-dags@ea53374f] [production]
21:21 <aqu@deploy2002> Finished deploy [airflow-dags/analytics_test@ea53374]: Regular airflow-dags/analytics_test weekly train [airflow-dags@ea53374f] (duration: 00m 12s) [production]
21:21 <aqu@deploy2002> Started deploy [airflow-dags/analytics_test@ea53374]: Regular airflow-dags/analytics_test weekly train [airflow-dags@ea53374f] [production]
21:18 <aqu> analytics/refinery not deployed fully on test cluster. Ticket for the bug here: https://phabricator.wikimedia.org/T354703 [analytics]
21:07 <aqu> Deployed refinery using scap, then deployed onto hdfs [analytics]
21:03 <aqu@deploy2002> Finished deploy [analytics/refinery@c4fed56] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@c4fed56c] (test number 2 after permission error) (duration: 00m 05s) [production]
21:03 <aqu@deploy2002> Started deploy [analytics/refinery@c4fed56] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@c4fed56c] (test number 2 after permission error) [production]
21:02 <aqu@deploy2002> Finished deploy [analytics/refinery@c4fed56] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@c4fed56c] (duration: 03m 33s) [production]
20:59 <aqu@deploy2002> Started deploy [analytics/refinery@c4fed56] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@c4fed56c] [production]
20:59 <aqu@deploy2002> Finished deploy [analytics/refinery@c4fed56] (thin): Regular analytics weekly train THIN [analytics/refinery@c4fed56c] (duration: 00m 06s) [production]
20:58 <aqu@deploy2002> Started deploy [analytics/refinery@c4fed56] (thin): Regular analytics weekly train THIN [analytics/refinery@c4fed56c] [production]
20:58 <aqu@deploy2002> Finished deploy [analytics/refinery@c4fed56]: Regular analytics weekly train [analytics/refinery@c4fed56c] (duration: 09m 06s) [production]
20:49 <eevans@cumin1002> conftool action : set/weight=0; selector: cluster=restbase,dc=codfw,name=restbase2019.codfw.wmnet [production]
20:49 <eevans@cumin1002> conftool action : set/weight=0; selector: cluster=restbase,dc=codfw,name=restbase2014.codfw.wmnet [production]
20:49 <eevans@cumin1002> conftool action : set/weight=0; selector: cluster=restbase,dc=codfw,name=restbase2013.codfw.wmnet [production]
20:49 <aqu@deploy2002> Started deploy [analytics/refinery@c4fed56]: Regular analytics weekly train [analytics/refinery@c4fed56c] [production]
20:48 <aqu> about to deploy analytics/refinery - weekly train [production]
20:48 <aqu> about to deploy analytics/refinery - weekly train [analytics]
20:40 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: group0 wikis to 1.42.0-wmf.13 refs T350089 [production]
20:26 <jhuneidi@deploy2002> Finished scap: testwikis wikis to 1.42.0-wmf.13 refs T350089 (duration: 23m 33s) [production]
20:03 <jhuneidi@deploy2002> Started scap: testwikis wikis to 1.42.0-wmf.13 refs T350089 [production]
19:44 <mutante> mwmaint1002 - rm -rf 1.42.0-wmf.7 ; mwmamint2002 - rm -rf php-1.39.0-wmf.25 [production]
19:35 <mutante> mwmaint1002 - rm -rf /srv/mediawiki/php-1.40.0-wmf.17 [production]
19:33 <mutante> mwmaint1002 - rm -rf /srv/mediawiki/php-1.39.0-wmf.25 after monitoring alerted about 99% disk usage on /srv [production]
19:25 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: all wikis to 1.42.0-wmf.12 refs T350089 [production]
19:16 <urandom> decommissioning cassandra, restbase2013-{a,b,c} — T352469 [production]
19:14 <jhuneidi@deploy2002> Finished scap: testwikis wikis to 1.42.0-wmf.13 refs T350089 (duration: 45m 48s) [production]
18:42 <cmooney@cumin1002> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1001-1002].eqiad.wmnet with reason: Release v0.6.5 - cmooney@cumin1002 [production]
18:40 <cmooney@cumin1002> START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1001-1002].eqiad.wmnet with reason: Release v0.6.5 - cmooney@cumin1002 [production]
18:34 <wm-bot> <anticomposite> ./stewardbots/StewardBot/manage.sh restart # RC not working [tools.stewardbots]
18:29 <jhuneidi@deploy2002> Started scap: testwikis wikis to 1.42.0-wmf.13 refs T350089 [production]
18:04 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:04 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add new reverse entries for mr1 -> lsw1-a2 link in codfw - cmooney@cumin1002" [production]
18:02 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add new reverse entries for mr1 -> lsw1-a2 link in codfw - cmooney@cumin1002" [production]
18:00 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
17:50 <wmbot~fran@wmf3169> END (PASS) - Cookbook wmcs.do_log_msg (exit_code=0) (T346631) [admin]
17:49 <wmbot~fran@wmf3169> test message2 from local cookbook (T346631) [admin]
17:49 <wmbot~fran@wmf3169> START - Cookbook wmcs.do_log_msg (T346631) [admin]
17:43 <wmbot~fran@wmf3169> %(message)s (T346631) [admin]