| 2024-05-16 |
    
  | 22:02 | <ebernhardson@deploy1002> | Finished deploy [airflow-dags/search@cb359e4]: add dags to collect daily webrequest and satisfaction search metrics (duration: 00m 25s) | [production] | 
            
  | 22:02 | <ebernhardson@deploy1002> | Started deploy [airflow-dags/search@cb359e4]: add dags to collect daily webrequest and satisfaction search metrics | [production] | 
            
  | 21:52 | <jsn@deploy1002> | cscott and jsn: Backport for [[gerrit:1032435|[JsonCodec, ParserCache] Improve debugging of serializability failures (T365036)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 21:49 | <jsn@deploy1002> | Started scap: Backport for [[gerrit:1032435|[JsonCodec, ParserCache] Improve debugging of serializability failures (T365036)]] | [production] | 
            
  | 21:31 | <jsn@deploy1002> | Finished scap: Backport for [[gerrit:1032571|Update VE core submodule to master (27296e0e3) (T230323 T365052)]] (duration: 25m 10s) | [production] | 
            
  | 21:11 | <jsn@deploy1002> | jsn and esanders: Continuing with sync | [production] | 
            
  | 21:09 | <mutante> | LDAP - added uid rickijay to group nda (T365138) | [production] | 
            
  | 21:08 | <jsn@deploy1002> | jsn and esanders: Backport for [[gerrit:1032571|Update VE core submodule to master (27296e0e3) (T230323 T365052)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 21:06 | <jsn@deploy1002> | Started scap: Backport for [[gerrit:1032571|Update VE core submodule to master (27296e0e3) (T230323 T365052)]] | [production] | 
            
  | 21:05 | <mutante> | LDAP - added uid dmuthuri to group wmf T364320 | [production] | 
            
  | 20:43 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 20:43 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 20:43 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1227 (T352010)', diff saved to https://phabricator.wikimedia.org/P62548 and previous config saved to /var/cache/conftool/dbconfig/20240516-204342-ladsgroup.json | [production] | 
            
  | 20:33 | <eevans@cumin1002> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for aqs1013.eqiad.wmnet | [production] | 
            
  | 20:33 | <eevans@cumin1002> | START - Cookbook sre.hosts.remove-downtime for aqs1013.eqiad.wmnet | [production] | 
            
  | 20:33 | <mutante> | contint2002 - as usual have to manually "a2dismod mpm_event" on a machine using apache that has just been installed to fix the race condition with apache modules | [production] | 
            
  | 20:33 | <dzahn@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host contint2002.wikimedia.org with OS bullseye | [production] | 
            
  | 20:31 | <jdrewniak@deploy1002> | Finished scap: Backport for [[gerrit:1032398|Fix exclude list for dark mode (T365084)]] (duration: 22m 36s) | [production] | 
            
  | 20:28 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P62547 and previous config saved to /var/cache/conftool/dbconfig/20240516-202834-ladsgroup.json | [production] | 
            
  | 20:14 | <dzahn@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on contint2002.wikimedia.org with reason: host reimage | [production] | 
            
  | 20:13 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P62546 and previous config saved to /var/cache/conftool/dbconfig/20240516-201326-ladsgroup.json | [production] | 
            
  | 20:12 | <jdrewniak@deploy1002> | jdrewniak and mabualruz: Continuing with sync | [production] | 
            
  | 20:11 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on contint2002.wikimedia.org with reason: host reimage | [production] | 
            
  | 20:11 | <jdrewniak@deploy1002> | jdrewniak and mabualruz: Backport for [[gerrit:1032398|Fix exclude list for dark mode (T365084)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 20:08 | <wmbot~lucaswerkmeister@tools-bastion-13> | deployed 8f374ee202 (l10n updates: es, zh-hans) | [tools.wd-image-positions] | 
            
  | 20:08 | <ryankemper> | [Hadoop] Restarted `hadoop-hdfs-datanode` on `an-worker1172` | [production] | 
            
  | 20:08 | <jdrewniak@deploy1002> | Started scap: Backport for [[gerrit:1032398|Fix exclude list for dark mode (T365084)]] | [production] | 
            
  | 20:06 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db1168 (T352010)', diff saved to https://phabricator.wikimedia.org/P62545 and previous config saved to /var/cache/conftool/dbconfig/20240516-200618-ladsgroup.json | [production] | 
            
  | 20:06 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1168.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 20:06 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1168.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 20:05 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1165 (T352010)', diff saved to https://phabricator.wikimedia.org/P62544 and previous config saved to /var/cache/conftool/dbconfig/20240516-200552-ladsgroup.json | [production] | 
            
  | 20:03 | <ryankemper@cumin2002> | END (FAIL) - Cookbook sre.hadoop.roll-restart-workers (exit_code=99) restart workers for Hadoop analytics cluster: Roll restart of jvm daemons for openjdk upgrade. | [production] | 
            
  | 19:58 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1227 (T352010)', diff saved to https://phabricator.wikimedia.org/P62543 and previous config saved to /var/cache/conftool/dbconfig/20240516-195817-ladsgroup.json | [production] | 
            
  | 19:55 | <dzahn@cumin1002> | START - Cookbook sre.hosts.reimage for host contint2002.wikimedia.org with OS bullseye | [production] | 
            
  | 19:50 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P62542 and previous config saved to /var/cache/conftool/dbconfig/20240516-195044-ladsgroup.json | [production] | 
            
  | 19:46 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db2166 (T364299)', diff saved to https://phabricator.wikimedia.org/P62541 and previous config saved to /var/cache/conftool/dbconfig/20240516-194613-marostegui.json | [production] | 
            
  | 19:46 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2166.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 19:45 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2166.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 19:45 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2165 (T364299)', diff saved to https://phabricator.wikimedia.org/P62540 and previous config saved to /var/cache/conftool/dbconfig/20240516-194548-marostegui.json | [production] | 
            
  | 19:35 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P62539 and previous config saved to /var/cache/conftool/dbconfig/20240516-193535-ladsgroup.json | [production] | 
            
  | 19:30 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2165', diff saved to https://phabricator.wikimedia.org/P62538 and previous config saved to /var/cache/conftool/dbconfig/20240516-193040-marostegui.json | [production] | 
            
  | 19:20 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1165 (T352010)', diff saved to https://phabricator.wikimedia.org/P62537 and previous config saved to /var/cache/conftool/dbconfig/20240516-192027-ladsgroup.json | [production] | 
            
  | 19:15 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2165', diff saved to https://phabricator.wikimedia.org/P62536 and previous config saved to /var/cache/conftool/dbconfig/20240516-191532-marostegui.json | [production] | 
            
  | 19:00 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2165 (T364299)', diff saved to https://phabricator.wikimedia.org/P62535 and previous config saved to /var/cache/conftool/dbconfig/20240516-190024-marostegui.json | [production] | 
            
  | 18:58 | <dzahn@cumin2002> | START - Cookbook sre.hosts.reimage for host contint2002.wikimedia.org with OS buster | [production] | 
            
  | 18:46 | <dzahn@cumin2002> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host contint2002.wikimedia.org with OS bullseye | [production] | 
            
  | 18:32 | <vriley@cumin1002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-main1006.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:17 | <dzahn@cumin2002> | START - Cookbook sre.hosts.reimage for host contint2002.wikimedia.org with OS bullseye | [production] | 
            
  | 18:15 | <cmooney@cumin1002> | END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host contint2002.wikimedia.org | [production] | 
            
  | 18:13 | <cmooney@cumin1002> | START - Cookbook sre.hosts.dhcp for host contint2002.wikimedia.org | [production] |