[06:24:29] first time I see transfer.py fail: [06:24:36] https://www.irccloud.com/pastebin/PFEH7qy0/ [08:19:23] FIRING: SystemdUnitFailed: prometheus-mysqld-exporter.service on db2246:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [09:21:15] Revert for trixie? [09:21:40] ah, no, revert of a revert [09:22:21] Nah, just an if missing [09:22:23] I will do it in a bit [09:22:32] ah, I see it on the revery comment [09:23:25] thank you for the deletion [09:24:56] what deletion? [09:24:57] ah the files [09:24:59] no worries :) [09:31:35] btullis: do you know what an-test-master1002 is? [09:31:46] does it need backups? [12:07:37] @marostegui we have the new test-db* VMs in case it can be useful for your tests with Trixie [12:07:52] Thank you! [12:08:03] are them on orchestrator? [12:08:14] Or how can I find them? [12:25:39] * federico3 rummages in the lowest drawer [12:26:38] they are yet to be added to zarcillo/orch/puppet, I can do it now actually [12:27:25] but you can ssh into them e.g. db-test1003.eqiad.wmnet [13:31:37] jynus: Yes, that's one of ours. It is the standby namenode for the HDFS filesystem on the analytics_test Hadoop cluster. It does take backups and we would prefer to keep them, it that's OK. [13:33:43] ok, the test name would had told me backups were redundant [13:33:50] thanks for the explanation [13:35:25] Yes, it's not a particularly high value file system, but it still has production-like pipelines running on it with e.g. 1/100 sampling of the production webrequest and that kind of thing.