Federation troubleshooting

Ruud@lemmy.world · 1 year ago

tal@kbin.social · 1 year ago

Thanks for your work and sharing results!

I think kbin and lemmy will ultimately have to record per-instance response times and back off from any instance that is struggling. If a remote instance is failing or overloaded, the sending instance should reduce how often it tries to contact it, so that a ton of workers don’t end up tied up waiting on that one instance.
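To make the idea concrete, here is a rough sketch of per-instance tracking in Python. It is not how kbin or lemmy actually do it; the names `InstanceHealth` and `FederationScheduler`, the thresholds, and the doubling delay are all assumptions picked for illustration.

```python
import time
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class InstanceHealth:
    """Delivery state for one remote instance (hypothetical structure)."""
    avg_response_s: Optional[float] = None  # moving average of response time
    consecutive_failures: int = 0
    next_attempt_at: float = 0.0  # skip this instance until this timestamp


class FederationScheduler:
    """Tracks remote instances and decides whether to contact them now."""

    def __init__(self, base_delay_s=60.0, max_delay_s=3600.0,
                 slow_threshold_s=5.0, alpha=0.2):
        # All defaults are illustrative assumptions, not tuned values.
        self.base_delay_s = base_delay_s
        self.max_delay_s = max_delay_s
        self.slow_threshold_s = slow_threshold_s
        self.alpha = alpha
        self.instances: Dict[str, InstanceHealth] = {}

    def _get(self, host: str) -> InstanceHealth:
        return self.instances.setdefault(host, InstanceHealth())

    def should_attempt(self, host: str, now: Optional[float] = None) -> bool:
        """True if the backoff window for this host has passed."""
        now = time.time() if now is None else now
        return now >= self._get(host).next_attempt_at

    def record(self, host: str, response_s: float, ok: bool,
               now: Optional[float] = None) -> None:
        """Record one delivery attempt and update the host's backoff."""
        now = time.time() if now is None else now
        h = self._get(host)
        if h.avg_response_s is None:
            h.avg_response_s = response_s
        else:
            h.avg_response_s = (self.alpha * response_s
                                + (1 - self.alpha) * h.avg_response_s)
        if ok and h.avg_response_s < self.slow_threshold_s:
            # Healthy: clear the backoff.
            h.consecutive_failures = 0
            h.next_attempt_at = now
        else:
            # Failing or slow on average: double the wait, up to a cap.
            h.consecutive_failures += 1
            delay = min(self.max_delay_s,
                        self.base_delay_s * 2 ** (h.consecutive_failures - 1))
            h.next_attempt_at = now + delay
```

A worker would call `should_attempt(host)` before picking up a delivery and `record(...)` afterwards, so one slow instance only costs a single attempt per backoff window instead of tying up the whole pool.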

The Quuuuuill@slrpnk.net · 1 year ago

I’d probably recommend exponential backoff with a low maximum number of retries.
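Something along these lines, as a minimal Python sketch. The function name `deliver_with_backoff`, the default of 3 retries, and the delay values are assumptions for illustration; the jitter is an extra that is often paired with backoff, not part of the suggestion itself.

```python
import random
import time


def deliver_with_backoff(send, max_retries=3, base_delay_s=1.0,
                         max_delay_s=30.0, sleep=time.sleep,
                         rng=random.random):
    """Call send() until it succeeds or the retries run out.

    Returns True on success, False after giving up.
    `send` is a hypothetical callable that raises on failure.
    """
    for attempt in range(max_retries + 1):
        try:
            send()
            return True
        except Exception:
            # Broad catch keeps the sketch short; real code should
            # only retry errors that are actually transient.
            if attempt == max_retries:
                return False
            # Delay doubles on each retry: base, 2*base, 4*base, ... capped.
            delay = min(max_delay_s, base_delay_s * 2 ** attempt)
            # Jitter: sleep between 50% and 100% of the computed delay so
            # that many senders do not all retry at the same moment.
            sleep(delay * (0.5 + rng() / 2))
    return False
```

With a low `max_retries` the worker gives up quickly and frees itself for other instances; the failed activity can then be re-queued or dropped, depending on what the software chooses to do.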

NuclearArmWrestling@lemmy.world · 1 year ago

Ideally, multiple instances could band together and create something like a hub that they all push and pull from. It’s a little more centralization, but would likely significantly reduce overall network and CPU consumption.