Is your feature request related to a problem? Please describe. Currently somewhere in my collection is a lora for which the links to example images are 404ing on ...
We use this server to run Unmute; on a L40S GPU, we can serve 64 simultaneous connections at a real-time factor of 3x. I'm running the 1b model with the default config provided in the readme. Is this ...