How hard is it to deploy pretrained models on GPUs without tons of YAML files and unmanaged instances?
Thanks for writing this, it was a great read. What are some things you think Modal is still lacking?
Maybe fine-grained control over the container scheduling algorithm? There's an experimental feature for this: https://modal.com/docs/guide/concurrent-inputs
In regards to storage, there is support for persistent file systems but it's a bit different than S3 - so maybe object storage support as well?
Thanks for writing this, it was a great read. What are some things you think Modal is still lacking?
Maybe fine-grained control over the container scheduling algorithm? There's an experimental feature for this: https://modal.com/docs/guide/concurrent-inputs
In regards to storage, there is support for persistent file systems but it's a bit different than S3 - so maybe object storage support as well?