admin管理员组文章数量:1390822
I'm working with NVIDIA's MIG technology for parallel inference execution. Although the technology works well, I wanted to know if anyone knows how to reassign resources to an already created partition, whether they are unused resources or how to remove them from an in-use partition and assign them to another.
For now, my approach consists of stopping a process, destroying the partition, and regenerating it, but that causes a very large overhead, both due to the technology itself and also in terms of the time required to load the model, especially when resources need to be allocated from a partition in use. Does anyone know if these reassignments can be done on the fly to avoid overhead? I've seen that in Kubernetes-based systems it's possible. Does anyone know if it's possible to do something similar without relying on containers or virtual environments?
本文标签: cudaIs it possible to dynamically allocate resources in MIG partitions on NVIDIA GPUsStack Overflow
版权声明:本文标题:cuda - Is it possible to dynamically allocate resources in MIG partitions on NVIDIA GPUs? - Stack Overflow 内容由网友自发贡献,该文观点仅代表作者本人, 转载请联系作者并注明出处:http://www.betaflare.com/web/1744753855a2623342.html, 本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容,一经查实,本站将立刻删除。
发表评论