Showing 1 - 2 of 2 for search: '"Zabreyko, Anton"'
We present MLTCP, a technique to augment today's congestion control algorithms to accelerate DNN training jobs in shared GPU clusters. MLTCP enables the communication phases of jobs that compete for network bandwidth to interleave with each other, th
External link:
http://arxiv.org/abs/2402.09589
Serverless computing has made it easier than ever to deploy applications over scalable cloud resources, all the while driving higher utilization for cloud providers. While this technique has worked well for easily divisible resources like CPU and loc
External link:
http://arxiv.org/abs/2212.08146