Limiting Hugging Face Bandwidth

Limiting Hugging Face Bandwidth

Downloading LLM models locally is something you do occasionally, but when you pull a massive 122B parameter model like Sehyo/Qwen3.5-122B-A10B-NVFP4, the download hogs the entire connection and leaves the rest of the household without internet. The Hugging Face CLI (huggingface-cli or hf) doesn’t have a --limit-rate flag, so you need to find alternatives. In this post I explain two ways to limit bandwidth on Linux using Docker (my preferred method) or Wondershaper at the host level. …

February 22, 2026 · 9 min