"Try it with deploy: RAY_memory_monitor_refresh_ms=0 CUDA_VISIBLE_DEVICES=0,1 swift deploy --model_type qwen-7b-chat --tensor_parallel_size 2
Reference: https://github.com/modelscope/swift/blob/main/docs/source/LLM/VLLM%E6%8E%A8%E7%90%86%E5%8A%A0%E9%80%9F%E4%B8%8E%E9%83%A8%E7%BD%B2.md This answer was compiled from the DingTalk group “魔搭ModelScope开发者联盟群 ①” (ModelScope Developer Alliance Group ①)."
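
For readers who want to see what each piece of the command above does, here is a minimal sketch that rebuilds the same command from variables and checks it before running. The variable names (GPUS, TP_SIZE) and the DRY_RUN switch are hypothetical, added only for this example; they are not swift options. It assumes that tensor parallelism across 2 devices needs at least 2 visible GPUs, and that setting RAY_memory_monitor_refresh_ms=0 turns off Ray's memory monitor.

```shell
#!/usr/bin/env bash
# Sketch only: rebuilds the one-line command from the answer above.
# GPUS, TP_SIZE and DRY_RUN are illustrative names, not swift options.
set -eu

GPUS="0,1"   # GPU ids exposed to the process via CUDA_VISIBLE_DEVICES
TP_SIZE=2    # value passed to --tensor_parallel_size

# Count the comma-separated GPU ids.
gpu_count=$(printf '%s' "$GPUS" | tr ',' '\n' | grep -c .)

# Assumption: tensor parallelism needs at least TP_SIZE visible GPUs.
if [ "$gpu_count" -lt "$TP_SIZE" ]; then
  echo "need $TP_SIZE GPUs but only $gpu_count are visible" >&2
  exit 1
fi

# RAY_memory_monitor_refresh_ms=0 disables Ray's memory monitor.
cmd="RAY_memory_monitor_refresh_ms=0 CUDA_VISIBLE_DEVICES=$GPUS swift deploy --model_type qwen-7b-chat --tensor_parallel_size $TP_SIZE"
echo "$cmd"

# Dry run by default; set DRY_RUN=0 to actually start the deployment.
if [ "${DRY_RUN:-1}" = "0" ]; then
  env RAY_memory_monitor_refresh_ms=0 CUDA_VISIBLE_DEVICES="$GPUS" \
    swift deploy --model_type qwen-7b-chat --tensor_parallel_size "$TP_SIZE"
fi
```

Run as-is, the script only prints the command. Running it with DRY_RUN=0 executes the same command as in the answer, which requires swift to be installed and both GPUs to be available.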