pmetal pack-experts

Pack expert weights from a local model directory so sparse MoE models can use SSD-offloaded expert storage.

Usage:

    pmetal pack-experts \
      --model <MODEL_DIR> \
      --output <PACKED_EXPERT_DIR> \
      [OPTIONS]

For example, pack the experts of a local model at 4-bit width, then run inference against the packed directory:

    pmetal pack-experts \
      --model ./Qwen3.5-122B-A10B \
      --output ./packed_experts \
      --bits 4

    pmetal infer \
      --model ./Qwen3.5-122B-A10B \
      --experts-dir ./packed_experts \
      --prompt "Explain routed expert offload" \
      --chat

| Parameter | Default | Description |
|-----------|---------|-------------|
| --model | required | Model directory containing config.json and safetensors |
| --output | ./packed_experts | Output directory for packed expert files |
| --bits | | Optional quantization bit width, usually 4 or 2 |
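
Since --model expects a directory containing config.json and safetensors, a small pre-flight check can catch a wrong path before a long packing run starts. The sketch below is not part of pmetal: `check_model_dir` is a hypothetical helper, and it assumes the weights sit directly in the model directory as files ending in `.safetensors`.

```shell
# Hypothetical helper, not provided by pmetal.
# Assumption: weights are stored as *.safetensors files directly in the directory.
check_model_dir() {
  dir=$1
  if [ ! -f "$dir/config.json" ]; then
    echo "missing config.json in $dir" >&2
    return 1
  fi
  # Expand the glob into the function's positional parameters; if nothing
  # matches, "$1" is the literal pattern, which does not exist as a file.
  set -- "$dir"/*.safetensors
  if [ ! -e "$1" ]; then
    echo "no .safetensors files in $dir" >&2
    return 1
  fi
  return 0
}

# Usage (commented out so the snippet can be sourced without pmetal installed):
# check_model_dir ./Qwen3.5-122B-A10B && \
#   pmetal pack-experts --model ./Qwen3.5-122B-A10B --output ./packed_experts --bits 4
```

Chaining the check with `&&` means the packer runs only when both conditions hold, so a mistyped path fails immediately with a readable message.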