pmetal bench
Benchmark various aspects of PMetal’s performance on your hardware.
Subcommands
Section titled “Subcommands”Benchmark training throughput (tokens/second, step time).
pmetal bench --model Qwen/Qwen3-0.6B --batch-size 4bench-gen
Section titled “bench-gen”Benchmark the generation loop — tokens per second, time to first token, and decode latency.
pmetal bench-gen --model Qwen/Qwen3-0.6B --prompt "Hello" --max-tokens 100bench-ffi
Section titled “bench-ffi”Benchmark FFI overhead between Rust and Metal/MLX.
pmetal bench-ffiSee Also
Section titled “See Also”- Hardware Support — Hardware capabilities
- Kernel Tuning — Per-tier optimizations