llm --help

Let's help help help devs.

Target: <200ms. Several important LLM CLI tools take multiple seconds. PRs welcome ❤️

library cold warm (10 runs) version measured on
vllm --help14015ms6269ms0.20.2+cpu2026-05-12T09:22Z
sglang --help13656ms5536msv0.5.112026-05-12T09:19Z
VLMEvalKit --help13984ms5484msv0.22026-05-12T09:24Z
tensorrt-llm --help5743ms2190ms1.2.12026-05-12T09:17Z
datasets --help3424ms973ms4.8.52026-05-12T09:10Z
llm --help1412ms644ms0.312026-05-12T09:11Z
hf --help1038ms380ms1.14.02026-05-12T09:10Z
lm-eval --help1910ms330ms0.4.122026-05-12T09:13Z
langchain-cli --help819ms262ms0.0.372026-05-12T09:12Z
ollama --help15ms13ms0.23.22026-05-12T09:09Z
llama.cpp --help13ms12msb91142026-05-12T09:12Z
transformers --help0ms0ms5.8.02026-05-12T09:13Z
openai --help1ms0ms2.36.02026-05-12T09:11Z