CLI
python -m lemmas <subcommand> "<query>" [flags]
Works offline against the deterministic echo stub OR with OPENAI_API_KEY
against gpt-4o-mini. All subcommands accept --json for piping.
Subcommands
python -m lemmas cove "Who invented the laser?" --n-questions 4
python -m lemmas self_consistency "What is 13 * 17?" --n 5 --extractor last_number
python -m lemmas best_of_n "Write a haiku" --n 5 --keywords "snow,arctic"
python -m lemmas drift "hello world" # requires OPENAI_API_KEY
Common flags
| Flag |
Default |
Notes |
--model |
gpt-4o-mini |
Only used with OPENAI_API_KEY. |
--temperature |
0.7 |
|
--max-tokens |
512 |
|
--json |
off |
Emit JSON instead of human-readable output. |
Per-subcommand flags
| Subcommand |
Flag |
Notes |
cove |
--n-questions N |
Verification questions to plan (default 4). |
self_consistency |
--n N |
Sample count. |
self_consistency |
--extractor X |
last_line, last_number, regex, similarity. |
self_consistency |
--regex PATTERN |
Required when --extractor regex. |
best_of_n |
--n N |
Sample count. |
best_of_n |
--target-chars N |
For default length_scorer. |
best_of_n |
--keywords k1,k2 |
Use keyword_scorer instead. |
drift |
--bucket NAME |
Bucket id (default default). |
drift |
--z-threshold F |
Drift-flag threshold (default 3.0). |
drift |
--warmup-n N |
Observations before drift can fire (default 20). |