ArgMojo — Overall Planning

A command-line argument parser library for Mojo.

1. Why ArgMojo?

I created this project to support my experiments with a CLI-based Chinese character search engine in Mojo, as well as a CLI-based calculator for Decimo.

At the moment, Mojo does not have a mature command-line argument parsing library. This is a fundamental component for any CLI tool, and building it from scratch will benefit my projects and future projects.

2. Cross-Language Research Summary

This section summarises the key design patterns and features from well-known arg parsers across multiple languages. The goal is to extract universally useful ideas that are feasible in Mojo 0.26.1, and to exclude features that depend on language-specific capabilities (macros, decorators, reflection, closures-as-first-class) that Mojo does not yet provide.

2.1 Libraries Surveyed

Library	Language	Style	Key Insight for ArgMojo
argparse (stdlib)	Python	Builder (add_argument)	Comprehensive feature set; nargs, choices, type conversion, subcommands, argument groups, mutually exclusive groups, metavar, suggest_on_error, BooleanOptionalAction
Click	Python	Decorator-based	Composable commands, lazy-loaded subcommands, context passing — decorator approach not applicable
cobra + pflag	Go	Struct-based builder	Subcommands with persistent/local flags, flag groups (mutually exclusive, required together, one required), command aliases, Levenshtein-distance suggestions, positional arg validators (ExactArgs, MinimumNArgs, etc.)
clap	Rust	Builder + Derive	Builder API is the reference model; Derive API uses macros (not available in Mojo)
docopt	Python/multi	Usage-string-driven	Generates parser from help text — elegant but too implicit for a typed language

2.2 Universal Features Worth Adopting

These features appear across multiple libraries and depend only on string operations and basic data structures.

Feature	argparse	Click	cobra	clap	Other	Planned phase
Long/short options with values	✓	✓	✓	✓		Done
Positional arguments	✓	✓	✓	✓		Done
Boolean flags	✓	✓	✓	✓		Done
Default values	✓	✓	✓	✓		Done
Required argument validation	✓	✓	✓	✓		Done
`--` stop marker	✓	✓	✓	✓		Done
Auto `--help` / `-h` / `-?`	✓	✓	✓	✓		Done
Auto `--version` / `-V`	✓	✓	✓	✓		Done
Short flag merging (`-abc`)	✓	—	✓	✓		Done
Display name for value	✓	—	—	✓		Done
Positional arg count validation	—	—	✓	✓		Done
Choices / enum validation	✓	✓	—	✓		Done
Mutually exclusive flags	✓	—	✓	✓		Done
Flags required together	—	—	✓	—		Done
`--no-X` negation flags	✓ (3.9)	—	—	✓		Done
Long option prefix matching	✓	—	—	—		Done
Append / collect action	✓	✓	✓	✓		Done
One-required group	—	—	✓	✓		Done
Value delimiter (`--tag a,b,c`)	—	—	✓	✓		Done
Colored help (customisable)	—	✓	—	✓	pixi	Done
Colored warning and error messages	-	✓	-	✓		Done
Number of values per option	✓	✓	—	✓		Done
Conditional requirement	—	—	✓	✓		Done
Numeric range validation	—	—	—	—		Done
Key-value map (`-Dkey=val`)	—	—	—	—	Java `-D`, Docker `-e`	Done
Aliases for long names	—	—	✓	✓		Done
Deprecated arguments	✓ (3.13)	—	✓	—		Done
Negative number passthrough	✓	—	—	✓	Essential for `decimo`	Done
Subcommands	✓	✓	✓	✓		Done
Auto-added `help` subcommand	—	—	✓	✓	git, cargo, kubectl	Done
Persistent (global) flags	—	—	✓	✓	git `--no-pager` etc.	Done
Suggest on typo (Levenshtein)	✓ (3.14)	—	✓	✓		Done
Subcommand aliases	—	—	✓	✓	cobra, clap	Done
Count with ceiling	-	—	—	-		Done
Cap and floor (clamp) for ranges	-	✓	—	—	Click `IntRange(clamp=True)`	Done
Hidden subcommands	—	—	✓	✓		Done
`NO_COLOR` env variable	—	—	—	—	I need it personally	Done
Response file (`@args.txt`)	✓	—	—	—	javac, MSBuild	Done
Argument parents (shared args)	✓	—	—	—		Phase 5
Interactive prompting	—	✓	—	—		Done
Password / masked input	—	✓	—	—		Phase 5
Confirmation (`--yes` / `-y`)	—	✓	—	—		Phase 5
Pre/Post run hooks	—	—	✓	—		Phase 5
REMAINDER number_of_values	✓	—	—	—		Done
Partial parsing (known args)	✓	—	—	✓		Done
Require equals syntax	—	—	—	✓		Done
Default-if-no-value	✓	—	—	✓		Done
Mutual implication (`implies`)	—	—	—	—	ArgMojo unique feature	Done
Stdin value (`-` convention)	—	—	✓	—	Unix convention	Phase 5
Shell completion script generation	—	✓	✓	✓	bash / zsh / fish	Done
Argument groups in help	✓	—	✓	✓		Done
Value-name wrapping control	—	—	—	✓	clap, cargo, pixi, git	Done
CJK-aware help formatting	—	—	—	—	I need it personally	Done
CJK full-to-half-width correction	—	—	—	—	I need it personally	Done
CJK punctuation detection	—	—	—	—	I need it personally	Done
Typed retrieval (`get_int()` etc.)	✓	✓	✓	✓		Done
Comptime `StringLiteral` params	—	—	—	✓	clap derive macros	Done
Registration-time name validation	—	—	—	✓	clap panic on unknown ID	Done
`Parseable` trait for type params	—	—	—	✓		Phase unknown
Derive / struct-based schema	—	—	—	✓	Requires Mojo macros	Phase unknown
Enum → type mapping (real enums)	—	—	—	✓	Requires reflection	Phase unknown
Subcommand variant dispatch	—	—	—	✓	Requires sum types	Phase unknown

2.3 Features Excluded (Infeasible or Inappropriate)

Feature	Reason for Exclusion
Derive / decorator API	Mojo has no macros or decorators
Usage-string-driven parsing (docopt style)	Too implicit; not a good fit for a typed systems language
Type-conversion callbacks	Use `get_int()` / `get_string()` pattern instead
Config file reading (`fromfile_prefix_chars`)	Out of scope; users can pre-process argv
Environment variable fallback	Can be done externally; not core parser responsibility
Template-customisable help (Go cobra style)	Mojo has no template engine; help format is hardcoded
Path / URL / Duration value types	Mojo stdlib has no `Path` / `Url` / `Duration` types yet

3. Technical Foundations

3.1 `sys.argv()` ✓ Available

Mojo provides sys.argv() to access command-line arguments:

from sys import argv

fn main():
    var args = argv()
    for i in range(len(args)):
        print("arg[", i, "] =", args[i])

This gives us the raw list of argument strings, and the remaining task is to implement the parsing logic.

3.2 Mojo's string operations ✓ Sufficient

Operation	Mojo Support	Usage
Prefix check	`str.startswith("--")`	Detect option type
String compare	`str == "value"`	Match names
Substring	Slicing / `find`	Split `key=value`
Split	`str.split("=")`	Parse equals syntax
Concatenation	`str + str`	Build help text

3.3 Mojo's data structures ✓ Sufficient

Structure	Purpose
`List[String]`	Store argument list, positional names
`Dict[String, Bool]`	Flag values
`Dict[String, String]`	Named values
`struct` with builder pattern	Argument, Command, ParseResult types

4. Current Implementation Status

4.1 Repository Structure

src/argmojo/
├── __init__.mojo                   # Package exports (Argument, Command, ParseResult)
├── argument.mojo                   # Argument struct — argument definition with builder pattern
├── command.mojo                    # Command struct — command definition & parsing
├── parse_result.mojo               # ParseResult struct — parsed values
└── utils.mojo                      # Internal utilities — ANSI colours, display helpers
tests/
├── test_parse.mojo                 # Core parsing tests (flags, values, shorts, etc.)
├── test_groups.mojo                # Group constraint tests (exclusive, conditional, etc.)
├── test_collect.mojo               # Collection feature tests (append, delimiter, number_of_values)
├── test_help.mojo                  # Help output tests (formatting, colours, alignment)
├── test_extras.mojo                # Range, map, alias, deprecated tests
├── test_subcommands.mojo           # Subcommand tests (dispatch, help sub, unknown sub, etc.)
├── test_negative_numbers.mojo      # Negative number passthrough tests
├── test_persistent.mojo            # Persistent (global) flag tests
├── test_typo_suggestions.mojo      # Levenshtein typo suggestion tests
├── test_completion.mojo            # Shell completion script generation tests
├── test_implies.mojo               # Mutual implication and cycle detection tests
├── test_const_require_equals.mojo  # default_if_no_value and require_equals tests
├── test_response_file.mojo         # response file (@args.txt) expansion tests
├── test_remainder_known.mojo       # remainder, parse_known_arguments, allow_hyphen_values tests
├── test_fullwidth.mojo             # full-width → half-width auto-correction tests
├── test_groups_help.mojo           # argument groups in help + value_name wrapping tests
└── test_prompt.mojo               # interactive prompting tests
examples/
├── demo.mojo                       # comprehensive showcase of all ArgMojo features
├── mgrep.mojo                      # grep-like CLI example (no subcommands)
├── mgit.mojo                       # git-like CLI example (with subcommands)
└── yu.mojo                         # Chinese-language CLI example (CJK-aware help)

4.2 What's Already Done ✓

Feature	Status	Tests
`Argument` struct with builder pattern	✓	—
`Command` struct with `add_argument()`	✓	—
`ParseResult` with `get_flag()`, `get_string()`, `get_int()`, `has()`	✓	✓
Long flags `--verbose`	✓	✓
Short flags `-v`	✓	✓
Key-value `--key value`, `--key=value`, `-k value`	✓	✓
Positional arguments	✓	✓
Default values for positional and named args	✓	✓
Required argument validation	✓	—
`--` stop marker	✓	✓
Auto `--help` / `-h` with generated help text	✓	—
Auto `--version` / `-V`	✓	—
Demo binary (`mojo build`)	✓	—
Short flag merging (`-abc` → `-a -b -c`)	✓	✓
Short option with attached value (`-ofile.txt`)	✓	✓
Choices validation (`.choice[]()`)	✓	✓
Value Name (`.value_name["FILE"]()`)	✓	✓
Hidden arguments (`.hidden()`)	✓	✓
Count action (`-vvv` → 3) with ceiling (`.max(N)`)	✓	✓
Positional arg count validation	✓	✓
Clean exit for `--help` / `--version`	✓	—
Mutually exclusive groups	✓	✓
Required-together groups	✓	✓
Negatable flags (`.negatable()` → `--no-X`)	✓	✓
Long option prefix matching (`--verb` → `--verbose`)	✓	✓
Append / collect action (`--tag x --tag y` → list)	✓	✓
One-required groups (`command.one_required(["json", "yaml"])`)	✓	✓
Value delimiter (`.delimiter[","]()` → split into list)	✓	✓
Number of values (`.number_of_values[N]()` → consume N values per occurrence)	✓	✓
Conditional requirements (`command.required_if("output", "save")`)	✓	✓
Numeric range validation (`.range[1, 65535]()`)	✓	✓
Key-value map option (`.map_option()` → `Dict[String, String]`)	✓	✓
Aliases (`.alias_name["color"]()` for `--colour` / `--color`)	✓	✓
Deprecated arguments (`.deprecated["msg"]()` → stderr warning)	✓	✓
Negative number passthrough (`-9`, `-3.14`, `-1.5e10` as positionals)	✓	✓
Subcommand data model (`add_subcommand()`, dispatch, `help` sub)	✓	✓
Colored warning and error messages (`_warn()`, `_error()`, all errors printed in colour to stderr)	✓	✓
Range clamping (`.range[1, 100]().clamp()` → adjust + warn instead of error)	✓	✓
Default-if-no-value (`.default_if_no_value["gzip"]()` → optional value with fallback)	✓	✓
Require equals syntax (`.require_equals()` → `--key=value` only)	✓	✓
Response file (`command.response_file_prefix()` → `@args.txt` expands file contents)	✓ ⚠	✓
Typo suggestions (Levenshtein "did you mean ...?" for long options and subcommands)	✓	✓
Flag counter ceiling (`.count().max[3]()` → cap with warning)	✓	✓
Shell completion script generation (`generate_completion["bash"]()` or `generate_completion("bash")`)	✓	✓
Subcommand aliases (`command_aliases(["co"])`)	✓	✓
Hidden subcommands (`sub.hidden()` → excluded from help, completions, errors)	✓	✓
`NO_COLOR` env variable (suppress ANSI output when set)	✓	✓
Mutual implication (`command.implies("debug", "verbose")` with chained + cycle detection)	✓	✓
Remainder positional (`.remainder()` → consume all remaining tokens)	✓	✓
Partial parsing (`parse_known_arguments()` → collect unknown options)	✓	✓
Allow hyphen values (`.allow_hyphen_values()` → accept `-x` as positional value)	✓	✓
Value name rename (`.metavar()` → `.value_name()`)	✓	✓
CJK-aware help formatting (`_display_width` for column alignment)	✓	✓
Full-width → half-width auto-correction (fullwidth ASCII + `U+3000` space)	✓	✓
CJK punctuation auto-correction (em-dash `U+2014` → hyphen-minus)	✓	✓
Compile-time `StringLiteral` builder params (`.long[]`, `.short[]`, `.choice[]`, colours, etc.)	✓	—
Registration-time validation for group constraints (`mutually_exclusive`, `required_together`, etc.)	✓	✓
Interactive prompting (`.prompt()`, `.prompt["..."]()` → prompt for missing args)	✓	✓

⚠ Response file support is temporarily disabled due to a Mojo compiler deadlock under -D ASSERT=all. The implementation is preserved and will be re-enabled when the compiler bug is fixed.

4.3 API Design (Current)

from argmojo import Command, Argument

fn main() raises:
    var command = Command("demo", "A CJK-aware text search tool which supports pinyin and Yuhao IME")

    # Positional arguments
    command.add_argument(Argument("pattern", help="Search pattern").required().positional())
    command.add_argument(Argument("path", help="Search path").positional().default["."]())

    # Optional arguments
    command.add_argument(Argument("ling", help="Use Yuhao Lingming encoding").long["ling"]().short["l"]().flag())
    command.add_argument(Argument("ignore-case", help="Case insensitive search").long["ignore-case"]().short["i"]().flag())
    command.add_argument(Argument("max-depth", help="Maximum directory depth").long["max-depth"]().short["d"]().takes_value())

    var result = command.parse()

    var pattern = result.get_string("pattern")
    var use_ling = result.get_flag("ling")
    var max_depth = result.get_int("max-depth")

4.4 Command-line syntax supported

# Long options
--flag              # Boolean flag
--key value         # Key-value (space separated)
--key=value         # Key-value (equals separated)
--key=value         # Require-equals syntax (when .require_equals())
--key               # Default-if-no-value (when .default_if_no_value())
--no-flag           # Negation (when .negatable())
--verb              # Prefix match → --verbose (if unambiguous)

# Short options
-f                  # Boolean flag
-k value            # Key-value
-abc                # Merged short flags → -a -b -c
-ofile.txt          # Attached short value → -o file.txt
-abofile.txt        # Mixed: -a -b -o file.txt
-vvv                # Count flag → verbose = 3

# Positional arguments
pattern             # By order of add_argument() calls

# Special
--                  # Stop parsing options; rest becomes positional
--help / -h / -?    # Show auto-generated help
--version / -V      # Show version
@args.txt           # Response file expansion (when enabled)
cmd rest...         # Remainder positional (consume all remaining tokens)

# Subcommands
app search pattern  # Dispatch to subcommand
app help search     # Show subcommand help
app --verbose search  # Persistent flags before subcommand

4.5 Validation & Help Behavior Matrix

Positional arguments and named options are validated independently — a command can fail on either or both. The two matrices below show each dimension's behavior separately; the combined scenario table shows practical cross-product outcomes.

Per-Dimension Behavior

Positional arguments:

Command config ↓ \ User input →	Enough positionals provided	Not enough positionals provided
Has required positional(s)	✓ Proceed	✗ Error + usage
No required positional(s)	✓ Proceed	N/A — always "enough"

Named options:

Command config ↓ \ User input →	Enough options provided	Not enough options provided
Has required option(s)	✓ Proceed	✗ Error + usage
No required option(s)	✓ Proceed	N/A — always "enough"

Cross-Dimension Matrix (4 × 4)

When rows and columns refer to different dimensions (e.g., "has required positionals" × "enough options"), the outcome depends on the other dimension — marked ? below.

	Enough pos. args	Not enough pos. args	Enough options	Not enough options
Has required positional(s)	✓ Proceed	✗ Error + usage	? depends on pos.	? depends on pos.
No required positional(s)	✓ Proceed	(N/A)	? always ok for pos.	? always ok for pos.
Has required option(s)	? depends on opt.	? depends on opt.	✓ Proceed	✗ Error + usage
No required option(s)	? always ok for opt.	? always ok for opt.	✓ Proceed	(N/A)

Combined Scenario Table

The practical view — both dimensions checked together at parse time:

Command Profile	Nothing provided	Pos. ✗ Opt. ✓	Pos. ✓ Opt. ✗	All ✓
Required pos. + required opt.	✗ Error + usage	✗ Error (missing pos.)	✗ Error (missing opt.)	✓ Proceed
Required pos. only	✗ Error + usage	✗ Error (missing pos.)	✓ Proceed	✓ Proceed
Required opt. only	✗ Error + usage	✓ Proceed	✗ Error (missing opt.)	✓ Proceed
No requirements	✓ Proceed	✓ Proceed	✓ Proceed	✓ Proceed
Has subcommands (group)	✓ Proceed *	—	—	✓ Dispatch

* Group commands with subcommands typically do nothing useful with no input — help_on_no_arguments() is recommended.

Effect of `help_on_no_arguments()`

Scenario	Default (off)	With `help_on_no_arguments()`
Zero args (only program name)	Validation runs → error if requirements exist; proceed if none	Show full help (exit 0)
Some args provided (insufficient)	✗ Error + usage	✗ Error + usage (same)
All requirements satisfied	✓ Proceed	✓ Proceed (same)

Key: help_on_no_arguments() only overrides the zero-argument case. Once any argument is provided, normal validation takes over regardless.

Industry Consensus (clap / cobra / argparse / click / docker / git / kubectl)

Error, not help. When the user provides a partial or incorrect invocation, the standard is a short error message naming the missing argument + a compact usage line. Full help is reserved for --help or bare group commands. This is the dominant pattern across clap, argparse, click, commander.js, cargo.
No special-casing "zero args" by default. The vast majority of frameworks do NOT treat "provided nothing" differently from "provided some but not all." clap's arg_required_else_help(true) is the only first-class opt-in — ArgMojo's help_on_no_arguments() mirrors this.
Two-tier pattern for subcommands. Every tool examined follows the same convention:
- Group/parent command with no subcommand given → show full help (list available subcommands)
- Leaf subcommand with missing required args → show error + usage line (not full help)
- Rationale: at the group level, the user needs guidance on what to do; at the leaf level, they know what they want but forgot how.
Error batching. Split across tools — clap and argparse report all missing arguments at once; click and commander report the first one. ArgMojo currently reports the first missing argument (validation order: required args → positional count → exclusive groups → together groups → one-required → conditional → range).
Exit codes. POSIX-influenced tools (argparse, clap, click) use exit code 2 for argument parse errors. Go-based tools (cobra, docker, kubectl) use exit code 1. ArgMojo currently raises an Error (caller decides exit code).
Error output format consensus (clap / argparse / click / cargo):
```
error: <command>: <what's wrong>

Usage: <command> <required> [optional] [OPTIONS]
For more information, try '<command> --help'.
```
NOT full help with all flags listed (only cobra does that by default, and it provides SilenceUsage to opt out).

5. Development Roadmap

Phase 1: Skeleton

Establish module structure
Implement Argument struct and builder methods
Implement basic Command struct
Implement a small demo CLI tool to test the library

Phase 2: Parsing Enhancements ✓

Short flag merging — -abc expands to -a -b -c (argparse, cobra, clap all support this)
Short option with attached value — -ofile.txt means -o file.txt (argparse, clap)
Choices validation — restrict values to a set: .choice["debug"]().choice["info"]().choice["warn"]().choice["error"]()
Value Name — display name for values in help: .value_name["FILE"]() → --output FILE
Positional arg count validation — fail if too many positional args
Hidden arguments — .hidden() to exclude from help output (cobra, clap)
count action — -vvv → get_count("verbose") == 3 (argparse -v counting)
Clean exit for --help/--version — use sys.exit(0) instead of raise Error

Phase 3: Relationships & Validation (for v0.2)

Phase 4: Subcommands (for v0.2)

Subcommands (app <subcommand> [args]) are the first feature that turns ArgMojo from a single-parser into a parser tree. The core insight is that a subcommand is just another Command instance — it already has parse_arguments(), _generate_help(), and all validation logic. No new parser, tokenizer, or separate module files are needed.

Architecture: composition inside `Command`

No file split. Core logic stays in command.mojo. Mojo has no partial structs, so splitting would force free functions + parameter threading for little gain. ANSI colour constants and small utility functions live in utils.mojo (internal-only, all symbols _-prefixed).
No tokenizer. Mojo standard library provides sys.argv() which already gives us a pre-split list of argument strings. We can work with this directly in parse_arguments() without a separate tokenization step.
Composition-based. Command gains a child command list. When parse_arguments() hits a non-option token matching a registered subcommand, it delegates the remaining argv slice to the child's own parse_arguments(). 100% logic reuse, zero duplication.

Pre-requisite refactor (Step 0)

Before adding subcommand routing, clean up parse_arguments() so root and child can each call the same validation/defaults path:

Extract _apply_defaults(mut result) — move the ~20-line defaults block into a private method
Extract _validate(result) — move the ~130-line validation block (required, exclusive, together, one-required, conditional, range) into a private method
Verify all existing tests still pass after this refactor (143 original + 17 new Step 0 tests = 160 total, all passing)

Step 1 — Data model & API surface

Add subcommands: List[Command] field on Command (Matryoshka doll :D)
Add add_subcommand(mut self, sub: Command) builder method
Add subcommand: String field on ParseResult (name of selected subcommand, empty if none)
Add subcommand_result: List[ParseResult] or similar on ParseResult to hold child results

Target API:

var app = Command("app", "My CLI tool", version="0.3.0")
app.add_argument(Argument("verbose", help="Verbose output").long["verbose"]().short["v"]().flag())

var search = Command("search", "Search for patterns")
search.add_argument(Argument("pattern", help="Search pattern").required().positional())
search.add_argument(Argument("max-depth", help="Max depth").long["max-depth"]().short["d"]().takes_value())

var init = Command("init", "Initialize a new project")
init.add_argument(Argument("name", help="Project name").required().positional())

app.add_subcommand(search)
app.add_subcommand(init)

var result = app.parse()
if result.subcommand == "search":
    var sub = result.subcommand_result
    var pattern = sub.get_string("pattern")

Step 2 — Parse routing (I need to be very careful)

In parse_arguments(), when the current token is not an option and subcommands are registered, check if it matches a subcommand name
On match: record result.subcommand = name, build child argv (remaining tokens), call child.parse_arguments(child_argv), store child result
On no match and subcommands exist: treat as positional (existing behavior)
-- before subcommand boundary: all subsequent tokens are positional for root, no subcommand dispatch
Handle app help <sub> as equivalent to app <sub> --help via auto-registered help subcommand (strategy B); _is_help_subcommand flag; .disable_help_subcommand() opt-out API

Step 3 — Global (persistent) flags

Add .persistent() builder method on Argument (sets is_persistent: Bool)
Before child parse, inject copies of parent's persistent args into the child's arg list (or make child parser aware of them)
Root-level persistent flag values are parsed before dispatch and merged into child result
Conflict policy: reject duplicate long/short names between parent persistent args and child local args at registration time (add_subcommand raises)
Bidirectional sync: bubble-up (flag after subcommand → root result) + push-down (flag before subcommand → child result)

Step 4 — Help & UX

Root _generate_help() appends a "Commands:" section listing subcommand names + descriptions (aligned like options)
app <sub> --help delegates to sub._generate_help() directly
app help <sub> routing via auto-registered real subcommand: add_subcommand() auto-inserts a help Command with _is_help_subcommand = True; dispatch path detects the flag and routes to sibling help
.disable_help_subcommand() opt-out API on Command
Child help includes inherited persistent flags under a "Global Options:" heading
Usage line shows full command path: app search [OPTIONS] PATTERN

Step 5 — Error handling

Unknown subcommand: "Unknown command '<name>'. Available commands: search, init"
Errors inside child parse: prefix with command path for clarity (e.g. "app search: Option '--foo' requires a value")
Exit codes consistent with current behavior (exit 2 for parse errors)
allow_positional_with_subcommands() — guard preventing accidental mixing of positional args and subcommands on the same Command (following cobra/clap convention); requires explicit opt-in

Step 6 — Tests

Step 7 — Documentation & examples

Add examples/mgrep.mojo — grep-like CLI demonstrating all single-command features
Add examples/mgit.mojo — git-like CLI demonstrating subcommands, nested subcommands, persistent flags, and all group constraints
Update user manual with subcommand usage patterns
Document persistent flag behavior and conflict rules

Phase 5: Polish (v0.3 shipped; remaining features for v0.4+)

Some features shipped in v0.3.0, others completed in the unreleased update branch. Remaining items may be deferred to v0.4+.

Pre-requisite refactor

Before adding Phase 5 features, further decompose parse_arguments() for readability and maintainability:

Features

Explicitly Out of Scope in This Phase

These will NOT be implemented in this phase, but will be considered in future.

Derive/decorator-based API (no macros in Mojo)
Usage-string-driven parsing (docopt style)
Config file parsing (users can pre-process argv)
Environment variable fallback
Template-based help formatting

Phase 6: CJK Features (hopefully for v0.4 because I need it personally)

ArgMojo's differentiating features — no other CLI library addresses CJK-specific pain points.

這部分主要是為了讓 ArgMojo 在 CJK 環境下的使用體驗更好，解決一些常見的問題，比如幫助信息對齊、全角字符自動轉半角、CJK 標點檢測等。畢竟我總是忘了切換輸入法，打出中文的全角標點，然後被 CLI 報錯。

6.1 CJK-aware help formatting ✓

Problem: All Western CLI libraries (argparse, cobra, clap) assume 1 char = 1 column. CJK characters occupy 2 terminal columns (full-width), causing misaligned --help output when descriptions mix CJK and ASCII:

  --format <FMT>   Output format              ← aligned
  --ling           使用宇浩靈明編碼           ← CJK chars each take 2 columns, misaligned

Implementation:

Implement _display_width(s: String) -> Int in utils.mojo, traversing each code point:
- CJK Unified Ideographs, CJK Ext-A/B/C/D/E/F/G/H/I/J, fullwidth forms → width 2
- Other visible characters → width 1 (zero-width joiners and combining marks are rare in CLI help text and are not special-cased)
Replace len() with _display_width() in all help formatting padding calculations (_help_positionals_section, _help_options_section, _help_commands_section)
Add tests with mixed CJK/ASCII help text verifying column alignment

References: POSIX wcwidth(3), Python unicodedata.east_asian_width(), Rust unicode-width crate.

6.2 Full-width → half-width auto-correction ✓

Problem: CJK users frequently forget to switch input methods, typing full-width ASCII:

－－ｖｅｒｂｏｓｅ instead of --verbose
＝ instead of =

Implementation:

Implement _fullwidth_to_halfwidth(token: String) -> String in utils.mojo:
- Full-width ASCII range: U+FF01–U+FF5E → subtract 0xFEE0 to get half-width
- Full-width space U+3000 → half-width space U+0020. --name\u3000yuhao\u3000--verbose is originally scanned by sys.argv as a single token with embedded full-width spaces, so we need to handle this case too by replacing the original list of arguments with the corrected split. There are also other spaces in the Unicode standard, we can also support them by adding a method like whitespace_characters(chars: List[String]) that allows users to specify additional code points to treat as whitespace (e.g. U+2003 EM SPACE).
In parse_arguments(), scan each token before parsing; if full-width characters are detected in option tokens (-- or - prefixed), auto-correct and print a coloured warning:
```
warning: detected full-width characters in '－－ｖｅｒｂｏｓｅ', auto-corrected to '--verbose'
```
Only correct option names (tokens starting with -), not positional values (user may intentionally input full-width content)
Add .disable_fullwidth_correction() opt-out API on Command
Add tests for full-width flag, full-width = in --key＝value, and opt-out
Let users know that this feature is by default on and can be disabled if they prefer strict parsing.

Note that the following punctuation characters are already handled by the full-width correction step, since they fall within the U+FF01–U+FF5E range:

U+FF0D FULLWIDTH HYPHEN-MINUS (－) → U+002D HYPHEN-MINUS (-)
U+FF1A FULLWIDTH COLON (：) → U+003A COLON (:)
U+FF0C FULLWIDTH COMMA (，) → U+002C COMMA (,)

6.3 CJK punctuation detection

Problem: Users accidentally type Chinese punctuation:

——verbose (em-dash U+2014 × 2) instead of --verbose
--key：value (full-width colon U+FF1A) instead of --key=value

Implementation:

Phase 7: Type-Safe API (aspirational — blocked on Mojo language features)

These features represent the "next generation" of CLI parser design, inspired by Rust clap's derive API. They require Mojo language features that do not yet exist (macros, reflection, sum types). Tracked here as aspirational goals.

Note on clap's success: The claim that "clap succeeded because of strong typing" is partially misleading. clap's builder API (matches.get_one::<String>("name")) is structurally identical to ArgMojo's result.get_string("name") — both are runtime-typed string-keyed lookups. clap was the dominant Rust CLI library for years (v1–v3) before the derive macro was stabilised. The derive API's real value is boilerplate reduction (one struct definition encodes name, type, help, default), not type safety per se. Python argparse (dynamic Namespace), Go cobra (GetString("name")), and Click all use the same runtime-lookup pattern and are the most popular parsers in their ecosystems.

Feature	What it needs	Status
`Parseable` trait	Mojo traits + parametric methods	Can prototype now
`add_arg[Int]("--port")` generic registration	`Parseable` trait + type-aware storage	Can prototype now
`@cli struct Args` derive	Mojo macros / decorators	Blocked — no macros
`enum Mode { Debug, Release }` → auto choices	Mojo reflection on enum variants	Blocked — no reflection
`variant Command { Commit(CommitArgs), Push(PushArgs) }`	Mojo sum types / enum with payloads	Blocked — no sum types
`file: String` (required) vs `output: String?` (optional)	Derive macro to map struct fields → args	Blocked — no macros
`Path` / `Url` / `Duration` value types	Mojo stdlib types	Blocked — stdlib gaps

What ArgMojo already provides (equivalent functionality)

"Missing" feature	ArgMojo equivalent	How
Typed retrieval	`get_flag()->Bool`, `get_int()->Int`, `get_string()->String`, `get_count()->Int`, `get_list()->List[String]`, `get_map()->Dict[String,String]`	Already typed at retrieval
Enum validation	`.choice["debug"]().choice["release"]()`	String-level enum; help shows `{debug,release}`
Required / optional	`.required()` / `.default["..."]()`	Parse-time enforcement with coloured errors
Flag counter (not just bool)	`.count()` + `get_count()`	`-vvv → 3`; `.count().max[N]()` caps at ceiling
Range clamping	`.range[min, max]().clamp()`	Adjusts out-of-range values with a warning
Subcommand dispatch	`result.subcommand == "search"` + `get_subcommand_result()`	Same pattern as Go cobra

6. Parsing Algorithm

Input: ["demo", "yuhao", "./src", "--ling", "-i", "--max-depth", "3"]

1. Initialize ParseResult and register positional names
2. If `help_on_no_arguments` is enabled and only argv[0] exists:
    print help and exit
3. Loop from argv[1] with cursor i:
    ├─ If args[i] == "--":
    │     Enter positional-only mode
    ├─ If positional-only mode is on:
    │     Append token to positional list
    ├─ If args[i] == "--help" or "-h" or "-?":
    │     Print help and exit
    ├─ If args[i] == "--version" or "-V":
    │     Print version and exit
    ├─ If args[i].startswith("--"):
    │     → _parse_long_option(raw_args, i, result) → new i
    │       (--key=value, --no-key negation, prefix match, count/flag/number_of_values/value)
    ├─ If args[i].startswith("-") and len > 1:
    │     ├─ IF _looks_like_number(token) AND (allow_negative_numbers OR no digit short opts):
    │     │     Treat as positional argument (negative number passthrough)
    │     └─ ELSE:
    │           ├─ Single char → _parse_short_single(key, raw_args, i, result) → new i
    │           └─ Multi char  → _parse_short_merged(key, raw_args, i, result) → new i
    ├─ If subcommands registered:
    │     → _dispatch_subcommand(arg, raw_args, i, result) → new i or -1
    │       (match → build child argv, inject persistent, recurse, sync; no match → -1)
    └─ Otherwise:
            Treat as positional argument
4. Apply defaults for missing arguments (named + positional slots)
5. Validate:
    ├─ Required arguments
    ├─ Positional count (too many positionals)
    ├─ Mutually exclusive groups
    ├─ Required-together groups
    ├─ One-required groups
    ├─ Conditional requirements
    ├─ Count ceilings (clamp + warn)
    └─ Numeric range constraints (error or clamp + warn)
6. Return ParseResult

6.1 Subcommand parsing flow

Input: ["app", "--verbose", "search", "pattern", "--max-depth", "3"]

1. Root parse_arguments() begins normal cursor walk from argv[1]
2. "--verbose" → starts with "--" → parsed as root-level long option (flag)
3. "search" → no "-" prefix → check registered subcommands:
    ├─ match found → record subcommand = "search"
    ├─ no match + subcommands registered → error (or treat as positional)
    └─ no subcommands registered → treat as positional (existing behavior)
4. Build child argv: ["app search", "pattern", "--max-depth", "3"]
   (argv[0] = command path for child help/error messages)
5. Inject persistent args from root into child's arg list
6. Call child.parse_arguments(child_argv) → child runs its own full parse loop
   (same code path: long/short/merged/positional/defaults/validation)
7. Store child ParseResult in root result:
    ├─ result.subcommand = "search"
    └─ result.subcommand_result = child_result
8. Root runs _apply_defaults() and _validate() for root-level args only
   (child already validated itself in step 6)
9. Return root ParseResult to application code

7. Naming Conventions

ArgMojo follows a consistent naming philosophy. When in doubt, apply these priorities in order:

Internal consistency — every name within ArgMojo should follow the same pattern. If we use Argument, then methods that refer to arguments should also spell out the word.
Mojo / Python style consistency — prefer snake_case for functions and methods, PascalCase for types. Follow Mojo stdlib conventions where they exist.
Cross-language familiarity — when a concept is well-known across CLI libraries (cobra, clap, Click, argparse), keep the name recognisable, but do not import abbreviations that conflict with priority 1.

Decisions made

Abbreviation (rejected)	Full form (adopted)	Rationale
`Arg`	`Argument`	Internal struct name; aligns with `add_argument()`
`parse_args()`	`parse_arguments()`	Consistent with `Argument` naming; `parse_args` was an argparse legacy
`help_on_no_args()`	`help_on_no_arguments()`	Same reason
`_aliases`	`_command_aliases`	Disambiguates from `Argument.aliases()` (option-level aliases)
`nargs()` / `nargs_count`	`number_of_values`	Full descriptive name

8. Notes on Mojo versions

Here are some important Mojo-specific patterns used throughout this project. Mojo is rapidly evolving, so these may need to be updated in the future.

These are all worthy being checked in Mojo Miji too.

Pattern	What & Why
`"""Tests..."""`	Docstring convention
`@fieldwise_init`	Replaces `@value`
`var self`	Used for builder methods instead of `owned self`
`String()`	Explicit conversion; `str()` is not available
`[a, b, c]` for `List`	List literal syntax instead of variadic constructor
`.copy()`	Explicit copy for non-ImplicitlyCopyable types
`Movable` conformance	Required for structs stored in containers

9. Pending Renames

Current Name	Target Name	Condition
`.alias_name[]()`	`.alias[]()`	Blocked: `alias` is a reserved keyword in Mojo (`alias X = Int`). Rename once Mojo fully deprecates or removes the `alias` keyword. Track upstream Mojo language changes.

FilesExpand file tree

argmojo_overall_planning.md

Latest commit

History

argmojo_overall_planning.md

File metadata and controls

ArgMojo — Overall Planning

1. Why ArgMojo?

2. Cross-Language Research Summary

2.1 Libraries Surveyed

2.2 Universal Features Worth Adopting

2.3 Features Excluded (Infeasible or Inappropriate)

3. Technical Foundations

3.1 sys.argv() ✓ Available

3.2 Mojo's string operations ✓ Sufficient

3.3 Mojo's data structures ✓ Sufficient

4. Current Implementation Status

4.1 Repository Structure

4.2 What's Already Done ✓

4.3 API Design (Current)

4.4 Command-line syntax supported

4.5 Validation & Help Behavior Matrix

Per-Dimension Behavior

Cross-Dimension Matrix (4 × 4)

Combined Scenario Table

Effect of help_on_no_arguments()

Industry Consensus (clap / cobra / argparse / click / docker / git / kubectl)

5. Development Roadmap

Phase 1: Skeleton

Phase 2: Parsing Enhancements ✓

Phase 3: Relationships & Validation (for v0.2)

Phase 4: Subcommands (for v0.2)

Architecture: composition inside Command

Pre-requisite refactor (Step 0)

Step 1 — Data model & API surface

Step 2 — Parse routing (I need to be very careful)

Step 3 — Global (persistent) flags

Step 4 — Help & UX

Step 5 — Error handling

Step 6 — Tests

Step 7 — Documentation & examples

Phase 5: Polish (v0.3 shipped; remaining features for v0.4+)

Pre-requisite refactor

Features

Explicitly Out of Scope in This Phase

Phase 6: CJK Features (hopefully for v0.4 because I need it personally)

6.1 CJK-aware help formatting ✓

6.2 Full-width → half-width auto-correction ✓

6.3 CJK punctuation detection

Phase 7: Type-Safe API (aspirational — blocked on Mojo language features)

What ArgMojo already provides (equivalent functionality)

6. Parsing Algorithm

6.1 Subcommand parsing flow

7. Naming Conventions

Decisions made

8. Notes on Mojo versions

9. Pending Renames

3.1 `sys.argv()` ✓ Available

Effect of `help_on_no_arguments()`

Architecture: composition inside `Command`