xai-grok-build

xAI's Grok Build TUI — agentic coding CLI for Grok models

$ anc audit grok --json

agent Rust https://x.ai/cli

MUST 19 / 28SHOULD 13 / 213 failing13 warningsaudited 2026-06-02 · anc 0.5.0

73pass rate

2/8principles met

Eight principles, scored

Behavioral and source checks. Every non-pass row carries a copy-paste remediation prompt.

P1
Non-Interactive by Default
All 6 checks pass.
pass

Structured, Parseable Output

--output/--format flag detected but could not validate JSON via safe probes (--help/--version override output flags in most CLIs) · CLI emits structured output but exposes no `schema` subcommand or `--schema` flag at top level or nested one level deep. Agents need a runtime-discoverable schema to pin against shape changes.

Remediation · P2

Make xai-grok-build satisfy P2 (Structured, Parseable Output) from the agent-native CLI standard. Address:
- Structured output support (--output/--format flag detected but could not validate JSON via safe probes (--help/--version override output flags in most CLIs))
- Structured-output CLI exposes its schema at runtime (CLI emits structured output but exposes no `schema` subcommand or `--schema` flag at top level or nested one level deep. Agents need a runtime-discoverable schema to pin against shape changes.)
- --json / --jsonl short aliases for --output (no --json or --jsonl short alias found. Agents and pipelines benefit from short forms alongside the canonical `--output` enum.)
- `--raw` flag for pipe-safe unformatted output (no `--raw` flag advertised. MAY-tier — useful for pipelines that want to strip formatting before piping to other tools.)
- Errors emit JSON envelope with `error`/`kind`/`message` under `--output json` (bad invocation under `--output json` produced no parseable JSON on stderr or stdout. JSON mode must emit a JSON error envelope, not plain text.)
Requirements: https://anc.dev/p2

fail

Progressive Help Discovery

no `examples` subcommand or `--examples` flag found. MAY-tier — a curated usage block keeps agents from hunting through long help text. · subcommands missing example invocations in their `--help`: agent, export, import, inspect, leader, login, logout, mcp, memory, models, plugin, sessions, setup, ssh, trace, update, version, worktree. Examples teach agents the call shape faster than option tables; use clap's `after_help` or a dedicated `Examples:` block.

Remediation · P3

Make xai-grok-build satisfy P3 (Progressive Help Discovery) from the agent-native CLI standard. Address:
- `examples` subcommand or `--examples` flag for curated usage patterns (no `examples` subcommand or `--examples` flag found. MAY-tier — a curated usage block keeps agents from hunting through long help text.)
- Each subcommand's `--help` ships at least one invocation example (subcommands missing example invocations in their `--help`: agent, export, import, inspect, leader, login, logout, mcp, memory, models, plugin, sessions, setup, ssh, trace, update, version, worktree. Examples teach agents the call shape faster than option tables; use clap's `after_help` or a dedicated `Examples:` block.)
- Help text pairs human and `--output json` example invocations (no paired text + `--output json` example found within 5 lines in top-level or any subcommand `--help`. Pairing keeps agents from reverse-engineering the JSON invocation from the text one.)
Requirements: https://anc.dev/p3

fail

Fail-Fast, Actionable Errors

errors under `--output json` are not JSON-formatted. Consumers parsing stdout-as-JSON cannot recover the failure without a separate text-parsing path; switch the error writer to honor the active output mode.

Remediation · P4

Make xai-grok-build satisfy P4 (Fail-Fast, Actionable Errors) from the agent-native CLI standard. Address:
- `--output json` produces JSON-formatted errors (errors under `--output json` are not JSON-formatted. Consumers parsing stdout-as-JSON cannot recover the failure without a separate text-parsing path; switch the error writer to honor the active output mode.)
Requirements: https://anc.dev/p4

warn

Safe Retries & Mutation Boundaries

write-pattern subcommand(s) present (update) but no read-pattern surface detected. If the CLI is write-only by design the MUST is satisfied vacuously; otherwise expose the read surface with agent-recognizable verbs (list/get/show/query/find/search).

Remediation · P5

Make xai-grok-build satisfy P5 (Safe Retries & Mutation Boundaries) from the agent-native CLI standard. Address:
- Read and write surfaces are both visible in subcommand list (write-pattern subcommand(s) present (update) but no read-pattern surface detected. If the CLI is write-only by design the MUST is satisfied vacuously; otherwise expose the read surface with agent-recognizable verbs (list/get/show/query/find/search).)
Requirements: https://anc.dev/p5

warn

Composable, Predictable Command Structure

pager referenced in --help but no --no-pager escape hatch advertised · 7/20 subcommand(s) follow standard verb names. Non-standard: agent, export, import, leader, mcp, memory, models, plugin, sessions, setup, ssh, trace, worktree. MAY-tier — community-standard verbs (get/list/create/update/delete) help agents predict subcommand behavior across CLIs.

Remediation · P6

Make xai-grok-build satisfy P6 (Composable, Predictable Command Structure) from the agent-native CLI standard. Address:
- Pager-using CLI ships --no-pager escape hatch (pager referenced in --help but no --no-pager escape hatch advertised)
- Subcommand verbs follow community-standard names (7/20 subcommand(s) follow standard verb names. Non-standard: agent, export, import, leader, mcp, memory, models, plugin, sessions, setup, ssh, trace, worktree. MAY-tier — community-standard verbs (get/list/create/update/delete) help agents predict subcommand behavior across CLIs.)
- `--color` flag for explicit color control (no `--color` flag advertised. MAY-tier — `auto|always|never` lets agents and pipelines override the TTY-based default.)
- Subcommand naming follows a consistent verb/noun convention (subcommand naming is inconsistent: 5 non-verb subcommand(s) (agent, leader, mcp, plugin, worktree) mix verb and non-verb children at the second level, so an agent cannot predict where the action lives. SHOULD-tier: pick a consistent shape (all verb-first, all noun-verb hierarchy, or any combination where each non-verb group's children are uniformly verbs). The verb list is a heuristic; inspect `--help` to confirm.)
Requirements: https://anc.dev/p6

warn

Bounded, High-Signal Responses

no --quiet/-q flag detected in --help output · no TTY-aware language found in `--help`. MAY-tier — automatic verbosity reduction when stdout is piped or redirected lets agents skip the explicit `--quiet` flag. Behavioral probes cannot simulate a real TTY without a pty crate, so this audit relies on documented intent.

Remediation · P7

Make xai-grok-build satisfy P7 (Bounded, High-Signal Responses) from the agent-native CLI standard. Address:
- Quiet mode available (no --quiet/-q flag detected in --help output)
- Help text advertises TTY-aware verbosity behavior (no TTY-aware language found in `--help`. MAY-tier — automatic verbosity reduction when stdout is piped or redirected lets agents skip the explicit `--quiet` flag. Behavioral probes cannot simulate a real TTY without a pty crate, so this audit relies on documented intent.)
Requirements: https://anc.dev/p7

warn

P8
Discoverable Through Agent Skill Bundles
All 3 checks pass.
pass

All Audits

P1: Non-Interactive by Default

PASS	Non-interactive by default
SKIP	Non-interactive gate flag advertised in --help	target satisfies P1 via alternative gate (help-on-bare or stdin-primary)
PASS	Flags advertise env-var bindings in --help
PASS	Secret-bearing flags expose stdin or *-file companion
PASS	`--help` advertises default values for flags
PASS	Rich-TUI affordance for TTY contexts

P2: Structured, Parseable Output

WARN	Structured output support	--output/--format flag detected but could not validate JSON via safe probes (--help/--version override output flags in most CLIs)
FAIL	Structured-output CLI exposes its schema at runtime	CLI emits structured output but exposes no `schema` subcommand or `--schema` flag at top level or nested one level deep. Agents need a runtime-discoverable schema to pin against shape changes.
WARN	--json / --jsonl short aliases for --output	no --json or --jsonl short alias found. Agents and pipelines benefit from short forms alongside the canonical `--output` enum.
WARN	`--raw` flag for pipe-safe unformatted output	no `--raw` flag advertised. MAY-tier — useful for pipelines that want to strip formatting before piping to other tools.
SKIP	`--output` advertises additional formats beyond text/json	no `--output` or `--format` flag advertised; vacuous skip for MAY-tier extra formats.
PASS	Bad invocation exits with structured usage-error code (2)
FAIL	Errors emit JSON envelope with `error`/`kind`/`message` under `--output json`	bad invocation under `--output json` produced no parseable JSON on stderr or stdout. JSON mode must emit a JSON error envelope, not plain text.
SKIP	JSON success and error envelopes share their non-payload key set	success-mode probe (`--help --output json`) produced no parseable JSON; cannot compare envelope shapes.

P3: Progressive Help Discovery

PASS	Help flag produces useful output
PASS	Version flag works (`--version` plus short alias)
PASS	Version flag works (`--version` plus short alias)
WARN	`examples` subcommand or `--examples` flag for curated usage patterns	no `examples` subcommand or `--examples` flag found. MAY-tier — a curated usage block keeps agents from hunting through long help text.
PASS	Short `-h` summary differs from `--help` long form
FAIL	Each subcommand's `--help` ships at least one invocation example	subcommands missing example invocations in their `--help`: agent, export, import, inspect, leader, login, logout, mcp, memory, models, plugin, sessions, setup, ssh, trace, update, version, worktree. Examples teach agents the call shape faster than option tables; use clap's `after_help` or a dedicated `Examples:` block.
WARN	Help text pairs human and `--output json` example invocations	no paired text + `--output json` example found within 5 lines in top-level or any subcommand `--help`. Pairing keeps agents from reverse-engineering the JSON invocation from the text one.

P4: Fail-Fast, Actionable Errors

PASS	Rejects invalid arguments
PASS	Error messages include a hint or remediation phrase
WARN	`--output json` produces JSON-formatted errors	errors under `--output json` are not JSON-formatted. Consumers parsing stdout-as-JSON cannot recover the failure without a separate text-parsing path; switch the error writer to honor the active output mode.

P5: Safe Retries & Mutation Boundaries

SKIP	Destructive subcommands require `--force` or `--yes`	no destructive subcommands detected; MUST applies conditionally to CLIs with destructive operations.
WARN	Read and write surfaces are both visible in subcommand list	write-pattern subcommand(s) present (update) but no read-pattern surface detected. If the CLI is write-only by design the MUST is satisfied vacuously; otherwise expose the read surface with agent-recognizable verbs (list/get/show/query/find/search).

P6: Composable, Predictable Command Structure

PASS	Handles SIGPIPE gracefully
WARN	Pager-using CLI ships --no-pager escape hatch	pager referenced in --help but no --no-pager escape hatch advertised
PASS	Respects NO_COLOR
WARN	Subcommand verbs follow community-standard names	7/20 subcommand(s) follow standard verb names. Non-standard: agent, export, import, leader, mcp, memory, models, plugin, sessions, setup, ssh, trace, worktree. MAY-tier — community-standard verbs (get/list/create/update/delete) help agents predict subcommand behavior across CLIs.
WARN	`--color` flag for explicit color control	no `--color` flag advertised. MAY-tier — `auto\|always\|never` lets agents and pipelines override the TTY-based default.
SKIP	Input-accepting commands read from stdin when no file is given	no input-accepting subcommand detected (process/parse/convert/transform/analyze/validate/format/lint/audit); vacuous skip for the conditional SHOULD.
WARN	Subcommand naming follows a consistent verb/noun convention	subcommand naming is inconsistent: 5 non-verb subcommand(s) (agent, leader, mcp, plugin, worktree) mix verb and non-verb children at the second level, so an agent cannot predict where the action lives. SHOULD-tier: pick a consistent shape (all verb-first, all noun-verb hierarchy, or any combination where each non-verb group's children are uniformly verbs). The verb list is a heuristic; inspect `--help` to confirm.
PASS	Operations are subcommands, not verb-shaped flags

P7: Bounded, High-Signal Responses

WARN	Quiet mode available	no --quiet/-q flag detected in --help output
PASS	`--verbose` flag for diagnostic escalation
SKIP	`--limit` / `--max-results` flag for list operations	no list-style subcommand detected (list/ls/search/query/find/show/get); vacuous skip for the list-only SHOULD.
SKIP	Cursor-based pagination flags for list traversal	no list-style subcommand detected; vacuous skip for the list-only MAY.
SKIP	`--timeout` flag for long-running operations	no long-running subcommand detected (serve/daemon/watch/tail/monitor/follow/run/start/stream); vacuous skip for the conditional SHOULD.
WARN	Help text advertises TTY-aware verbosity behavior	no TTY-aware language found in `--help`. MAY-tier — automatic verbosity reduction when stdout is piped or redirected lets agents skip the explicit `--quiet` flag. Behavioral probes cannot simulate a real TTY without a pty crate, so this audit relies on documented intent.

P8: Discoverable Through Agent Skill Bundles

PASS	Skill bundle has install path (`tool skill install [<host>]`)
PASS	`skill install --all` for multi-runtime install
PASS	`skill update` / `skill upgrade` for bundle refresh

Details

Version scored: 0.2.16
Audit date: 2026-06-02 17:00:39 UTC
Duration: 838ms
Platform: linux/x86_64
Mode: command
Anc build: 0.5.0
Install: curl -fsSL https://x.ai/cli/install.sh | bash

Embed the badge

This score (73%) clears the badge floor (70%). Copy this into your README:

[![agent-native](https://anc.dev/badge/grok.svg)](https://anc.dev/score/grok)

Preview:

Reproduce this scorecard for xai-grok-build locally and inspect the failing audits:

anc audit --command grok --output json

Install anc first if you don't have it. Add --output json to get the same JSON shape committed under scorecards/.