codex-cli skill

This skill runs OpenAI Codex CLI for second-opinion audits, code review, and delegated non-interactive tasks to improve code quality.

To add this skill to your agents, run `npx playbooks add skill georgekhananaev/claude-skills-vault --skill codex-cli`.

---
name: codex-cli
description: Run OpenAI Codex CLI for coding tasks and second-opinion audits. Use when a user asks to run/ask/use Codex, says "codex prompt", or wants Claude to delegate a logic/code review to OpenAI models.
---

# Codex CLI

Run OpenAI Codex CLI locally for second-opinion audits, code review, and non-interactive task execution.

## Prerequisites

Codex CLI must be installed and authenticated:

1. **Install:** `npm install -g @openai/codex`
2. **Auth:** `codex login`
3. **Verify:** `codex --version`
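
The prerequisite steps above can be folded into a small preflight check that runs before any delegated call. This is a minimal sketch: `codex_preflight` is a hypothetical helper name, not something Codex CLI ships.

```bash
# Hypothetical helper: confirm the codex binary is reachable before delegating.
codex_preflight() {
  if ! command -v codex >/dev/null 2>&1; then
    # Point the user at the install step from the list above.
    echo "codex not found; install with: npm install -g @openai/codex" >&2
    return 1
  fi
  # Same verification step as above; prints the installed version.
  codex --version
}
```

Calling `codex_preflight || exit 1` at the top of a script stops early with a readable message instead of a "command not found" error further down.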

## Core Execution Pattern

Use `codex exec` for delegated prompts (non-interactive):

```bash
codex exec "Your prompt here"
```

When the user says "codex prompt", treat it as:

```bash
codex exec "<user prompt>"
```
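
One way to make the "codex prompt" mapping concrete is a thin wrapper that always passes the user's text as a single quoted argument. The sketch below assumes two names that are not part of Codex CLI: the function `codex_prompt` and the `CODEX_BIN` override, which exists only so the call can be dry-run.

```bash
# Hypothetical wrapper for the "codex prompt" intent.
# CODEX_BIN is an assumed override (defaults to the real codex binary).
codex_prompt() {
  if [ "$#" -eq 0 ]; then
    echo "usage: codex_prompt <prompt>" >&2
    return 2
  fi
  # "$*" joins all words into ONE argument, so an unquoted multi-word
  # prompt still reaches `codex exec` as a single string.
  "${CODEX_BIN:-codex}" exec "$*"
}
```

For a dry run, `CODEX_BIN=echo codex_prompt Audit this logic` prints `exec Audit this logic` without contacting any model.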

## Model Guidance

Use the default configured model unless the user asks otherwise.

Latest tested working model in this environment:

```bash
codex exec -m gpt-5.3-codex "Your prompt"
```

Compatibility note:
- `gpt-5-codex` may fail when your config sets `model_reasoning_effort = "xhigh"`.
- If you must use `gpt-5-codex`, set reasoning effort explicitly:

```bash
codex exec -m gpt-5-codex -c model_reasoning_effort="high" "Your prompt"
```
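
The compatibility note can be encoded once so callers do not have to remember it. The following is a sketch under assumptions: `codex_exec_model` and `CODEX_BIN` are hypothetical names, and the only model-specific rule it knows is the `gpt-5-codex` one stated above.

```bash
# Hypothetical helper. Usage: codex_exec_model <model-or-empty> <prompt...>
# CODEX_BIN is an assumed override used for dry runs.
codex_exec_model() {
  model=${1-}
  if [ "$#" -gt 0 ]; then shift; fi
  case "$model" in
    "")
      # No model given: rely on the default configured model.
      "${CODEX_BIN:-codex}" exec "$@" ;;
    gpt-5-codex)
      # Pin reasoning effort explicitly, per the compatibility note.
      "${CODEX_BIN:-codex}" exec -m "$model" -c model_reasoning_effort="high" "$@" ;;
    *)
      "${CODEX_BIN:-codex}" exec -m "$model" "$@" ;;
  esac
}
```

Passing an empty string as the model keeps the default, which matches the guidance to use the configured model unless the user asks otherwise.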

## Commands

### Non-Interactive Execution

```bash
# Basic task
codex exec "Audit this logic for edge cases"

# Explicit model
codex exec -m gpt-5.3-codex "Review this implementation strategy"

# Full-auto mode (sandboxed, lower friction)
codex exec --full-auto "Implement the requested refactor"

# Read-only sandbox (analysis only)
codex exec -s read-only "Find bugs in this code path"

# Workspace-write sandbox
codex exec -s workspace-write "Apply the fix and update tests"

# Custom working directory
codex exec -C /path/to/project "Evaluate this repository"

# Save final output to file
codex exec -o output.txt "Summarize key risks"

# JSONL event stream
codex exec --json "Produce structured findings"

# Pipe context from stdin
cat context.txt | codex exec -
```
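
The stdin form is handy when the instruction and the material to inspect come from different places. As an illustrative sketch, the hypothetical helper `codex_with_context` below (with the assumed `CODEX_BIN` override) writes the instruction first, then the file contents, and pipes both to `codex exec -`.

```bash
# Hypothetical helper. Usage: codex_with_context <prompt> <file>
# CODEX_BIN is an assumed override used for testing and dry runs.
codex_with_context() {
  prompt=$1
  file=$2
  [ -r "$file" ] || { echo "cannot read: $file" >&2; return 1; }
  {
    # Instruction first, a blank line, then the file as context.
    printf '%s\n\n' "$prompt"
    cat "$file"
  } | "${CODEX_BIN:-codex}" exec -
}
```

Keeping the instruction ahead of the context is a design choice for readability of the combined prompt, not a requirement stated by the CLI.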

### Code Review

Use `codex review` for repository diffs:

```bash
# Review uncommitted changes
codex review --uncommitted

# Review against a base branch
codex review --base main

# Review a specific commit
codex review --commit abc123

# Custom review instructions
codex review "Focus on security issues"

# Combined
codex review --base main "Check for performance regressions"
```

## Important Flag Placement

`--search` and `-a/--ask-for-approval` are top-level flags. Put them before `exec` or `review`.

Correct:

```bash
codex --search -a on-request exec "Your prompt"
codex --search -a on-request review --uncommitted
```

Avoid:

```bash
codex exec --search "Your prompt"
codex exec -a on-request "Your prompt"
```
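
Because the ordering is easy to get wrong by hand, a wrapper can assemble the command so top-level flags always precede the subcommand. This is a sketch, not part of Codex CLI: `codex_run`, `CODEX_SEARCH`, `CODEX_APPROVAL`, and `CODEX_BIN` are all hypothetical names.

```bash
# Hypothetical wrapper. Usage: codex_run <exec|review> [subcommand args...]
# Assumed environment variables: CODEX_SEARCH=1 adds --search,
# CODEX_APPROVAL=<policy> adds -a <policy>, CODEX_BIN overrides the binary.
codex_run() {
  sub=${1-}
  case "$sub" in
    exec|review) shift ;;
    *) echo "unsupported subcommand: $sub" >&2; return 2 ;;
  esac
  # Start from "<subcommand> [args]" and prepend top-level flags,
  # giving: codex [top-level flags] <subcommand> [args].
  set -- "$sub" "$@"
  if [ -n "${CODEX_APPROVAL:-}" ]; then
    set -- -a "$CODEX_APPROVAL" "$@"
  fi
  if [ "${CODEX_SEARCH:-0}" = 1 ]; then
    set -- --search "$@"
  fi
  "${CODEX_BIN:-codex}" "$@"
}
```

With `CODEX_BIN=echo CODEX_SEARCH=1 CODEX_APPROVAL=on-request`, the call `codex_run exec "Your prompt"` prints `--search -a on-request exec Your prompt`, the same ordering as the correct examples above.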

## Useful Flags

| Flag | Description |
|------|-------------|
| `-m` | Model (recommended explicit example: `gpt-5.3-codex`) |
| `-s` | Sandbox: `read-only`, `workspace-write`, `danger-full-access` |
| `-a` | Approval policy: `untrusted`, `on-failure`, `on-request`, `never` (top-level flag; place before `exec` or `review`) |
| `-C` | Working directory |
| `-o` | Write last message to file |
| `--full-auto` | Sandboxed auto-execution (`-a on-request -s workspace-write`) |
| `--json` | JSONL event output |
| `--search` | Enable the web search tool (top-level flag; place before `exec` or `review`) |
| `--add-dir` | Additional writable directories |
| `-c key=value` | Override config (example: `-c model_reasoning_effort="high"`) |

## Best Practices

- Prefer `codex exec` for delegated prompts instead of interactive `codex`
- Start with `-s read-only` for audits and second opinions
- Use `--full-auto` only when you expect autonomous edits
- Keep prompts explicit about expected output format
- Add `-o` when another tool or agent must consume the result
- Run `codex review --uncommitted` before committing as a quick extra pass
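
Several of these practices combine naturally: a read-only audit whose final message is written with `-o` and checked before another tool consumes it. The sketch below assumes the hypothetical helper name `codex_audit` and the `CODEX_BIN` override; the empty-file check is an added safeguard, not documented Codex behavior.

```bash
# Hypothetical helper. Usage: codex_audit <output-file> <prompt...>
# CODEX_BIN is an assumed override used for testing and dry runs.
codex_audit() {
  out=$1
  shift
  # Read-only sandbox for analysis; -o writes the last message to "$out".
  "${CODEX_BIN:-codex}" exec -s read-only -o "$out" "$*" || return $?
  # Fail if nothing was written, so downstream tools never read an empty result.
  [ -s "$out" ] || { echo "no output written to $out" >&2; return 1; }
}
```

A caller might run `codex_audit findings.txt "List bugs as a numbered list"` and hand `findings.txt` to the next step only when the function returns 0.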

## Overview

This skill runs the OpenAI Codex CLI to perform non-interactive coding tasks, second-opinion audits, and automated code reviews. It provides a clear execution pattern, recommended flags, and model guidance so you can delegate checks, refactors, and repository diff reviews reliably. Use it to plug Codex into CI or local workflows, or when you want a compact delegated audit from an external model.

## How this skill works

The skill maps user intents such as "codex prompt", or any request to run Codex, onto concrete `codex` CLI commands (primarily `codex exec` and `codex review`). It recommends sandbox modes, model selection, and top-level flag placement to avoid common errors and to control write access. Output can be saved to a file, streamed as JSONL, or written to stdout for downstream tools to consume.

## When to use it

- You want a second-opinion audit or logic review without interactive back-and-forth.
- You explicitly ask to run/ask/use Codex or say "codex prompt".
- You need non-interactive execution: audits, summaries, or automated fixes.
- You want repository diffs reviewed (uncommitted changes, specific commits, or changes against a base branch).
- You need to integrate Codex into scripts or CI pipelines for automated checks.

## Best practices

- Prefer `codex exec` for delegated prompts rather than interactive sessions.
- Start with `-s read-only` for audits and escalate sandbox/write permissions only when necessary.
- Place `--search` and `-a/--ask-for-approval` before `exec` or `review` as top-level flags.
- Keep prompts explicit about expected output format and include `-o` when another tool will consume the result.
- Use `--full-auto` only when you expect autonomous edits and have safe approval policies in place.

## Example use cases

- Quickly audit a tricky algorithm path: `codex exec -s read-only "Audit this function for edge cases"`.
- Automated PR sanity check in CI: `codex review --base main --json` to produce structured findings.
- Apply an accepted quick fix in a sandboxed workspace: `codex exec --full-auto "Apply fix for X and update tests"`.
- Save a model summary for downstream tooling: `codex exec -o output.txt "Summarize key risks in this module"`.
- Review uncommitted local changes before committing: `codex review --uncommitted "Check for security issues"`.

## FAQ

### Which model should I use by default?

Use the default configured model unless you need a specific one. `gpt-5.3-codex` is the latest tested example in this environment.

### Where must I place approval or search flags?

`--search` and `-a/--ask-for-approval` are top-level flags and must be placed before `exec` or `review`, not after.

### How do I avoid model failures with gpt-5-codex?

If you use `gpt-5-codex` and encounter failures tied to reasoning effort, explicitly set `-c model_reasoning_effort="high"`.