This skill executes Python code in the OSWorld environment and returns execution results including status, duration, and I/O streams.
npx playbooks add skill bdambrosio/cognitive_workbench --skill osworld-execute
---
name: osworld-execute
type: python
description: "Execute Python code (typically pyautogui commands) in the OSWorld environment. Returns execution result with success status, return code, duration, stdout, and stderr."
schema_hint:
  value: "string (Python code)"
  python: "string (Python code, alternative to value)"
  return_observation: "bool (default: false)"
  out: "$variable"
examples:
  - '{"type":"osworld-execute","python":"pyautogui.click(100,200)","out":"$result"}'
  - '{"type":"osworld-execute","value":"pyautogui.typewrite(\"hello\")","return_observation":true,"out":"$result"}'
---
# OSWorld Execute Tool (Level 4)
## Input
- `python` or `value`: Python code string (required), typically pyautogui commands. The two parameters are interchangeable; supply one of them.
- `return_observation`: bool (default: false) - include the post-execution observation in the response
- `out`: variable (for example `$result`) that the resulting Note ID is bound to
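The inputs above can be assembled programmatically. The following is a minimal sketch, not part of the skill itself: `build_execute_action` and its `use_value_key` flag are hypothetical names introduced here for illustration. It uses `repr()` to embed user text in the code string so that quotes are escaped safely.

```python
import json


def build_execute_action(code, return_observation=False, out="$result",
                         use_value_key=False):
    """Build an osworld-execute action dict (hypothetical helper)."""
    if not code or not code.strip():
        raise ValueError("a Python code string is required")
    # `value` is documented as an alternative to `python`.
    key = "value" if use_value_key else "python"
    action = {"type": "osworld-execute", key: code}
    if return_observation:
        # Only emitted when requested; the documented default is false.
        action["return_observation"] = True
    action["out"] = out
    return action


text = "hello"
# repr() quotes and escapes the text so it is a valid Python literal.
code = f"pyautogui.typewrite({text!r})"
action = build_execute_action(code, return_observation=True)
print(json.dumps(action))
```

Building the code string with `repr()` avoids hand-escaping quotes inside the JSON payload, which is where the `\"hello\"` escaping in the frontmatter example comes from.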
## Output
- Note ID (bound to `out` variable) containing:
  - `text`: formatted execution result
  - `format`: "text"
  - `metadata`: execution data including:
    - `success`: boolean - whether execution succeeded
    - `returncode`: integer - return code (0 = success)
    - `duration_ms`: integer - execution duration in milliseconds
    - `step_counter`: integer - step counter after execution
    - `stdout`: string - standard output
    - `stderr`: string - standard error
    - `python_code`: string - the executed code
    - `observation`: dict (if return_observation=true) - observation after execution
    - `timestamp`: float (if return_observation=true) - observation timestamp
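As an illustration of how a caller might read these fields, here is a small sketch that condenses the metadata into one line. It assumes the metadata has already been retrieved from the note as a plain dict; `summarize_result` is a hypothetical helper, and the sample values are made up for the example rather than taken from a real run.

```python
def summarize_result(metadata):
    """Return a one-line summary of osworld-execute metadata (hypothetical helper)."""
    ok = bool(metadata.get("success")) and metadata.get("returncode") == 0
    status = "ok" if ok else "failed"
    line = f"{status} rc={metadata.get('returncode')} in {metadata.get('duration_ms')} ms"
    stderr = (metadata.get("stderr") or "").strip()
    if not ok and stderr:
        # The last stderr line usually carries the exception message.
        line += f": {stderr.splitlines()[-1]}"
    return line


# Illustrative metadata only; field names follow the list above.
sample = {
    "success": False,
    "returncode": 1,
    "duration_ms": 42,
    "step_counter": 3,
    "stdout": "",
    "stderr": "Traceback (most recent call last):\n"
              "NameError: name 'pyautogu' is not defined",
    "python_code": "pyautogu.click(100, 200)",
}
print(summarize_result(sample))
```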
## Configuration
- `OSWORLD_URL` environment variable (defaults to `http://localhost:3002`)
- Or pass `osworld_url` in character config's `osworld_config` section
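The lookup could be sketched as follows. The precedence shown (character config first, then the environment variable, then the default) is an assumption made for this example; the source does not say which setting wins when both are present. `resolve_osworld_url` is a hypothetical name.

```python
import os

DEFAULT_OSWORLD_URL = "http://localhost:3002"


def resolve_osworld_url(character_config=None, environ=None):
    """Pick the OSWorld URL from config, environment, or default (hypothetical helper)."""
    environ = os.environ if environ is None else environ
    osworld_config = (character_config or {}).get("osworld_config") or {}
    # Assumption: an explicit config entry overrides the environment variable.
    return (
        osworld_config.get("osworld_url")
        or environ.get("OSWORLD_URL")
        or DEFAULT_OSWORLD_URL
    )


print(resolve_osworld_url({}, {}))
```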
## Common Workflow
```json
{"type":"osworld-observe","out":"$obs"}
{"type":"osworld-execute","python":"pyautogui.click(100,200)","out":"$result"}
{"type":"osworld-observe","out":"$obs2"}
```
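The same observe, execute, observe sequence can be generated for an arbitrary code string. This is a sketch only: `observe_execute_observe` is a hypothetical helper, and its default variable names simply mirror the ones used in the workflow above.

```python
import json


def observe_execute_observe(code, before="$obs", result="$result", after="$obs2"):
    """Return the three-step plan as a list of action dicts (hypothetical helper)."""
    return [
        {"type": "osworld-observe", "out": before},
        {"type": "osworld-execute", "python": code, "out": result},
        {"type": "osworld-observe", "out": after},
    ]


plan = observe_execute_observe("pyautogui.click(100,200)")
for action in plan:
    # Compact separators reproduce the one-line JSON style used in this document.
    print(json.dumps(action, separators=(",", ":")))
```

Observing before and after the action lets the caller compare the two observations to confirm that the click had the intended effect.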
## Notes
- Python code is executed directly in the OSWorld environment
- Common commands: `pyautogui.click(x, y)`, `pyautogui.typewrite(text)`, `pyautogui.press(key)`
- No retries or corrections - Jill owns error handling
- Execution is synchronous and blocking
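Because the tool does not retry, any retry policy lives with the caller. A minimal sketch of one such policy follows, assuming the caller has some function that runs the tool and returns the metadata dict. Both `run_with_retry` and the stand-in `fake_execute` are hypothetical and exist only to make the example runnable.

```python
def run_with_retry(execute, code, max_attempts=3):
    """Call execute(code) until the metadata reports success (hypothetical policy)."""
    attempts = []
    for _ in range(max_attempts):
        metadata = execute(code)
        attempts.append(metadata)
        if metadata.get("success") and metadata.get("returncode") == 0:
            break
    return attempts


# Stand-in executor for illustration: fails on the first call, then succeeds.
calls = {"n": 0}


def fake_execute(code):
    calls["n"] += 1
    ok = calls["n"] >= 2
    return {
        "success": ok,
        "returncode": 0 if ok else 1,
        "stdout": "",
        "stderr": "" if ok else "simulated failure",
        "python_code": code,
    }


history = run_with_retry(fake_execute, "pyautogui.press('enter')")
print(len(history), history[-1]["success"])
```

GUI actions are generally not idempotent (a repeated click or keypress may act on a changed screen), so a real policy would likely re-observe before retrying rather than resend the same code blindly.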
This skill executes Python code inside an OSWorld runtime, typically to run pyautogui commands that control a virtual desktop. It returns a structured execution result that includes success status, return code, duration, stdout, and stderr. Use it when you need deterministic, synchronous control over the OSWorld environment from an agent.
You send a Python code string in the `python` parameter (or its alternative, `value`) to be executed directly in the OSWorld environment. The tool runs the code synchronously and returns a note containing execution metadata: success boolean, return code, duration in milliseconds, step counter, stdout, stderr, and the executed Python code. When `return_observation` is true, the metadata also includes an observation snapshot and its timestamp.
How do I provide the Python code?
Pass the code string as the `python` parameter, or use `value` as an alternative; the string is executed directly in OSWorld.
Can I get the environment observation after execution?
Yes. Set `return_observation` to true to include `observation` and `timestamp` in the result metadata.