Talox 是 AI Skill Hub 本期精选Agent工作流之一。综合评分 7.5 分,整体质量较高。我们推荐使用将其纳入你的 AI 工具库,帮助提升工作效率。
基于Playwright的可状态化浏览器运行时,用于构建AI代理,具有隐身交互功能。突出其开源、可定制和隐身交互的优势。
Talox 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
基于Playwright的可状态化浏览器运行时,用于构建AI代理,具有隐身交互功能。突出其开源、可定制和隐身交互的优势。
Talox 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
# 方式一:npm 全局安装 npm install -g talox # 方式二:npx 直接运行(无需安装) npx talox --help # 方式三:项目依赖安装 npm install talox # 方式四:从源码运行 git clone https://github.com/AVANT-ICONIC/Talox cd Talox npm install npm start
# 命令行使用
talox --help
# 基本用法
talox [options] <input>
# Node.js 代码中使用
const talox = require('talox');
const result = await talox.run(options);
console.log(result);
# talox 配置说明 # 查看配置选项 talox --config-example > config.yml # 常见配置项 # output_dir: ./output # log_level: info # workers: 4 # 环境变量(覆盖配置文件) export TALOX_CONFIG="/path/to/config.yml"
<img src="https://capsule-render.vercel.app/api?type=waving&height=250&color=0:000000,40:0f172a,75:0f766e,100:0d9488&text=TALOX&fontColor=ffffff&fontSize=72&fontAlignY=35&desc=by%20AVANT%20ICONIC&descSize=15&descAlignY=52&animation=scaleIn" alt="Talox header" width="100%" />
<br />
<img src="./talox.webp" alt="Talox logo" width="72" />
<br />
<img src="https://readme-typing-svg.demolab.com?font=Inter&weight=600&size=22&pause=1200&color=2DD4BF¢er=true&vCenter=true&width=980&lines=Stateful+browser+runtime+for+AI+agents.;Persistent+profiles.+Deep+observability.+Structured+state+contracts.;Resilient+interaction+for+real-world+web+UIs." alt="Typing SVG" />
<br /> <br />
<p align="center"> <a href="#overview"><img src="https://img.shields.io/badge/overview-0f172a?style=for-the-badge&logo=readme&logoColor=white" alt="Overview" /></a> <a href="#key-capabilities"><img src="https://img.shields.io/badge/capabilities-0f766e?style=for-the-badge&logo=sparkles&logoColor=white" alt="Key Capabilities" /></a> <a href="#agent-overlay"><img src="https://img.shields.io/badge/overlay-0d9488?style=for-the-badge&logo=eye&logoColor=white" alt="Agent Overlay" /></a> <a href="#architecture"><img src="https://img.shields.io/badge/architecture-0d9488?style=for-the-badge&logo=gitbook&logoColor=white" alt="Architecture" /></a> <a href="#quick-start"><img src="https://img.shields.io/badge/quick%20start-134e4a?style=for-the-badge&logo=rocket&logoColor=white" alt="Quick Start" /></a> <a href="#talox-cli"><img src="https://img.shields.io/badge/Talox%20CLI-0a5b86?style=for-the-badge&logo=terminal&logoColor=white" alt="Talox CLI" /></a> <a href="#browser-lab"><img src="https://img.shields.io/badge/browser%20lab-0a5b86?style=for-the-badge&logo=experiment&logoColor=white" alt="Browser Lab" /></a> <a href="#contributing"><img src="https://img.shields.io/badge/contributing-115e59?style=for-the-badge&logo=githubsponsors&logoColor=white" alt="Contributing" /></a> </p>
<p align="center"> <img src="https://img.shields.io/badge/TypeScript-5.0+-3178C6?style=flat-square&logo=typescript&logoColor=white" alt="TypeScript" /> <img src="https://img.shields.io/badge/Playwright-Chromium-45ba4b?style=flat-square&logo=playwright&logoColor=white" alt="Playwright" /> <img src="https://img.shields.io/badge/Node.js-18+-339933?style=flat-square&logo=nodedotjs&logoColor=white" alt="Node.js" /> <img src="https://img.shields.io/badge/License-AGPL--3.0--only-0d9488?style=flat-square&logo=opensourceinitiative&logoColor=white" alt="AGPL-3.0-only" /> <img src="https://img.shields.io/github/v/release/AVANT-ICONIC/Talox?color=0d9488<img src="https://img.shields.io/badge/version-7.6.0-0d9488?style=flat-square" alt="v7.6.0" />style=flat-square" alt="version" /> </p>
<p align="center"> <strong>Local browser runtime for agents.</strong><br /> Stealth interaction layer. Structured page state. Resilient automation. Deep observability for real-world UIs. </p>
<p align="center"> <a href="./docs/TALOX-SPEC.md">Spec</a> · <a href="./docs/TALOX-ARCHITECTURE.md">Architecture</a> · <a href="./docs/TALOX-ROADMAP.md">Roadmap</a> · <a href="./CHANGELOG.md">Changelog</a> </p>
</div>
Talox is a local browser runtime — agents work inside a real browser with maximum stealth and human-like behavior. Everything is always on: HumanMouse (Bezier paths, Fitts's Law), BotDetector, AdaptationEngine, full AX-tree perception — no modes, no toggling. Every action returns a structured JSON contract: AX-Tree, DOM state, console output, network events, and visual diffs — ready for any agent to consume directly, without parsing HTML or interpreting screenshots.
import { TaloxController } from 'talox';
const talox = new TaloxController({ settings: { verbosity: 0 } }); // v4 shorthand
// Agent does everything with full stealth — always on
await talox.launch('my-agent', 'ops');
const state = await talox.navigate('https://example.com');
await talox.click('button[type=submit]'); // HumanMouse, stealth, always on
await talox.stop();
// Headed mode — shows browser with glow frame + fake cursor overlay
const talox = new TaloxController({ headed: true }); // v4 shorthand
// Human Takeover — agent pauses, human does a step (e.g., login, 2FA)
await talox.requestHumanTakeover('Need 2FA code');
// → cyan glow → amber, "▶ Resume Agent" button appears
// human does their thing
talox.resumeAgent(); // or auto-resumes after timeout
---
---
Talox ships two browser automation packages:
| Package | Role | When it's used |
|---|---|---|
| **playwright-core** | Chromium automation engine | All core automation: navigating, clicking, typing, AX-tree collection, console/network interception. Uses system Chrome for real browser fingerprint. |
Talox uses playwright-core with channel: "chrome" for maximum stealth — system Chrome provides a real browser fingerprint. The 19-layer JS stealth stack is injected via addInitScript before any page scripts run.
import { TaloxController } from 'talox';
// v4: config object as first arg
const talox = new TaloxController({ headless: true });
await talox.launch('my-agent', 'ops');
const state = await talox.navigate('https://example.com');
// Talox returns structured JSON — no HTML parsing needed
console.log('Title:', state.title);
console.log('Interactive elements:', state.interactiveElements.length);
console.log('Layout bugs detected:', state.bugs.length);
await talox.stop();
See examples/minimal-agent.ts for a copy-paste starting point.
npx playwright install chromium --with-deps ```
Playwright's Chromium requires system dependencies that aren't present on a bare Linux VPS. Run this once after install:
npx playwright install chromium --with-deps
Talox defaults to headless: true, so no display server is needed. The required Chromium flags (--no-sandbox, --disable-dev-shm-usage) are set automatically.
All features work fully headless — including screenshots, visual diff (Pixelmatch/SSIM), OCR (Tesseract.js), and GhostVisualizer. None of these require a display; they operate on pixel buffers and pure JS.
If you're on a low-memory VPS (< 1GB), set PLAYWRIGHT_CHROMIUM_SANDBOX=0 as an environment variable as well.
---
npm package coming soon. For now, install directly from GitHub:
```bash git clone https://github.com/AVANT-ICONIC/Talox.git cd Talox npm install npm run build
---
examples/browser-lab.ts walks through a sandbox profile that:
PRESETS.observe with headed overlay/recording enabled,talox-sessions/ (useful as a sanity check after npm install && npm run build + npx playwright install chromium).Run the demo to validate the packaged presets, tools, and reporting output in one headed experiment.
---
npm package coming soon. Until then, run the CLI from the repo: node dist/cli/talox.js observe --help
node dist/cli/talox.js observe starts the human-visible observe mode with headed browser, overlay, Markdown/HTML reporting, and the window.__taloxEmit__ bridge so you can annotate interactions while the agent runs.node dist/cli/talox.js run starts the autonomous task execution loop with an LLM planner for self-driving browser workflows.node dist/cli/talox.js skill create interactively creates a new SKILL.md file for per-site strategies.node dist/cli/talox.js init (aka the create-talox-app workflow) scaffolds a clean talox-app starter project with PRESETS.observe, ts-node/typescript tooling, Playwright install scripts, and examples/browser-lab.ts.ops, qa, observe, research, login-heavy) live in src/presets.ts so you can reuse curated verbosity, headedness, and human-takeover posture with a single spread or merge.getPracticalTools(talox) demonstrate background tabs, API response capture, Markdown snapshot export, on-site search, and visible structured content extraction, so your packaged profiles already include actionable browser lab helpers.Write a small script that uses TaloxController, then execute it with npx codex run scripts/codex-talox.mjs so Codex orchestrates Talox like any other skill.
import { TaloxController } from 'talox';
const talox = new TaloxController('./profiles');
await talox.launch('codex-agent', 'ops');
const state = await talox.navigate('https://example.com');
console.log(JSON.stringify(state, null, 2));
await talox.stop();
Codex can read the printed JSON, pass it into its function-calling loop, or pipe it into another skill for further analysis.
talox run # Start an autonomous task execution loop
talox skill create # Interactively create a new skill file
---
Talox ships as a flexible Node package with a CLI-first philosophy. The most efficient way to use Talox is through the CLI or direct Node.js scripting — no context-window bloat from MCP servers. For the community, an MCP server is also available for agents that prefer tool-use protocols.
| Module | Role |
|---|---|
AutonomousLoop | Plan-execute-observe cycle with convergence detection |
LLMPlanner | LLM-backed planner; decides next actions and generates skills from blockers |
SkillWriter | Writes SKILL.md files from blocker analysis |
SkillLoader | Auto-discovers and loads skills by hostname |
resolveChallenge() | Public API on TaloxController for programmatic challenge resolution |
talox.on('navigation', (event) => console.log('Navigated to:', event.data.url));
talox.on('consoleError', (event) => console.log('Error:', event.data.error));
talox.on('bugDetected', (event) => console.log('Bug:', event.data));
node dist/cli/talox.js observe --profile my-observe-run --class qa --browser chromium --output-dir ./talox-sessions --format both ```
That command opens a headed Chromium session with the overlay + annotation buffer already armed, logs console/network errors, and writes JSON/Markdown reports. Run talox observe --help to tune the profile class, browser, verbosity, or report directory without touching code.
Each session now lives inside its own subfolder under the configured output directory (default talox-sessions/session-{id}-{timestamp}). The folder contains report.json, report.md, report.html, timeline.json, event-log.json, failures.json, diffs.json, bugs.json, and trace.json, along with a screenshots/ directory for before/after snapshots. The HTML report surfaces the timeline, event log, diffs, bug summaries, and artifact trace so you can understand why clicks, selectors, or adaptations behaved the way they did.
This produces a Markdown report with every issue attached to the specific element where it was found — something impossible with traditional assertion-based tests.
---
Talox's mission is to be the obvious local-first browser runtime. The table below contrasts that focus with other well-known agent/browser automation options so the positioning stays sharp.
| Experience | Category | Why Talox wins |
|---|---|---|
| **Talox** | Local browser runtime for agents | Structured state contract, resilience-first interaction, takeover-ready observability, and optional headed overlay keep Talox grounded in real-world UI work. |
| **Webclaw** | Cloud automation + scraping | Heavy remote tooling; Talox keeps the browser local so agents control data, sessions, and human takeover without third-party lock-in. |
| **Crawl4AI** | Hosted crawling + QA bots | Built for fleets and scale; Talox trades scale for fidelity with persistent local sessions, biomechanical interactions, and deep debug artifacts. |
| **browser-use** | Playwright + heuristics | Useful for scripted flows but lacks Talox’s takeover hooks, verbose telemetry, and structured JSON contract — Talox is designed as an agent runtime, not just UI scripting. |
| **pebkac** | Operator cockpit | Inspires the operator mindset, but Talox keeps the runtime disciplined: optional tools/overlays, no hosted chaos. |
---
Talox 是一个本地浏览器运行时 — 代理在一个真实的浏览器中工作,最大限度地减少了隐蔽性和人类行为的行为。Everything 都是始终开启的:HumanMouse(Bezier 路径、Fitts 的法则)、BotDetector、AdaptationEngine、全 AX 树感知 —— 没有模式,没必要切换。每个动作都返回一个结构化的 JSON 合约:AX 树、DOM 状态、控制台输出、网络事件和视觉差异 —— 准备好任何 ag 的任何需求。
Talox 具有以下关键功能:
Talox 带来两个浏览器自动化包:
安装 Playwright Chromium(第一次或在新服务器上):
Talox CLI & 包装:
Talox 的工作流程和模块包括:
Talox是一个开源的AI工作流,基于Playwright构建,可状态化浏览器运行时,用于构建AI代理,具有隐身交互功能。其开源和可定制的优势值得关注,但其隐身交互功能可能存在风险。
该工具使用 NOASSERTION 协议,商用场景请仔细阅读协议条款,必要时咨询法律意见。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
📄 NOASSERTION — 请查阅原始协议条款了解具体使用限制。
经综合评估,Talox 在Agent工作流赛道中表现稳健,质量良好。如果你已有明确的使用需求,可以直接上手体验;如果还在评估阶段,建议对比同类工具后再做决策。
| 原始名称 | Talox |
| 原始描述 | 开源AI工作流:Stateful browser runtime for AI agents built on Playwright, with stealth interac。⭐9 · TypeScript |
| Topics | workflowaccessibility-treeagent-browserai-agentbrowser-automationbrowser-controllertypescript |
| GitHub | https://github.com/AVANT-ICONIC/Talox |
| License | NOASSERTION |
| 语言 | TypeScript |
收录时间:2026-05-18 · 更新时间:2026-05-19 · License:NOASSERTION · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端