browser-use 是什么工具？

browser-use 是一款Python开发的AI辅助工具。开源AI工作流：🌐 Make websites accessible for AI agents. Automate tasks online with ease.。⭐93.6k · Python

browser-use 如何安装和开始使用？

访问 browser-use 的 GitHub 仓库或官方网站，按照 README 文档中的步骤安装依赖并运行。通常需要 Python 3.8+ 或 Node.js 16+ 基础环境。

browser-use 是否免费？许可证是什么？

browser-use 完全免费，采用 MIT 许可证开源发布，任何人都可以免费使用、修改和分发。

browser-use 适合哪些用户使用？

browser-use 对初学者友好，无需深厚技术背景即可快速上手。同时也适合有经验的开发者和 AI 工程师进行深度定制。

browser-use 的社区活跃度和项目维护状况如何？

browser-use 在 GitHub 上已获得 93,640 个 Star，是社区广泛认可的热门项目，持续活跃更新，维护良好。

📄 工具详情 ⚙️ 安装教程 📚 使用教程

能力标签

🔌 MCP 🤖 Agent 💻 CLI 🧠 Claude ✨ GPT 🖥 本地 LLM

⚙️

Agent工作流

browser-use Agent工作流

基于 Python · 无代码搭建完整 AI 自动化流程

英文名：browser-use

⭐ 93.6k Stars 🍴 10.6k Forks 💻 Python 📄 MIT 🏷 AI 8.8分

8.8AI 综合评分

浏览器自动化AI智能体工作流网页自动化LLM应用

⬇ 下载源码 ZIP ⚙️ 配置说明

✦ AI Skill Hub 推荐

AI Skill Hub 强烈推荐：browser-use Agent工作流是一款优质的Agent工作流。在 GitHub 上收获超过 93.6k 颗 Star，AI 综合评分 8.8 分，在同类工具中表现稳健。如果你正在寻找可靠的Agent工作流解决方案，这是一个值得深入了解的选择。

📚 深度解析

browser-use Agent工作流是一套完整的 AI Agent 自动化工作流方案。随着 AI 能力的不断提升，基于 Agent 的自动化工作流正在成为提升个人和团队效率的核心方式。区别于传统的 RPA 自动化（模拟鼠标键盘操作），AI Agent 工作流通过理解任务意图、动态规划执行路径，能够处理更复杂的非结构化任务。

browser-use Agent工作流工作流的设计遵循"最小配置，最大复用"原则：核心逻辑已经封装好，用户只需配置自己的 API Key 和业务参数即可快速上手。工作流内置错误处理和重试机制，在网络波动或 API 限速等情况下仍能稳定运行，适合作为生产环境的自动化基础设施。

在实际部署时，建议先在测试环境中运行 3-5 次，验证各个环节的输出结果符合预期，再部署到生产环境。AI Skill Hub 评分 8.8 分，是同类 Agent 工作流中的精选推荐。

📋 工具概览

browser-use Agent工作流是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排，将复杂的多步骤任务拆解为清晰的自动化流程，实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成，适合构建数据处理管线、业务自动化和 AI 辅助决策系统。

GitHub Stars

⭐ 93.6k

开发语言

Python

支持平台

Windows / macOS / Linux

维护状态

活跃维护，更新频繁

开源协议

MIT

AI 综合评分

8.8 分

工具类型

Agent工作流

Forks

10.6k

📖 中文文档

以下内容由 AI Skill Hub 根据项目信息自动整理，如需查看完整原始文档请访问底部「原始来源」。

📌 核心特色

可视化 Agent 工作流编排，无需编写复杂代码
支持多步骤自动化任务链，实现全流程无人值守
与外部 API、数据库和第三方服务无缝集成
内置错误处理与自动重试机制，保障稳定运行
提供可复用的自动化模板，快速在同类场景部署

🎯 主要使用场景

自动化日常重复性工作，将精力集中于创造性任务
构建数据采集 → 处理 → 输出的完整自动化管线
实现跨平台、跨系统的数据流转和业务协同

以下安装命令基于项目开发语言和类型自动生成，实际以官方 README 为准。

安装命令

# 方式一：pip 安装（推荐）
pip install browser-use

# 方式二：虚拟环境安装（推荐生产环境）
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install browser-use

# 方式三：从源码安装（获取最新功能）
git clone https://github.com/browser-use/browser-use
cd browser-use
pip install -e .

# 验证安装
python -c "import browser_use; print('安装成功')"

📋 安装步骤说明

访问 GitHub 仓库获取工作流文件
在对应平台（Dify / Flowise / Make 等）中找到「导入工作流」功能
上传工作流文件
按照提示配置必要的环境变量和 API Key
运行测试确认流程正常后投入使用

以下用法示例由 AI Skill Hub 整理，涵盖最常见的使用场景。

常用命令 / 代码示例

# 命令行使用
browser-use --help

# 基本用法
browser-use input_file -o output_file

# Python 代码中调用
import browser_use

# 示例
result = browser_use.process("input")
print(result)

以下配置示例基于典型使用场景生成，具体参数请参照官方文档调整。

配置示例

# browser-use 配置文件示例（config.yml）
app:
  name: "browser-use"
  debug: false
  log_level: "INFO"

# 运行时指定配置文件
browser-use --config config.yml

# 或通过环境变量配置
export BROWSER_USE_API_KEY="your-key"
export BROWSER_USE_OUTPUT_DIR="./output"

📑 README 深度解析真实文档完整度 82/100 查看 GitHub 原文 →

以下内容由系统直接从 GitHub README 解析整理，保留代码块、表格与列表结构。

简介

---

🌤️ Want to skip the setup? Use our cloud for faster, scalable, stealth-enabled browser automation!

uvx browser-use install # Run if you don't have Chromium installed


**2. [Optional] Get your API key from [Browser Use Cloud](https://cloud.browser-use.com/new-api-key?utm_source=github&utm_medium=readme-quickstart-api-key):**

🤖 LLM Quickstart

Direct your favorite coding agent (Cursor, Claude Code, etc) to Agents.md
Prompt away!

👋 Human Quickstart

1. Create environment and install Browser-Use with uv (Python>=3.11): ```bash uv init && uv add browser-use && uv sync

💡See [more examples here ↗](https://docs.browser-use.com/examples) and give us a star!

🚀 Template Quickstart

Want to get started even faster? Generate a ready-to-run template:

uvx browser-use init --template default

This creates a browser_use_default.py file with a working example. Available templates: - default - Minimal setup to get started quickly - advanced - All configuration options with detailed comments - tools - Examples of custom tools and extending the agent

You can also specify a custom output path:

uvx browser-use init --template default --output my_agent.py

Demos

.env

BROWSER_USE_API_KEY=your-key

GOOGLE_API_KEY=your-key

ANTHROPIC_API_KEY=your-key


**3. Run your first agent:**

python from browser_use import Agent, Browser, ChatBrowserUse

Integrations, hosting, custom tools, MCP, and more on our [Docs ↗](https://docs.browser-use.com)

Open Source vs Cloud

We benchmark Browser Use across 100 real-world browser tasks. Full benchmark is open source: browser-use/benchmark.

Use the Open-Source Agent - You need custom tools or deep code-level integration - We recommend pairing with our cloud browsers for leading stealth, proxy rotation, and scaling - Or self-host the open-source agent fully on your own machines

Use the Fully-Hosted Cloud Agent (recommended) - Much more powerful agent for complex tasks (see plot above) - Easiest way to start and scale - Best stealth with proxy rotation and captcha solving - 1000+ integrations (Gmail, Slack, Notion, and more) - Persistent filesystem and memory

FAQ

<details> <summary>What's the best model to use?</summary>

We optimized ChatBrowserUse() specifically for browser automation tasks. On avg it completes tasks 3-5x faster than other models with SOTA accuracy.

Pricing (per 1M tokens): - Input tokens: $0.20 - Cached input tokens: $0.02 - Output tokens: $2.00

For other LLM providers, see our supported models documentation. </details>

<details> <summary>Should I use the Browser Use system prompt with the open-source preview model?</summary>

Yes. If you use ChatBrowserUse(model='browser-use/bu-30b-a3b-preview') with a normal Agent(...), Browser Use still sends its default agent system prompt for you.

You do not need to add a separate custom "Browser Use system message" just because you switched to the open-source preview model. Only use extend_system_message or override_system_message when you intentionally want to customize the default behavior for your task.

If you want the best default speed/accuracy, we still recommend the newer hosted bu-* models. If you want the open-source preview model, the setup stays the same apart from the model= value. </details>

<details> <summary>Can I use custom tools with the agent?</summary>

Yes! You can add custom tools to extend the agent's capabilities:

from browser_use import Tools

tools = Tools()

@tools.action(description='Description of what this tool does.')
def custom_tool(param: str) -> str:
    return f"Result: {param}"

agent = Agent(
    task="Your task",
    llm=llm,
    browser=browser,
    tools=tools,
)

</details>

Yes! Browser-Use is open source and free to use. You only need to choose an LLM provider (like OpenAI, Google, ChatBrowserUse, or run local models with Ollama). </details>

<details> <summary>Terms of Service</summary>

This open-source library is licensed under the MIT License. For Browser Use services & data policy, see our Terms of Service and Privacy Policy. </details>

<details> <summary>How do I handle authentication?</summary>

Check out our authentication examples: - Using real browser profiles - Reuse your existing Chrome profile with saved logins - If you want to use temporary accounts with inbox, choose AgentMail - To sync your auth profile with the remote browser, run curl -fsSL https://browser-use.com/profile.sh | BROWSER_USE_API_KEY=XXXX sh (replace XXXX with your API key)

These examples show how to maintain sessions and handle authentication seamlessly. </details>

<details> <summary>How do I solve CAPTCHAs?</summary>

For CAPTCHA handling, you need better browser fingerprinting and proxies. Use Browser Use Cloud which provides stealth browsers designed to avoid detection and CAPTCHA challenges. </details>

<details> <summary>How do I go into production?</summary>

Chrome can consume a lot of memory, and running many agents in parallel can be tricky to manage.

For production use cases, use our Browser Use Cloud API which handles: - Scalable browser infrastructure - Memory management - Proxy rotation - Stealth browser fingerprinting - High-performance parallel execution </details>

Tell your computer what to do, and it gets it done.

&emsp;&emsp;&emsp;

</div>

Made with ❤️ in Zurich and San Francisco

🇨🇳 中文文档镜像 AI 翻译 2026-05-23

英文原文章节由系统翻译为中文摘要，便于快速理解。完整原文见上方 "📑 README 深度解析"。

📌 简介

本项目是一个简介，介绍了项目的基本信息。

🛠 安装步骤（Docker/pip/源码）

如果您没有 Chromium，使用以下命令安装：uvx browser-use install。您还可以获取 Browser Use Cloud 的 API 密钥。

🚀 使用教程

使用 LLM Quickstart，直接将您的编码代理（如 Cursor、Claude Code 等）指向 Agents.md。或者，您可以使用 Human Quickstart，创建环境并安装 Browser-Use，使用 uv (Python>=3.11)。

⚙️ 配置说明（含 MCP / env）

配置文件中包含了 BROWSER_USE_API_KEY 等关键参数。您可以在 .env 文件中设置这些参数。

🔌 API 说明

API 文档中包含了 Google API 密钥、Anthropic API 密钥等信息。您需要在配置文件中设置这些参数。

🔄 工作流/模块

工作流 / 模块说明中介绍了项目的整体架构，包括 Integrations、hosting、custom tools、MCP 等功能。

❓ FAQ 摘要

FAQ 中回答了常见问题，包括最佳模型选择、定价信息等。

🎯 aiskill88 AI 点评 A 级 2026-05-20

工程化完善的AI自动化框架，9.3万Stars体现高热度。核心创新在将LLM与浏览器交互深度融合，降低Web自动化门槛。代码质量优秀，文档完整，生产环境友好。

📚 实用指南（长尾问题）

适合谁

需要让 Claude / Cursor 操作本地工具的 AI 工程师
构建多智能体协作系统的 Agent 开发者

最佳实践

配置 MCP 服务器时建议使用 stdio 传输 + JSON-RPC，避免暴露公网
本地部署优先选 GGUF 量化模型，节省显存并保持响应速度
Agent 任务先做 dry-run 验证工具调用链，再开启自主执行

常见错误

API key 直接提交到 git 仓库（请用 .env 并加入 .gitignore）
MCP 配置路径拼错或权限不足，重启 Claude Desktop 才生效
显存不足直接 OOM — 优先降低 context 或换更小的量化模型
Python 依赖冲突：建议用 venv / uv 隔离环境

部署方案

CLI：直接 npm install -g / pip install，命令行调用
本地部署：CPU 8GB 起，GPU 推荐 16GB+ 显存
云端托管：可放在 Vercel / Railway / Fly.io 等 PaaS 平台

⚡ 核心功能

可视化 Agent 工作流编排，无需编写复杂代码
支持多步骤自动化任务链，实现全流程无人值守
与外部 API、数据库和第三方服务无缝集成
内置错误处理与自动重试机制，保障稳定运行
提供可复用的自动化模板，快速在同类场景部署

👥 适合谁

需要让 Claude / Cursor 操作本地工具的 AI 工程师
构建多智能体协作系统的 Agent 开发者

⭐ 最佳实践

配置 MCP 服务器时建议使用 stdio 传输 + JSON-RPC，避免暴露公网
本地部署优先选 GGUF 量化模型，节省显存并保持响应速度
Agent 任务先做 dry-run 验证工具调用链，再开启自主执行

⚠️ 常见错误

API key 直接提交到 git 仓库（请用 .env 并加入 .gitignore）
MCP 配置路径拼错或权限不足，重启 Claude Desktop 才生效
显存不足直接 OOM — 优先降低 context 或换更小的量化模型
Python 依赖冲突：建议用 venv / uv 隔离环境

👥 适合人群

自动化工程师和运维人员项目经理和业务分析师希望减少重复性工作的专业人士数字化转型团队

🎯 使用场景

自动化日常重复性工作，将精力集中于创造性任务
构建数据采集 → 处理 → 输出的完整自动化管线
实现跨平台、跨系统的数据流转和业务协同

⚖️ 优点与不足

✅ 优点

+GitHub 93.6k Star，社区高度认可
+MIT 协议，可免费商用
+大幅减少重复性人工操作
+可视化流程，清晰直观
+可扩展性强，支持复杂场景

⚠️ 不足

−初始配置和调试需投入一定时间
−强依赖外部服务的稳定性
−复杂场景需具备一定技术基础

⚠️ 使用须知

AI Skill Hub 为第三方内容聚合平台，本页面信息基于公开数据整理，不对工具功能和质量作任何法律背书。

建议在沙箱或测试环境中充分验证后，再部署至生产环境，并做好必要的安全评估。

📄 License 说明

🔗 相关工具推荐

ai-agents-for-beginners Agent工作流

微软官方开源项目，提供12堂系统课程学习AI智能体框架。涵盖工作流设计、RAG检索增强、多智能体协作等核心技能。适合AI

LLM资源合集（精选）

精选100+可直接运行的AI Agent和RAG应用集合。包含完整工作流示例、智能代理框架和检索增强生成系统。适合AI开

📰 相关 AI 新闻

🍿 AI 圈相关吃瓜

配了5个 MCP 工具，Claude 一个都没用

AI 圈观察

AutoGPT 自主完成了任务：把我的文件夹全部重命名了

AI 圈观察

Filesystem MCP 帮 Claude 找文件，找了整个 node_modules

🗺️ 相关解决方案

🧩 你可能还需要

基于当前 Skill 的能力图谱，自动补全的工具组合

技能寻求者

MCP · Agent · 工作流

开源AI工具：RAG知识库系统

基于Vue.js前端的RAG知识库系统，提供高效的知识检索和生成功能，助力AI应用开发

DeepCode Agent工作流

MCP · Agent · 工作流

total-agent-memory MCP工具

为Claude Code和Codex CLI提供持久化记忆功能的开源MCP工具。自动提取知识图谱，支持多轮对话上下文保留，适合需要长期记忆和

natively-cluely-ai-assistant — Claude Skill 中文使用文档

免费开源的AI面试助手，实时转录，隐蔽模式，局部RAG，BYOK。无订阅，防止数据泄露。

kb-arena开源AI工作流

kb-arena是开源的AI工作流，用于Benchmark 9 retrieval architectures（向量、上下文、QnA、知识图

❓ 常见问题 FAQ

支持哪些浏览器？−

支持Chrome、Firefox等主流浏览器，基于Selenium或Playwright等自动化库

可以处理动态加载的内容吗？+

如何与LLM集成？+

什么是 Agent 工作流？和普通自动化有什么区别？+

导入工作流后，我需要修改哪些配置？+

工作流运行失败了，如何排查问题？+

这个工作流每次运行会产生哪些费用？+

工作流可以定时自动运行吗？+

💡 AI Skill Hub 点评

总体来看，browser-use Agent工作流是一款质量优秀的Agent工作流，在同类工具中具备一定竞争力。AI Skill Hub 将持续追踪其更新动态，建议收藏备用，结合自身场景选择合适时机引入使用。

⬇️ 获取与下载

⬇ 下载源码 ZIP

✅ MIT 协议 · 可免费商用 · 直接从 aiskill88 服务器下载，无需跳转 GitHub

📚 深入学习 browser-use Agent工作流

查看分步骤安装教程和完整使用指南，快速上手这款工具

⚙️ 安装教程 📚 使用教程

🌐 原始信息

原始名称	`browser-use`
原始描述	开源AI工作流：🌐 Make websites accessible for AI agents. Automate tasks online with ease.。⭐93.6k · Python
Topics	`浏览器自动化AI智能体工作流网页自动化LLM应用`
GitHub	https://github.com/browser-use/browser-use
License	MIT
语言	Python

🔗 原始来源

🐙 GitHub 仓库 https://github.com/browser-use/browser-use 🌐 官方网站 https://browser-use.com

收录时间：2026-05-13 · 更新时间：2026-05-16 · License：MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。

browser-use Agent工作流

简介

uvx browser-use install # Run if you don't have Chromium installed

🤖 LLM Quickstart

👋 Human Quickstart

💡See [more examples here ↗](https://docs.browser-use.com/examples) and give us a star!

🚀 Template Quickstart

Demos

.env

GOOGLE_API_KEY=your-key

ANTHROPIC_API_KEY=your-key

Integrations, hosting, custom tools, MCP, and more on our [Docs ↗](https://docs.browser-use.com)

Open Source vs Cloud

FAQ

🤖 交给 Agent 安装 · browser-use Agent工作流