经 AI Skill Hub 精选评估,Minutes会议记录助手 获评「强烈推荐」。已获得 1.2k 颗 GitHub Star,这款MCP工具在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 8.2 分,适合有一定技术背景的用户使用。
Minutes会议记录助手 是一款遵循 MCP(Model Context Protocol)标准协议的 AI 工具扩展。通过 MCP 协议,它可以让 Claude、Cursor 等主流 AI 客户端直接访问和操作外部工具、数据源和服务,实现 AI 能力的无缝扩展。无论是文件操作、数据库查询还是 API 调用,都可以通过自然语言在 AI 对话中直接触发,极大提升生产效率。
Minutes会议记录助手 是一款遵循 MCP(Model Context Protocol)标准协议的 AI 工具扩展。通过 MCP 协议,它可以让 Claude、Cursor 等主流 AI 客户端直接访问和操作外部工具、数据源和服务,实现 AI 能力的无缝扩展。无论是文件操作、数据库查询还是 API 调用,都可以通过自然语言在 AI 对话中直接触发,极大提升生产效率。
# 方式一:通过 Claude Code CLI 一键安装
claude skill install https://github.com/silverstein/minutes
# 方式二:手动配置 claude_desktop_config.json
{
"mcpServers": {
"minutes------": {
"command": "npx",
"args": ["-y", "minutes"]
}
}
}
# 配置文件位置
# macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
# Windows: %APPDATA%/Claude/claude_desktop_config.json
# 安装后在 Claude 对话中直接使用 # 示例: 用户: 请帮我用 Minutes会议记录助手 执行以下任务... Claude: [自动调用 Minutes会议记录助手 MCP 工具处理请求] # 查看可用工具列表 # 在 Claude 中输入:"列出所有可用的 MCP 工具"
// claude_desktop_config.json 配置示例
{
"mcpServers": {
"minutes______": {
"command": "npx",
"args": ["-y", "minutes"],
"env": {
// "API_KEY": "your-api-key-here"
}
}
}
}
// 保存后重启 Claude Desktop 生效
Open-source conversation memory. useminutes.app
Agents have run logs. Humans have conversations. minutes captures the human side — the decisions, the intent, the context that agents need but can't observe — and makes it queryable.
Record a meeting. Capture a voice memo on a walk. Ask Claude "what did I promise Sarah?" — and get an answer. Your AI remembers every conversation you've had.
Minutes is not just a meeting-notes app. It is local conversation infrastructure for agents: audio capture, transcripts, decisions, commitments, people, and provenance exposed through plain files, CLI commands, MCP tools, and live transcript streams.
Own every conversation you've ever had. Cloud meeting tools rent your own conversations back to you. Minutes writes every meeting to~/meetings/as plain markdown, which every AI you use (Claude Code, Codex, Gemini CLI, Cursor, OpenCode, Pi) reads directly. No SDK. No API key. No vendor to outlive. Ten years from now,grepstill works on your corpus. For agents → · Frontmatter schema →
<p align="center"> <img src="docs/assets/demo.gif" alt="minutes demo — record, dictate, phone sync, AI recall" width="750"> </p>
cargo install minutes-cli # macOS/Linux cargo install minutes-cli --no-default-features # Windows (see install notes below)
[summarization] engine = "mistral" mistral_model = "mistral-large-latest"
export CXXFLAGS="-I$(xcrun --show-sdk-path)/usr/include/c++/v1" cargo install --path crates/cli ```
Step 1: Create a sync folder — pick one that syncs between your phone and desktop:
```bash
claude plugin marketplace add silverstein/minutes claude plugin install minutes
winget install LLVM.LLVM [Environment]::SetEnvironmentVariable("LIBCLANG_PATH", "C:\Program Files\LLVM\bin", "User")
cargo install --path crates/cli
cargo install --path crates/cli --features metal
```bash
brew install ffmpeg # macOS
minutes setup --parakeet # Multilingual v3 (tdt-600m, ~1.2GB) minutes setup --parakeet --parakeet-model tdt-ctc-110m # English-only compact model (~220MB)
export CXXFLAGS="-I$(xcrun --show-sdk-path)/usr/include/c++/v1" export MACOSX_DEPLOYMENT_TARGET=11.0 cargo tauri build --bundles app --features parakeet,metal
cargo install tauri-cli --version 2.10.1 --locked cd tauri/src-tauri cargo tauri build --ci --bundles nsis --no-sign ```
Tagged GitHub releases can include both a Windows NSIS installer as minutes-desktop-windows-x64-setup.exe and a raw desktop binary as minutes-desktop-windows-x64.exe. The installer is currently unsigned, so treat it as an advanced-user / preview distribution surface until Windows signing is added.
The desktop app adds a system tray icon, recording controls, audio visualizer, Recall, and a meeting list window. The current Windows desktop build covers recording, transcription, search, settings, and Recall. Calendar suggestions, call detection, tray copy/paste automation, and the native dictation hotkey remain macOS-only for now.
Release workflow details live in:
For macOS development, use a dedicated signed dev app identity:
/Applications/Minutes.app (com.useminutes.desktop)~/Applications/Minutes Dev.app (com.useminutes.desktop.dev)If you are testing hotkeys, Screen Recording, Input Monitoring, or repeated macOS permission prompts, launch only Minutes Dev.app via ./scripts/install-dev-app.sh. Avoid the repo symlink ./Minutes.app, raw target/ binaries, or ad-hoc local bundles for TCC-sensitive testing.
This repository is open source, so local development does not require the maintainer's Apple signing credentials:
- ./scripts/install-dev-app.sh works with ad-hoc signing by default - for more stable macOS permission behavior across rebuilds, set MINUTES_DEV_SIGNING_IDENTITY to a consistent local codesigning identity - release signing and notarization remain maintainer/release workflows
For dictation, the recommended path is the standard shortcut in the desktop app (Cmd/Ctrl + Shift + D by default). The raw-key path for keys like Caps Lock is available as an advanced option but remains more fragile and permission-heavy.
Privacy: All Minutes windows are hidden from screen sharing by default — other participants on Zoom/Meet/Teams won't see the app. Toggle via the tray menu: "Hide from Screen Share ✓".
# Prevents whisper hallucination loops on non-English/noisy audio.
[summarization] engine = "none" # Default: Claude summarizes conversationally via MCP # "auto" = auto-detect an installed agent CLI for pipeline summaries # "agent" = uses your Claude Code, Codex, OpenCode, or Pi subscription (no API key) # "ollama" = local, free # "openai-compatible" = OpenRouter, Vercel/Cloudflare gateways, llama.cpp, LM Studio, etc. # "claude" / "openai" = direct API key (legacy) agent_command = "claude" # Which CLI to use when engine = "agent" (claude, codex, opencode, pi, etc.) ollama_url = "http://localhost:11434" ollama_model = "llama3.2" openai_compatible_base_url = "http://localhost:11434/v1" openai_compatible_model = "llama3.2" openai_compatible_api_key_env = "" # Blank means no Authorization header for local endpoints. Desktop cloud endpoints can still use a saved Keychain key without rewriting config.
[diarization] engine = "auto" # "auto" (default — uses pyannote-rs if models downloaded, otherwise skips), # "pyannote-rs" (always on — native Rust, no Python), # "pyannote" (legacy — requires pip install pyannote.audio), # "none" (explicitly disabled)
Minutes is designed as infrastructure for AI agents. Files are the durable substrate; MCP is the active interface; live transcript JSONL and the local event log are the real-time paths. The MCP server is the primary integration surface today:
list_meetings, search_meetings, get_meeting return structured JSONget_person_profile builds cross-meeting profiles with topics, open commitmentsconsistency_report flags conflicting decisions and stale commitmentsstart_recording, stop_recording, process_audio for pipeline controlstart_live_transcript, read_live_transcript for real-time mid-meeting accessminutes events --follow --since-seq N tails newline-delimited events, including finalized live utterances, for agents that want a durable cursorlist_voices, confirm_speaker for speaker identification workflowsminutes://meetings/recent, minutes://actions/open) for agent context injectionAny agent framework that speaks MCP can use Minutes as its conversation memory layer — the agent handles the intelligence, Minutes handles the recall.
TypeScript SDK — for direct programmatic access without MCP:
npm install minutes-sdk
import { listMeetings, searchMeetings, parseFrontmatter } from "minutes-sdk";
const meetings = await listMeetings("~/meetings", 20);
const results = await searchMeetings("~/meetings", "pricing");
Built with: Rust, whisper.cpp (transcription), pyannote-rs (speaker diarization), Silero VAD (voice activity detection), symphonia (audio decoding), cpal (audio capture), Tauri v2 (desktop app), ureq (HTTP). Optional: ffmpeg (recommended for non-English audio decoding).
```bash
If your phone workflow also saves a .json file alongside the audio (same name, .json extension), Minutes reads it for enriched metadata:
{"device": "iPhone", "source": "voice-memos", "captured_at": "2026-03-24T08:41:00-07:00"}
This adds device and captured_at to the meeting's frontmatter. Works with any automation tool (Apple Shortcuts, Tasker, etc.).
Supports .m4a, .mp3, .wav, .ogg, .webm. Format conversion is automatic — uses ffmpeg when available (recommended for non-English audio), falls back to symphonia.
If a desktop call capture leaves a raw file under ~/.minutes/native-captures/, process that audio file directly with minutes process <path> --type meeting. For compatibility, minutes import <audio-file> also routes to the same meeting-processing path; minutes import granola remains the Granola history importer.
```toml
Maintain a living knowledge base from your conversations — person profiles, decision history, and a chronological log that compounds over time. Inspired by Karpathy's LLM Wiki pattern.
[knowledge]
enabled = true
path = "~/wiki" # or your Obsidian vault, PARA system, etc.
adapter = "wiki" # "wiki" (flat markdown), "para" (atomic facts), "obsidian" (wiki + [[links]])
engine = "none" # "none" = structured YAML only (safest), "agent" = LLM extraction
min_confidence = "strong"
After each meeting, structured facts (decisions, action items, commitments) flow into person profiles automatically. Every fact carries provenance back to its source meeting.
minutes ingest --dry-run --all # Preview what would be extracted
minutes ingest --all # Backfill existing meetings
minutes ingest ~/meetings/call.md # Process a single meeting
Three output formats: - Wiki — people/{slug}.md with facts grouped by category - PARA — areas/people/{slug}/items.json with atomic facts (id, status, supersededBy) - Obsidian — Wiki format with [[wikilinks]] for cross-references
Safety: default engine = "none" extracts only from parsed YAML frontmatter. No LLM call, zero hallucination risk. Confidence thresholds filter speculative facts. Corrupt data is backed up, never silently destroyed.
minutes setup --diarization
[identity] name = "Your Name"
Optional — minutes works out of the box.
```toml
[transcription] engine = "whisper" # "whisper" (default), "parakeet" (opt-in, lower WER), or "apple-speech" (experimental) model = "small" # whisper: tiny (75MB), base, small (466MB), medium, large-v3 (3.1GB)
brew tap silverstein/tap && brew install minutes
brew tap silverstein/tap brew install minutes
brew upgrade silverstein/tap/minutes
No phone app needed. Record a thought on your phone, and it becomes searchable memory on your desktop. Claude even surfaces recent memos proactively — "you had a voice memo about pricing yesterday."
The watcher is folder-agnostic — it processes any audio file that lands in a watched folder. Pick the sync method that matches your setup:
| Phone | Desktop | Sync method |
|---|---|---|
| **iPhone** | **Mac** | iCloud Drive (built-in, ~5-30s) |
| **iPhone** | **Windows/Linux** | iCloud for Windows, or Dropbox/Google Drive |
| **Android** | **Any** | Dropbox, Google Drive, Syncthing, or any folder sync |
| **Any** | **Any** | AirDrop, USB, email — drop the file in the watched folder |
minutes is a native extension for the Claude ecosystem. No API keys needed — Claude summarizes your meetings when you ask, using your existing Claude subscription.
You: "Summarize my last meeting"
Claude: [calls get_meeting] → reads transcript → summarizes in conversation
You: "What did Alex say about pricing?"
Claude: [calls search_meetings] → finds matches → synthesizes answer
You: "Any open action items for me?"
Claude: [calls list_meetings] → scans frontmatter → reports open items
Install the plugin from the marketplace: ```bash
No speech detected / blank audio: The most common cause is microphone permissions. Check System Settings → Privacy & Security → Microphone and ensure your terminal app (or Minutes.app) has access.
tmux users: tmux server runs as a separate process that doesn't inherit your terminal's mic permission. Either run minutes record from a direct terminal window (not inside tmux), or use the Minutes.app desktop bundle which gets its own mic permission.
Build fails with C++ errors on macOS 26+: whisper.cpp needs the SDK include path. Set CXXFLAGS as shown above before building.
Dictation hotkey still fails after you enabled it in System Settings: The native hotkey uses macOS Input Monitoring, which is separate from Screen Recording. The fastest way to test the exact installed desktop identity is:
./scripts/diagnose-desktop-hotkey.sh "$HOME/Applications/Minutes Dev.app"
Use ./scripts/install-dev-app.sh first so you are testing the stable development app identity rather than a raw target/ build. The helper intentionally launches the app through LaunchServices; direct shell execution of Contents/MacOS/minutes-app --diagnose-hotkey can misreport TCC status.
minutes 是一个开源的会话记忆项目,旨在捕捉人类的决策过程和对话。它提供了一个开放的 API,允许开发者创建自己的应用和集成。
minutes 的功能包括捕捉人类决策过程、对话和会议记录,提供自动化的会议总结功能,以及与 Claude 集成的能力,允许开发者创建自己的应用和集成。
minutes 可以在任何平台上运行,包括 Windows、macOS 和 Linux,需要 Rust 和 cmake 运行环境。它还支持使用 Mistral API 和 Claude 集成。
安装 minutes 可以通过使用 Docker、pip 或源码进行部署。具体步骤包括创建一个同步文件夹、安装 minutes-cli 和配置环境变量。
使用 minutes 可以通过 CLI 或 GUI 方式进行操作。CLI 方式包括创建会议记录、搜索会议记录和同步会议记录等功能。GUI 方式包括创建会议记录、查看会议记录和配置环境变量等功能。
minutes 支持配置文件和环境变量的使用。配置文件可以用于配置 minutes 的行为和环境变量可以用于配置 minutes 的环境。例如,配置文件可以用于配置 minutes 的自动化总结功能和环境变量可以用于配置 minutes 的 Claude 集成。
minutes 提供了一个 API,允许开发者创建自己的应用和集成。API 支持 RESTful 风格的接口和 JSON 数据格式。
minutes 的工作流包括手机端和桌面端的会议记录流程。手机端可以通过 voice memo 录制会议记录,桌面端可以通过 minutes-cli 或 GUI 方式操作会议记录。minutes 还支持 Claude 集成,允许开发者创建自己的应用和集成。
minutes 的常见问题包括没有检测到语音、空白音频和 tmux 用户的问题。解决这些问题的方法包括检查麦克风权限、检查系统设置和配置环境变量等。
优质开源MCP工具,Rust高性能实现,会议转知识库方案实用。社区活跃,持续维护,值得采用。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ MIT 协议 — 最宽松的开源协议之一,可自由商用、修改、分发,仅需保留版权声明。
AI Skill Hub 点评:Minutes会议记录助手 的核心功能完整,质量优秀。对于Claude Desktop / Claude Code 用户来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。
| 原始名称 | minutes |
| 原始描述 | 开源MCP工具:Every meeting, every idea, every voice note — searchable by your AI. Open-source。⭐1.2k · Rust |
| Topics | 会议记录语音转文字知识库MCP协议AI搜索 |
| GitHub | https://github.com/silverstein/minutes |
| License | MIT |
| 语言 | Rust |
收录时间:2026-05-20 · 更新时间:2026-05-21 · License:MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端