经 AI Skill Hub 精选评估,脑电信号RAG检索系统 获评「推荐使用」。这款AI工具在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 6.5 分,适合有一定技术背景的用户使用。
专门为脑电(EEG)信号处理设计的检索增强生成系统。整合HuggingFace、NumPy、Pandas等框架,支持脑电数据的智能检索和生成。适合神经科学研究、脑机接口开发和医学信号处理领域的开发者。
脑电信号RAG检索系统 是一款基于 Python 开发的开源工具,专注于 RAG系统、脑电信号、神经科学 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
专门为脑电(EEG)信号处理设计的检索增强生成系统。整合HuggingFace、NumPy、Pandas等框架,支持脑电数据的智能检索和生成。适合神经科学研究、脑机接口开发和医学信号处理领域的开发者。
脑电信号RAG检索系统 是一款基于 Python 开发的开源工具,专注于 RAG系统、脑电信号、神经科学 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
# 方式一:pip 安装(推荐)
pip install eeg-rag
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install eeg-rag
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/hkevin01/eeg-rag
cd eeg-rag
pip install -e .
# 验证安装
python -c "import eeg_rag; print('安装成功')"
# 命令行使用
eeg-rag --help
# 基本用法
eeg-rag input_file -o output_file
# Python 代码中调用
import eeg_rag
# 示例
result = eeg_rag.process("input")
print(result)
# eeg-rag 配置文件示例(config.yml) app: name: "eeg-rag" debug: false log_level: "INFO" # 运行时指定配置文件 eeg-rag --config config.yml # 或通过环境变量配置 export EEG_RAG_API_KEY="your-key" export EEG_RAG_OUTPUT_DIR="./output"
<a id="top"></a>
Production-grade Retrieval-Augmented Generation for EEG research literature — multi-agent, medically cited, instantly queryable.
</div>
---
[!IMPORTANT] Research/Clinical Disclaimer: EEG-RAG is designed for research and educational purposes. All retrieved citations must be independently verified before clinical decision-making. This system is not a substitute for professional medical advice.
[!TIP] Get started in 5 minutes: pip install -e . && uvicorn eeg_rag.api.main:app --reload then visit http://localhost:8080/docs
---
EEG-RAG is an enterprise-ready, multi-agent RAG system built specifically for electroencephalography (EEG) research and clinical applications. It processes scientific literature from PubMed (35M+ papers), Semantic Scholar, arXiv, OpenAlex, ClinicalTrials.gov, and Europe PMC, then answers natural-language queries with verified, PMID-cited responses in under 2 seconds.
The problem it solves: EEG researchers spend 40-60% of their time searching literature. PubMed holds 150,000+ EEG papers, but there is no unified way to query that knowledge semantically, verify citations, or synthesize findings across studies.
Who it is for: Clinical EEG researchers, epileptologists, BCI engineers, cognitive neuroscientists, ML engineers working on neural data, and graduate students entering the field.
| <sub>Icon</sub> | <sub>Feature</sub> | <sub>Description</sub> | <sub>Impact</sub> | <sub>Status</sub> |
|---|---|---|---|---|
| <sub>🤖</sub> | <sub>**Multi-Agent System**</sub> | <sub>12 specialized AI agents work in parallel — see full agent table below</sub> | <sub>High</sub> | <sub>✅ Stable</sub> |
| <sub>🔍</sub> | <sub>**Hybrid Retrieval**</sub> | <sub>BM25 + Dense vectors + SPLADE learned sparse + Cross-encoder reranking with RRF fusion</sub> | <sub>High</sub> | <sub>✅ Stable</sub> |
| <sub>📡</sub> | <sub>**FastAPI Web Service**</sub> | <sub>REST API with 10 endpoints + Server-Sent Events (SSE) for real-time streaming progress</sub> | <sub>High</sub> | <sub>✅ Stable</sub> |
| <sub>✅</sub> | <sub>**Citation Verification**</sub> | <sub>Medical-grade PMID validation, hallucination detection, retraction checking</sub> | <sub>Critical</sub> | <sub>✅ Stable</sub> |
| <sub>🧠</sub> | <sub>**PubMedBERT Embeddings**</sub> | <sub>768-dim domain embeddings pre-trained on 14M PubMed abstracts; selectable via model_preset</sub> | <sub>High</sub> | <sub>✅ Stable</sub> |
| <sub>📥</sub> | <sub>**Multi-Source Ingestion**</sub> | <sub>PubMed, Semantic Scholar, arXiv, OpenAlex, ClinicalTrials.gov, Europe PMC (120K+ papers)</sub> | <sub>High</sub> | <sub>✅ Stable</sub> |
| <sub>🏥</sub> | <sub>**ClinicalTrials.gov**</sub> | <sub>EEG clinical trial data (epilepsy, BCI, neurofeedback, sleep) via REST v2 API with EEG relevance scoring</sub> | <sub>High</sub> | <sub>✅ New</sub> |
| <sub>🌍</sub> | <sub>**Europe PMC**</sub> | <sub>Open-access EEG literature via cursor-based pagination with full-text XML retrieval</sub> | <sub>High</sub> | <sub>✅ New</sub> |
| <sub>🔬</sub> | <sub>**ResearchAgent**</sub> | <sub>Parallel multi-source coordinator — PubMed + Semantic Scholar + Local in one call with dedup & evidence ranking</sub> | <sub>High</sub> | <sub>✅ New</sub> |
| <sub>🗂️</sub> | <sub>**SystematicReviewAgent**</sub> | <sub>Fully automated PRISMA-compliant systematic reviews: dedup → screen → grade → themes → gaps</sub> | <sub>High</sub> | <sub>✅ New</sub> |
| <sub>🩺</sub> | <sub>**ClinicalMatchingAgent**</sub> | <sub>Maps EEG patterns to clinical diagnoses using ACNS terminology, ICD-10 codes, evidence PMIDs and drug effect lookup</sub> | <sub>High</sub> | <sub>✅ New</sub> |
| <sub>📋</sub> | <sub>**CitationAgent**</sub> | <sub>Batch citation validation: impact scoring, retraction detection, ORCID linking, cross-reference checking, open-access status</sub> | <sub>Critical</sub> | <sub>✅ Stable</sub> |
| <sub>📊</sub> | <sub>**Bibliometrics Dashboard**</sub> | <sub>pyBiblioNet integration: trends, citation networks, KeyBERT NLP, Scopus export</sub> | <sub>Medium</sub> | <sub>✅ Stable</sub> |
| <sub>🔬</sub> | <sub>**NER System**</sub> | <sub>EEG Named Entity Recognition: 400+ terms across 12 categories (electrodes, bands, ERPs, conditions)</sub> | <sub>Medium</sub> | <sub>✅ Stable</sub> |
| <sub>🗂️</sub> | <sub>**Systematic Review (YAML)**</sub> | <sub>YAML-schema extraction, reproducibility scoring, temporal comparison vs Roy et al. 2019</sub> | <sub>Medium</sub> | <sub>✅ Stable</sub> |
| <sub>🏢</sub> | <sub>**Enterprise Security**</sub> | <sub>SVG/PDF malware scanning, prompt injection detection, SHA-256 audit trail, OpenTimestamps</sub> | <sub>Medium</sub> | <sub>🔄 Beta</sub> |
| <sub>🗄️</sub> | <sub>**Knowledge Graph**</sub> | <sub>Neo4j with Cypher queries: multi-hop reasoning across entities (PAPER, BIOMARKER, CONDITION, OUTCOME)</sub> | <sub>Medium</sub> | <sub>🔄 Beta</sub> |
| <sub>🚀</sub> | <sub>**Adaptive Query Routing**</sub> | <sub>Intelligent routing to optimal agents based on query complexity, 30% latency reduction</sub> | <sub>Medium</sub> | <sub>🟡 Planned</sub> |
<details> <summary>📋 All 330+ Requirements Covered — Click to Expand</summary>
</details>
<p align="right">(<a href="#top">back to top ↑</a>)</p>
---
<details> <summary>🛡️ Security, Compliance & IP Protection — Click to Expand</summary>
RESEARCHER_EMAIL=your.email@university.edu
python -m pytest tests/ -m integration -v
CHANGELOG.md entryVia pip (recommended): ```bash
pip install eeg-rag
pip install -r requirements-dev.txt
python scripts/run_ingestion.py --sources pubmed arxiv
```bash
cp .env.example .env
Edit `.env`:bash
NCBI_API_KEY=your_ncbi_key_here # https://www.ncbi.nlm.nih.gov/account/settings/ S2_API_KEY=your_s2_key_here # https://www.semanticscholar.org/product/api
OPENAI_API_KEY=sk-your-key-here OPENAI_MODEL=gpt-3.5-turbo
NEO4J_URI=bolt://localhost:7687 NEO4J_USER=neo4j NEO4J_PASSWORD=your_password REDIS_HOST=localhost REDIS_PORT=6379 ```
pip install "eeg-rag[api]"
```bash
| <sub>Endpoint</sub> | <sub>Method</sub> | <sub>Description</sub> |
|---|---|---|
<sub>/health</sub> | <sub>GET</sub> | <sub>Health check with per-agent status</sub> |
<sub>/metrics</sub> | <sub>GET</sub> | <sub>Performance metrics (latency, cache rate)</sub> |
<sub>/search</sub> | <sub>POST</sub> | <sub>Standard search with AI synthesis</sub> |
<sub>/search/stream</sub> | <sub>POST</sub> | <sub>**SSE streaming** — real-time progress</sub> |
<sub>/paper/details</sub> | <sub>POST</sub> | <sub>Fetch full paper metadata</sub> |
<sub>/paper/citations</sub> | <sub>POST</sub> | <sub>Citation network analysis</sub> |
<sub>/suggest</sub> | <sub>GET</sub> | <sub>Query autocomplete</sub> |
<sub>/query-types</sub> | <sub>GET</sub> | <sub>Available query categories</sub> |
<sub>/docs</sub> | <sub>GET</sub> | <sub>Swagger UI</sub> |
<sub>/redoc</sub> | <sub>GET</sub> | <sub>ReDoc documentation</sub> |
[!NOTE] Interactive docs available at http://localhost:8080/docs once the server is running. No API key required for retrieval-only queries.
<details> <summary>📡 Full curl Examples — Click to Expand</summary>
Standard search:
curl -X POST "http://localhost:8080/search" \
-H "Content-Type: application/json" \
-d '{"query": "deep learning EEG seizure detection", "max_results": 10, "synthesize": true}'
Streaming search (SSE):
curl -N -X POST "http://localhost:8080/search/stream" \
-H "Content-Type: application/json" \
-d '{"query": "P300 amplitude in Alzheimer disease", "max_results": 5}'
Paper details:
curl -X POST "http://localhost:8080/paper/details" \
-H "Content-Type: application/json" \
-d '{"pmid": "28215566"}'
Health check:
curl http://localhost:8080/health
</details>
<p align="right">(<a href="#top">back to top ↑</a>)</p>
---
```python import requests
| <sub>Component</sub> | <sub>Latency</sub> | <sub>Paradigm</sub> | <sub>Clinical Use</sub> |
|---|---|---|---|
| <sub>**P300**</sub> | <sub>~300ms</sub> | <sub>Oddball (target detection)</sub> | <sub>Working memory, BCI spellers</sub> |
| <sub>**N400**</sub> | <sub>~400ms</sub> | <sub>Semantic violation</sub> | <sub>Language disorders</sub> |
| <sub>**N170**</sub> | <sub>~170ms</sub> | <sub>Face stimulus</sub> | <sub>Face processing research</sub> |
| <sub>**MMN**</sub> | <sub>150–250ms</sub> | <sub>Deviant auditory stimulus</sub> | <sub>Pre-attentive processing, schizophrenia</sub> |
| <sub>**ERN**</sub> | <sub>50–100ms</sub> | <sub>Error response</sub> | <sub>Error monitoring, OCD</sub> |
The RRF score formula:
$$\text{RRF}(d) = \sum_{r \in R} \frac{1}{k + r(d)}$$
where $k=60$ (default), $r(d)$ is the rank of document $d$ in ranker $r$, and $R$ is the set of retrieval methods. This provably outperforms linear score combination (Cormack et al., 2009).
| <sub>Method</sub> | <sub>Latency</sub> | <sub>Recall@10</sub> | <sub>When to Use</sub> |
|---|---|---|---|
| <sub>BM25 baseline</sub> | <sub>~20ms</sub> | <sub>78%</sub> | <sub>Fast, exact-term queries</sub> |
| <sub>SPLADE learned sparse</sub> | <sub>~40ms</sub> | <sub>88%</sub> | <sub>Better quality needed</sub> |
| <sub>Dense (PubMedBERT)</sub> | <sub>~30ms</sub> | <sub>82%</sub> | <sub>Semantic / conceptual queries</sub> |
| <sub>Hybrid BM25 + Dense (RRF)</sub> | <sub>~60ms</sub> | <sub>91%</sub> | <sub>Best general baseline</sub> |
| <sub>Hybrid + Reranking</sub> | <sub>~160ms</sub> | <sub>95%</sub> | <sub>High-precision tasks</sub> |
| <sub>Model</sub> | <sub>PubMed NER F1</sub> | <sub>EEG Term Recall</sub> |
|---|---|---|
| <sub>BERT-base</sub> | <sub>0.78</sub> | <sub>72%</sub> |
| <sub>BioBERT</sub> | <sub>0.84</sub> | <sub>81%</sub> |
| <sub>**PubMedBERT**</sub> | <sub>**0.87**</sub> | <sub>**89%**</sub> |
| <sub>SciBERT</sub> | <sub>0.82</sub> | <sub>75%</sub> |
</details>
<p align="right">(<a href="#top">back to top ↑</a>)</p>
---
EEG-RAG 是一个面向 EEG 研究和临床应用的多代理 RAG 系统,处理科学文献并在 2 秒内回答自然语言查询。
EEG-RAG 的关键功能包括多代理系统、混合检索、BM25 + Dense 等特性,解决了检索和生成的复杂问题。
EEG-RAG 需要研究者身份认证、NCBI 认证和 S2 API 密钥等环境依赖和系统要求。
可以通过 pip 安装 EEG-RAG,或者使用 Docker 或源码部署方式。
使用 EEG-RAG 可以通过命令行或 API 接口进行操作,例如快速启动和 API 服务器启动。
EEG-RAG 的配置包括环境变量、MCP 和关键参数等,需要通过 .env 文件进行设置。
EEG-RAG 提供 REST API 服务器,支持健康检查、性能指标和 API 端点等功能。
EEG-RAG 的工作流包括多阶段管道,包括用户查询、查询扩展、初级检索、RRF融合和交叉编码等步骤。
垂直领域RAG应用,针对性强。但项目热度低、文档可能不完善,适合学术研究而非大规模生产应用。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ MIT 协议 — 最宽松的开源协议之一,可自由商用、修改、分发,仅需保留版权声明。
AI Skill Hub 点评:脑电信号RAG检索系统 的核心功能完整,质量良好。对于AI 技术爱好者来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。
| 原始名称 | eeg-rag |
| 原始描述 | 开源AI工具:EEG-RAG is a Retrieval-Augmented Generation (RAG) system specifically designed f。⭐6 · Python |
| Topics | RAG系统脑电信号神经科学检索增强生成生物信号处理 |
| GitHub | https://github.com/hkevin01/eeg-rag |
| License | MIT |
| 语言 | Python |
收录时间:2026-05-20 · 更新时间:2026-05-24 · License:MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。