跳到正文 Skip to content
// DETERMINISTIC DATA GOVERNANCE

DataVault

AI 可以幻觉。数据处理不行。 AI can hallucinate. Data handling must not.

给 AI Agent 一套确定性 SOP。扫描、分类、PII 检测、本地硬门禁、链上指纹登记,被我设计成一条不会因为模型情绪波动而偏移的数据治理回路。 A deterministic SOP for AI agents. Scan, classify, detect PII, apply local hard gates, and register fingerprints on-chain through a governance loop that does not drift with model mood.

pip install odv
datavault — scan
$ datavault scan ~/Documents --save $ datavault scan ~/Documents --save
// 铁律 IRON RULE

标记为 HIGHCRITICAL 的文件永远不会被注册上链。硬编码逻辑,不可覆盖。 Files flagged HIGH or CRITICAL are never registered on-chain. Hard-coded logic, not overridable.

SCAN SHA-256 指纹 SHA-256 fingerprint
CLASSIFY 59 种扩展名 · 9 类别 59 extensions · 9 categories
CHECK 正则 PII 检测 Regex PII detection
REGISTER 哈希上链 Hash on-chain

前三步本地执行 · 免费离线 · 注册需 --confirm First three steps local · free offline · register requires --confirm

> 深入阅读:四阶段管线:指纹、分类、隐私检测与链上注册 read further: Four-Stage Pipeline: Fingerprint, Classify, Privacy Check & On-Chain Register

GDPR

原始文件永远不离开本地。只有 SHA-256 哈希上链——满足数据最小化 (Art. 5) 和被遗忘权 (Art. 17) 要求。 Raw files never leave local. Only SHA-256 hashes go on-chain — satisfying data minimization (Art. 5) and right to erasure (Art. 17).

CCPA

PII 自动检测 + 硬门禁。信用卡号、API Key 永远不会通过 CHECK 阶段——消费者有权知道哪些个人数据被处理。 Auto PII detection + hard gates. Credit cards, API keys never pass the CHECK stage — consumers have the right to know which personal data is processed.

可审计性 Auditability

确定性流程——相同输入,相同输出。所有扫描结果可复现,所有决策可追溯。零 AI 幻觉参与数据治理。 Deterministic pipeline — same input, same output. All scan results reproducible, all decisions traceable. Zero AI hallucination in data governance.

v0.2 版本 Version
44 测试通过 Tests
6 PII 模式 PII Patterns
5 风险等级 Risk Levels