Alejandro Rojas
Operator Guide · Chapter 10

Document filing rules

The harness assumes you'll file documents (PDFs, screenshots, emails) into the vault throughout the year. This chapter is the discipline for doing that without creating compliance or security problems.

The PII rule

The harness has one non-negotiable rule:

The PII Rule
Never write personally identifiable information into a generated .md file.

PII includes:

Where these appear in source documents (PDFs of bank statements, tax returns, payroll stubs), the source PDF stays in the vault as-is. But any .md file you generate from it replaces these with [on file].

Why

.md files get read by AI tools, may sync to backups or cloud services, may end up in search indices. Even if your vault is encrypted, the .md files are easier to leak than PDFs.

Treat .md as public-adjacent. Treat source PDFs as inert.

This rule has saved me from at least three potential leaks in the last 18 months.

The PII scan hook

The vault template ships with a pre-commit-style hook at _System/hooks/pii-scan.py. Before any AI tool writes a .md file, the hook scans the content for PII patterns:

If the hook detects PII, it blocks the write and tells the user to replace with [on file].

Install the hook before your first session.

The .md companion pattern

Every PDF you file in the vault gets a .md companion with the same basename:

2026-04-15_bank-statement-april.pdf # source — PII intact 2026-04-15_bank-statement-april.md # companion — PII redacted

The companion contains parsed structured data:

# Bank Statement — Acme Bank, Account [on file] **Statement period**: 2026-04-01 to 2026-04-30 **Beginning balance**: $42,318.55 **Ending balance**: $38,492.12 **Total deposits**: $24,150.00 **Total withdrawals**: $27,976.43 ## Transactions | Date | Description | Amount | |------------|-----------------------------------|------------| | 2026-04-02 | Direct deposit, payroll | +$3,500.00 | | 2026-04-03 | Acme Vendor invoice 2026-001 | -$1,247.50 | | 2026-04-05 | Transfer to savings | -$1,000.00 | | ... ## Notable - Two transactions over $5,000 on 04-08 and 04-22 (verify against expected outflows) - One ATM withdrawal in unfamiliar location (verify with cardholder)

The companion answers the questions you'd ask without re-opening the PDF: "what was the ending balance?", "what was that big transaction?"

The companion does NOT include the account number, routing number, or full statement detail. Those stay in the PDF.

Naming conventions

For files inside numbered categories:

Temporal documents (statements, invoices, contracts): YYYY-MM-DD_short-description.ext

Examples:

Topic documents (analyses, SOPs, frameworks): topic-name.md

Examples:

Lowercase, kebab-case, date-prefixed when relevant.

Filing by category — examples

A vendor invoice arrives

  1. Save the PDF: 04-resources/vendors/acme-vendor/invoices/2026-05-20_invoice-INV-2026-001.pdf
  2. Create a companion: 2026-05-20_invoice-INV-2026-001.md with:
    • Invoice date, due date, amount
    • Line items
    • Payment method (account references redacted)
    • Notes
  3. When you pay the invoice, move the PDF to 02-finance/invoices/paid/YYYY/ and update the companion to add the payment date.

A new contract gets signed

  1. Save the PDF: 05-legal/contracts/2026-05-20_acme-msa.pdf
  2. Create a companion with:
    • Parties
    • Effective date
    • Term and renewal terms
    • Key provisions
    • Termination notice period
    • Auto-renewal flag

If the contract is vendor-specific, you can also drop a pointer in 04-resources/vendors/acme/contracts/:

This contract is filed at: /05-legal/contracts/2026-05-20_acme-msa.pdf

A tax document

  1. Save the PDF: 02-finance/tax/2025/2026-04-15_2025-form-1040.pdf
  2. Create a companion: 2026-04-15_2025-form-1040.md with parsed data — but SSNs replaced with [on file], bank account numbers replaced with [on file].
  3. The companion should be enough to answer "what did I owe?" or "what was my AGI?" without re-opening the PDF.

A screenshot

Screenshots come from your daily work and often don't have an obvious category. Default: drop in today's daily-log directory with a descriptive name. File properly later if it ends up mattering:

12-daily-log/2026/05-May/2026-05-20_screenshot_dashboard-error.png

If three days later you decide the screenshot was important, move it to the right category. If not, it stays in daily-log forever and you can delete it during quarterly cleanup.

What about email?

Emails are the most common source of inbound documents. Two approaches:

Approach A: Save the email PDF/raw

Forward important emails to a personal address, or use your email client's "Save as PDF" function. File the PDF in the right category with a .md companion.

Approach B: Just save the parsed companion

If the email isn't worth preserving in original form, write only the companion: a .md with the date, sender, subject, key content. File in the right category.

For most operational emails, Approach B is sufficient. Reserve Approach A for emails you might need as legal evidence (contracts, disputes, complaints).

When to file vs. when to skip

Not everything deserves filing. A guide:

Always file
  • Tax documents (statutory retention requirements)
  • Contracts and legal agreements
  • Insurance policies
  • Bank / credit card statements (3 years minimum)
  • Employee records (federal retention rules vary)
Probably file
  • Vendor invoices (paid and unpaid)
  • Customer payments (refunds, chargebacks)
  • Major correspondence (anything you might cite later)
Skip filing
  • Marketing emails
  • Routine confirmations ("order placed," "package shipped")
  • One-time meeting notes (unless decisions were made — those go in daily log)
  • Drafts you never sent

When in doubt, drop it in today's daily-log directory and decide later.

Quarterly cleanup

Every quarter, spend an hour cleaning up:

Without quarterly cleanup, the vault accumulates clutter that makes it slower to navigate. With cleanup, the vault stays sharp.

Retention rules

Different documents have different retention requirements:

When you don't know the rule, default to 7 years.

The harness makes retention easy: organize by year (02-finance/tax/2025/), and at year-end you can archive entire year folders to cold storage.

What NOT to put in the vault

Some things should not live in your vault at all:

The vault is durable, backed up, and AI-readable. That's the wrong place for secrets.

Next chapter: anti-patterns I learned by stepping in them.