Overview


System Architecture Diagram

This diagram illustrates the flow of data from an input email archive to the final frontend application.

graph TD
    subgraph "Input"
        A[PST / MBOX / EML File]
    end

    subgraph "Backend Processing (Python)"
        B[Python Extractor]
        C{Knowledge Graph}
        D[PII Redaction]
        E[Topic Clustering]

        B --> D
        D --> E
        E --> C
    end

    subgraph "Data Storage (Cloudflare)"
        F[R2 Storage for raw files]
        G[D1 Database for metadata]
    end

    subgraph "API Layer (Cloudflare)"
        H[API Worker]
    end

    subgraph "Frontend (Cloudflare)"
        I[Next.js Application]
    end

    A -- Upload --> B
    C -- Stores --> G
    B -- Stores original file --> F
    G -- Serves data to --> H
    H -- Provides REST API for --> I

    style A fill:#262626,stroke:#6366f1,stroke-width:2px
    style I fill:#262626,stroke:#22d3ee,stroke-width:2px

In short: PST / MBOX / EML file → Python Extractor → [Contacts, Messages, Topics] → D1 Database (original file kept in R2 Storage) → API Worker → Next.js Frontend
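
The backend stages in the diagram (extract, redact PII, group by topic, build the knowledge graph) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the project's actual code: the function names `extract`, `redact`, `topic_key`, and `build_graph` are hypothetical, the regex-based redaction and subject-based grouping are simple stand-ins for whatever the real PII Redaction and Topic Clustering stages do, and only a single EML message is parsed (with the standard library). PST and MBOX handling, R2 uploads, and D1 writes are not shown.

```python
import re
from collections import defaultdict
from email import message_from_string, policy
from email.utils import getaddresses

# Assumed, deliberately simple PII patterns (stand-ins for a real redactor).
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")
REPLY_PREFIX_RE = re.compile(r"^(?:(?:re|fwd?)\s*:\s*)+", re.IGNORECASE)


def extract(raw_eml: str) -> dict:
    """Extractor stage: parse one EML message into a plain record."""
    msg = message_from_string(raw_eml, policy=policy.default)
    senders = getaddresses(msg.get_all("From", []))
    recipients = getaddresses(msg.get_all("To", []))
    part = msg.get_body(preferencelist=("plain",))
    return {
        "sender": senders[0][1] if senders else "",
        "to": [addr for _, addr in recipients],
        "subject": str(msg["Subject"] or ""),
        "body": part.get_content() if part is not None else "",
    }


def redact(record: dict) -> dict:
    """PII stage (sketch): mask e-mail addresses and phone-like numbers in the body."""
    body = EMAIL_RE.sub("[EMAIL]", record["body"])
    body = PHONE_RE.sub("[PHONE]", body)
    return {**record, "body": body}


def topic_key(subject: str) -> str:
    """Topic stage (stand-in): group messages by subject without reply prefixes."""
    return REPLY_PREFIX_RE.sub("", subject).strip().lower()


def build_graph(records: list[dict]) -> dict:
    """Graph stage: count sender-to-recipient edges and index messages by topic."""
    graph = {"contacts": defaultdict(int), "topics": defaultdict(list)}
    for index, record in enumerate(records):
        for recipient in record["to"]:
            graph["contacts"][(record["sender"], recipient)] += 1
        graph["topics"][topic_key(record["subject"])].append(index)
    return graph
```

To mirror the diagram's order (B → D → E → C), pass each parsed message through `redact` before calling `build_graph`. Note that this sketch only redacts message bodies; sender and recipient addresses still appear in the contact edges, and whether the real pipeline masks those is not stated here.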