DMARC Report Reader - Architecture

Overview

DMARC Report Reader is a Manifest V3 Chrome/Edge browser extension that processes DMARC aggregate reports in XML format (plain, GZIP-compressed, or ZIP-archived) and presents them in a human-readable format with error diagnosis and recommendations.

Architecture Diagram

Components

Input Layer

Component	File	Description
Popup	`src/popup/popup.js`	Handles drag-drop and file picker input
Gmail Content Script	`src/content/gmail.js`	Detects DMARC attachments in Gmail, injects viewer button
Outlook Content Script	`src/content/outlook.js`	Detects DMARC attachments in Outlook Web, injects viewer button
Viewer Drop Zone	`src/viewer/viewer.js`	Accepts drag-drop directly in viewer

Processing Layer

Component	File	Description
Service Worker	`src/background/service-worker.js`	Central message hub, orchestrates processing
File Handler	`src/parser/file-handler.js`	Detects file format, extracts XML content
DMARC Parser	`src/parser/dmarc-parser.js`	Parses DMARC XML into structured JSON with statistics and alignment analysis
Classification	`src/parser/classification.js`	Heuristic analysis to distinguish spoofing vs misconfiguration
Provider Fingerprint	`src/services/provider-fingerprint.js`	Identifies email service providers from IP data (ASN, hostname, org)

External Libraries

Library	Purpose	Location
JSZip	ZIP file extraction	`lib/jszip.min.js`
pako	GZIP decompression	`lib/pako.min.js`

Services

Service	File	Description
IP Lookup	`src/services/ip-lookup.js`	Fetches geolocation and reverse DNS for source IPs
Provider Fingerprint	`src/services/provider-fingerprint.js`	Identifies ESPs (Google, Microsoft, SendGrid, etc.) from IP enrichment data

External API: ip-api.com (HTTPS, free tier, 45 requests/minute, batch endpoint for efficiency)

Session Caching: IP lookup results are cached in chrome.storage.session with 24-hour TTL and 5000 entry limit. Cache persists across viewer sessions within the same browser session.
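The TTL and entry limit described above can be illustrated with a small in-memory sketch. It is not the extension's actual code: the `IpCache` class, its method names, and the oldest-first eviction rule are assumptions, and a `Map` stands in for `chrome.storage.session` so the example runs outside a browser.

```javascript
const TTL_MS = 24 * 60 * 60 * 1000; // 24-hour TTL from the text above
const MAX_ENTRIES = 5000;           // entry limit from the text above

// Hypothetical cache; `now` is injectable so expiry can be exercised in tests.
class IpCache {
  constructor(now = () => Date.now()) {
    this.now = now;
    this.entries = new Map(); // a Map iterates in insertion order, oldest first
  }

  get(ip) {
    const hit = this.entries.get(ip);
    if (!hit) return undefined;
    if (this.now() - hit.storedAt > TTL_MS) {
      this.entries.delete(ip); // entry is older than the TTL
      return undefined;
    }
    return hit.value;
  }

  set(ip, value) {
    this.entries.delete(ip); // re-inserting moves the key to the newest position
    this.entries.set(ip, { value, storedAt: this.now() });
    // Assumed eviction rule: drop the oldest entries once the limit is exceeded.
    while (this.entries.size > MAX_ENTRIES) {
      const oldest = this.entries.keys().next().value;
      this.entries.delete(oldest);
    }
  }
}
```

In the extension itself the same bookkeeping would presumably be serialized into `chrome.storage.session`, which is cleared when the browser session ends.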

On-Demand Enrichment: Reports with more than 50 unique IPs prompt the user to choose between enriching immediately or skipping enrichment for now and running it later.
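A minimal sketch of the threshold check, assuming records carry the `sourceIp` field from the parsed JSON structure. The function name and return shape are hypothetical.

```javascript
const PROMPT_THRESHOLD = 50; // from the text above

// Hypothetical helper: decide whether to ask the user before enriching.
function shouldPromptBeforeEnrichment(records, threshold = PROMPT_THRESHOLD) {
  // A report can list the same IP in several records, so count distinct values.
  const uniqueIps = new Set(records.map((r) => r.sourceIp));
  return { uniqueIpCount: uniqueIps.size, prompt: uniqueIps.size > threshold };
}
```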

Output Layer

Component	Files	Description
Report Viewer	`src/viewer/viewer.html`, `viewer.js`, `viewer.css`	Full-page report display with filtering, sorting, export

Data Flow

File Processing Pipeline

Input File (XML/ZIP/GZ)
         │
         ▼
   ┌─────────────┐
   │ File Handler│ ─── Detect format via magic bytes
   └─────────────┘
         │
    ┌────┴────┬────────┐
    ▼         ▼        ▼
  [XML]    [GZIP]    [ZIP]
    │         │        │
    │    ┌────┘   ┌────┘
    │    ▼        ▼
    │  pako    JSZip
    │    │        │
    │    └───┬────┘
    │        │
    ▼        ▼
   ┌──────────────┐
   │ DMARC Parser │ ─── XML → Structured JSON + Alignment Analysis
   └──────────────┘
         │
         ▼
   ┌─────────────┐
   │ IP Lookup   │ ─── Enrich with geolocation, hostname, ASN, org
   └─────────────┘
         │
         ▼
   ┌──────────────────────┐
   │ Provider Fingerprint │ ─── Identify ESPs from enrichment data
   └──────────────────────┘
         │
         ▼
   ┌────────────────┐
   │ Classification │ ─── Spoof vs misconfiguration heuristics
   └────────────────┘
         │
         ▼
   ┌─────────────┐
   │ Viewer      │ ─── Render, filter, diagnose, analyze, export
   └─────────────┘
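The first pipeline step, format detection via magic bytes, could look roughly like the sketch below. The function name and return values are assumptions; the signatures themselves are the standard ones (`1f 8b` for GZIP, `PK\x03\x04` for a ZIP local file header).

```javascript
// Hypothetical helper: classify a file by its leading bytes.
// `bytes` is a Uint8Array (or plain array) of the file contents.
function detectFormat(bytes) {
  // GZIP streams start with 0x1f 0x8b.
  if (bytes.length >= 2 && bytes[0] === 0x1f && bytes[1] === 0x8b) return "gzip";
  // ZIP local file headers start with "PK" followed by 0x03 0x04.
  if (bytes.length >= 4 && bytes[0] === 0x50 && bytes[1] === 0x4b &&
      bytes[2] === 0x03 && bytes[3] === 0x04) return "zip";
  // Otherwise treat the input as XML if the first non-whitespace byte,
  // after an optional UTF-8 byte order mark, is "<".
  let i = 0;
  if (bytes.length >= 3 && bytes[0] === 0xef && bytes[1] === 0xbb && bytes[2] === 0xbf) i = 3;
  while (i < bytes.length && [0x20, 0x09, 0x0a, 0x0d].includes(bytes[i])) i++;
  if (i < bytes.length && bytes[i] === 0x3c) return "xml";
  return "unknown";
}
```

With this split, the "gzip" branch would hand the bytes to `pako.ungzip` and the "zip" branch to `JSZip.loadAsync`, matching the diagram above.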

Message Passing

The extension uses Chrome's messaging API for communication:

Content Script ──processAttachment──▶ Service Worker
       │                                    │
       │                                    ▼
       │                             File Extraction
       │                                    │
       │                                    ▼
       │                             chrome.storage.local
       │                                    │
       └──────────────────────────────────▶ Viewer Tab

Webmail Integration Flow

The extension injects a viewer button next to each DMARC attachment and fetches the attachment directly using the user's authenticated session.

Gmail - Email View Flow

Gmail Email View Load
         │
         ▼
   MutationObserver watches DOM
         │
         ▼
   Scan for elements with DMARC filenames (aria-label, data-tooltip)
         │
         ▼
   Inject blue chart button next to attachment
         │
         ▼
   On click: Find attachment download URL
         │
         ▼
   Fetch attachment data (with credentials)
         │
         ▼
   Send to Service Worker for extraction
         │
         ▼
   Open Viewer tab with parsed report
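The "DMARC filenames" step needs some way to recognize report attachments. One possible check, sketched below, relies on the naming convention that RFC 7489 recommends for aggregate reports (`receiver!policy-domain!begin!end[!unique-id].ext`). The regular expression and function name are assumptions, not the extension's actual matcher, and `.zip` and `.gz` are accepted in addition to the RFC's `.xml` and `.xml.gz` because the extension handles those formats.

```javascript
// Assumed pattern: receiver!policy-domain!begin!end, optional !unique-id, then the extension.
const DMARC_FILENAME = /^[^!\s]+![^!\s]+!\d+!\d+(![^!\s.]+)?\.(xml|xml\.gz|zip|gz)$/i;

// Hypothetical helper: true when an attachment name looks like a DMARC aggregate report.
function isDmarcReportFilename(name) {
  return DMARC_FILENAME.test(name.trim());
}
```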

Gmail - Inbox View Flow

Gmail Inbox View
         │
         ▼
   Scan attachment chips for DMARC filenames
         │
         ▼
   Inject button next to attachment chip
         │
         ▼
   On click: Store pending filename in sessionStorage
         │
         ▼
   Navigate to email (click row)
         │
         ▼
   Email loads → checkPendingFile() finds matching button
         │
         ▼
   Auto-click button → Process attachment → Open Viewer

Outlook Web Flow (Experimental)

Warning: Outlook Web integration is experimental and not fully tested. The DOM structure of Outlook Web changes frequently, which may break the integration.

Similar flow to Gmail, with selectors adapted for Outlook's DOM structure.

Key Design Decisions:

Direct fetch approach: Gmail allows fetching attachments via authenticated URLs constructed from thread IDs. The extension finds these URLs by traversing the DOM.
Inbox navigation: When clicking from inbox, the extension navigates to the email first (Gmail doesn't expose download URLs in inbox view), then auto-processes.
Tab positioning: Viewer tabs open immediately to the right of the current tab for easy reference.
Duplicate prevention: On DOM changes, previously injected buttons are cleared and the page is re-scanned, so each attachment gets at most one button.

Viewer Features

Summary Statistics

The parser calculates:

Total message count
DKIM/SPF pass/fail counts
Quarantine/reject counts
Pass rate percentages
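As a sketch of how these statistics could be derived, the function below weights every record by its `count` and uses the field names from the parsed JSON structure. The function name is hypothetical, and the definition of `overallPassRate` (at least one of DKIM or SPF passed policy evaluation) is an assumption, since the source does not define it.

```javascript
// Hypothetical summary calculation over parsed records.
function summarize(records) {
  const s = {
    totalMessages: 0, passedDkim: 0, failedDkim: 0, passedSpf: 0, failedSpf: 0,
    passedBoth: 0, failedBoth: 0, quarantined: 0, rejected: 0,
  };
  let passedEither = 0;
  for (const r of records) {
    const n = r.count; // one record can represent many messages
    const dkimOk = r.policyEvaluated.dkim === "pass";
    const spfOk = r.policyEvaluated.spf === "pass";
    s.totalMessages += n;
    if (dkimOk) s.passedDkim += n; else s.failedDkim += n;
    if (spfOk) s.passedSpf += n; else s.failedSpf += n;
    if (dkimOk && spfOk) s.passedBoth += n;
    if (!dkimOk && !spfOk) s.failedBoth += n;
    if (dkimOk || spfOk) passedEither += n;
    if (r.policyEvaluated.disposition === "quarantine") s.quarantined += n;
    if (r.policyEvaluated.disposition === "reject") s.rejected += n;
  }
  // Percentages are rounded to whole numbers; an empty report yields 0.
  const pct = (part) => (s.totalMessages === 0 ? 0 : Math.round((part / s.totalMessages) * 100));
  return {
    ...s,
    dkimPassRate: pct(s.passedDkim),
    spfPassRate: pct(s.passedSpf),
    overallPassRate: pct(passedEither), // assumed definition, see lead-in
  };
}
```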

Advanced Filtering

Filter	Description
Status	All / Pass / Fail / Quarantine / Reject
Domain	Substring match on From header domain
Source IP	Prefix match or CIDR notation (e.g., `192.168.1.0/24`)
Country	Dropdown populated from report data
Hostname	Substring match on reverse DNS
Provider	Dropdown of detected ESPs (Google, Microsoft, SendGrid, etc.)
Classification	Likely Spoof / Likely Misconfiguration / Unknown
Min Messages	Only show records with at least N messages

Sort	Description
Count (High-Low)	Most messages first
Count (Low-High)	Fewest messages first
IP Address	Alphabetical by IP
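The Source IP filter accepts either a prefix or CIDR notation. A minimal sketch of that matching follows, limited to IPv4 for the CIDR case; the function names are hypothetical and the extension's own handling (for example of IPv6) may differ.

```javascript
// Convert dotted-quad IPv4 text to a number, or null if it is not valid IPv4.
function ipv4ToInt(ip) {
  const parts = ip.split(".");
  if (parts.length !== 4) return null;
  let value = 0;
  for (const part of parts) {
    if (!/^\d{1,3}$/.test(part)) return null;
    const octet = Number(part);
    if (octet > 255) return null;
    value = value * 256 + octet;
  }
  return value;
}

// Hypothetical filter: "a.b.c.d/n" is treated as CIDR, anything else as a string prefix.
function matchesIpFilter(ip, filter) {
  const slash = filter.indexOf("/");
  if (slash === -1) return ip.startsWith(filter);
  const bitsText = filter.slice(slash + 1);
  if (!/^\d{1,2}$/.test(bitsText)) return false;
  const bits = Number(bitsText);
  const base = ipv4ToInt(filter.slice(0, slash));
  const addr = ipv4ToInt(ip);
  if (base === null || addr === null || bits > 32) return false;
  // Two addresses share a /n network when they fall in the same block of 2^(32-n).
  const blockSize = 2 ** (32 - bits);
  return Math.floor(addr / blockSize) === Math.floor(base / blockSize);
}
```

Plain arithmetic is used instead of bitwise operators because JavaScript bitwise operations work on signed 32-bit integers, which makes addresses above `127.255.255.255` awkward to compare.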

Top-N Analysis

The viewer calculates and displays:

Top Sending IPs: Highest volume senders with location info
Top Failing Domains: Domains with the most authentication failures
Top Countries: Geographic distribution of senders
Top Networks (ASN): ISPs and cloud providers by volume

Multi-Report ZIP Handling

When a ZIP file contains multiple DMARC reports:

File handler extracts all XML files
Viewer shows a modal for report selection
User can view individual reports or combine all
Combined view aggregates records with deduplicated analysis

Error Diagnosis

The viewer provides contextual diagnosis for:

Issue Type	Examples
DKIM Failures	Invalid signature, no signature, DNS errors
SPF Failures	Unauthorized IP, soft fail, no record, lookup limit
Alignment	Header/envelope From mismatch, domain not aligned
Disposition	Explains impact of quarantine/reject

Analytics Features

Enforcement Readiness Panel

Analyzes report data to recommend DMARC policy transitions:

Status	Alignment Rate	Recommendation
Safe	≥98%	Ready to move to stricter policy
Caution	90% to <98%	Review failing sources before proceeding
Not Ready	<90%	Fix configuration issues before enforcement

The panel is policy-aware, suggesting appropriate next steps based on current policy (none → quarantine → reject).
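The thresholds and the policy progression above can be expressed as a short function. This is a sketch only: the function name, the return shape, and the choice to suggest a next policy only in the Safe state are assumptions.

```javascript
// Policy progression from the text above: none -> quarantine -> reject.
const NEXT_POLICY = { none: "quarantine", quarantine: "reject", reject: null };

// Hypothetical helper: `alignmentRate` is a percentage from 0 to 100.
function enforcementReadiness(alignmentRate, currentPolicy) {
  let status;
  if (alignmentRate >= 98) status = "Safe";
  else if (alignmentRate >= 90) status = "Caution";
  else status = "Not Ready";
  const next = NEXT_POLICY[currentPolicy] ?? null;
  // Assumption: a stricter policy is only suggested once the rate is Safe.
  return { status, suggestedPolicy: status === "Safe" ? next : null };
}
```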

Classification Engine

Heuristic analysis distinguishes between:

Classification	Signals
Likely Spoof	Both auth fail, high volume, unknown sender, no legitimate ESP
Likely Misconfiguration	Known ESP, partial auth (DKIM or SPF pass), single message, aligned domain
Unknown	Insufficient signals for classification

The robustness value (high, medium, or low) indicates the confidence of a classification, based on the number of matching heuristics.
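To make the heuristic concrete, here is a rough sketch that counts signals on each side and picks the label with more matches. Only the signal names `known_esp`, `partial_auth`, and `aligned_domain` appear in this document; the other signal names, the `HIGH_VOLUME` threshold, the tie rule, and the robustness cut-offs are assumptions made for illustration.

```javascript
const HIGH_VOLUME = 100; // assumed threshold, not taken from the source

// Assumed mapping from number of matching signals to robustness.
const robustnessFor = (n) => (n >= 3 ? "high" : n === 2 ? "medium" : "low");

// Hypothetical classifier for a failing record (fields follow the parsed JSON structure).
function classify(record) {
  const dkimPass = record.policyEvaluated.dkim === "pass";
  const spfPass = record.policyEvaluated.spf === "pass";
  const knownEsp = Boolean(record.provider && record.provider.name);
  const aligned = Boolean(record.alignment &&
    (record.alignment.dkimAligned || record.alignment.spfAligned));

  const spoof = [];
  if (!dkimPass && !spfPass) spoof.push("both_auth_fail");
  if (record.count >= HIGH_VOLUME) spoof.push("high_volume");
  if (!knownEsp) spoof.push("unknown_sender");

  const misconfig = [];
  if (knownEsp) misconfig.push("known_esp");
  if (dkimPass !== spfPass) misconfig.push("partial_auth"); // exactly one mechanism passed
  if (record.count === 1) misconfig.push("single_message");
  if (aligned) misconfig.push("aligned_domain");

  // Assumed tie rule: equal evidence on both sides means no classification.
  if (spoof.length === misconfig.length) {
    return { label: "Unknown", signals: [], robustness: "low" };
  }
  const isSpoof = spoof.length > misconfig.length;
  const signals = isSpoof ? spoof : misconfig;
  return {
    label: isSpoof ? "Likely Spoof" : "Likely Misconfiguration",
    signals,
    robustness: robustnessFor(signals.length),
  };
}
```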

Provider Fingerprinting

Identifies email service providers from IP enrichment data:

Match Type	Source
ASN Match	AS number patterns (e.g., AS15169 → Google)
Hostname Match	Reverse DNS patterns (e.g., *.google.com)
Org Match	Organization name patterns (e.g., "Google LLC")

Supported providers: Google, Microsoft, Amazon SES, SendGrid, Mailchimp, Mailgun, Postmark, SparkPost, Salesforce, Zendesk, Freshdesk, Mimecast, Proofpoint, Barracuda, and more.
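A minimal sketch of the three match types, using only the Google patterns quoted in the table above. The table layout, the input field names (`asn`, `hostname`, `org`), and the ASN, then hostname, then org precedence are assumptions rather than the extension's actual rules.

```javascript
// Hypothetical pattern table; a real table would list many more providers.
const PROVIDERS = [
  { name: "Google", asns: ["AS15169"], hostname: /(^|\.)google\.com$/i, org: /google/i },
];

// `enrichment` is assumed to look like { asn: "AS15169 Google LLC", hostname: "...", org: "..." }.
function fingerprintProvider(enrichment) {
  for (const p of PROVIDERS) {
    // ASN strings often carry the org name after the number, so compare the first token only.
    const asn = enrichment.asn ? enrichment.asn.split(" ")[0].toUpperCase() : null;
    if (asn && p.asns.includes(asn)) return { name: p.name, matchType: "asn" };
    if (enrichment.hostname && p.hostname.test(enrichment.hostname)) {
      return { name: p.name, matchType: "hostname" };
    }
    if (enrichment.org && p.org.test(enrichment.org)) return { name: p.name, matchType: "org" };
  }
  return null; // no known provider matched
}
```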

Disposition Override Explanation

When receivers override DMARC policy, explains the reason:

Override Type	Explanation
`forwarded`	Mail was forwarded (SPF breaks on forwarding)
`mailing_list`	Mailing list modified the message
`local_policy`	Receiver applied local policy override
`sampled_out`	Message was exempted from the policy by the `pct` setting (pct < 100)
`trusted_forwarder`	Known trusted forwarder
`other`	Other receiver-specific reason

Export Formats

Format	Contents
JSON	Full structured report with all fields (respects active filters)
CSV	Flat table with key fields for spreadsheet analysis (respects active filters)
Raw XML	View and copy original XML source with syntax highlighting
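For the CSV export, the main detail is quoting cells that contain commas, quotes, or line breaks. The sketch below shows one way to flatten records; the column selection and function names are assumptions, and the caller is expected to pass the already-filtered records.

```javascript
// Assumed column set; the real export may include more fields.
const CSV_COLUMNS = ["sourceIp", "count", "disposition", "dkim", "spf", "headerFrom"];

// Quote a cell when it contains a comma, quote, or line break (RFC 4180 style).
function csvCell(value) {
  const text = String(value ?? "");
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Hypothetical exporter over records shaped like the parsed JSON structure.
function recordsToCsv(records) {
  const rows = records.map((r) => [
    r.sourceIp,
    r.count,
    r.policyEvaluated.disposition,
    r.policyEvaluated.dkim,
    r.policyEvaluated.spf,
    r.identifiers.headerFrom,
  ]);
  return [CSV_COLUMNS, ...rows].map((row) => row.map(csvCell).join(",")).join("\r\n");
}
```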

DMARC Report Structure

Input XML Schema

<feedback>
  <report_metadata>
    <org_name>google.com</org_name>
    <email>noreply-dmarc-support@google.com</email>
    <report_id>...</report_id>
    <date_range>
      <begin>1234567890</begin>
      <end>1234567890</end>
    </date_range>
  </report_metadata>
  <policy_published>
    <domain>example.com</domain>
    <adkim>r</adkim>
    <aspf>r</aspf>
    <p>quarantine</p>
    <sp>quarantine</sp>
    <pct>100</pct>
  </policy_published>
  <record>
    <row>
      <source_ip>192.0.2.1</source_ip>
      <count>10</count>
      <policy_evaluated>
        <disposition>none</disposition>
        <dkim>pass</dkim>
        <spf>pass</spf>
        <reason>...</reason>
      </policy_evaluated>
    </row>
    <identifiers>
      <header_from>example.com</header_from>
      <envelope_from>bounce.example.com</envelope_from>
    </identifiers>
    <auth_results>
      <dkim>
        <domain>example.com</domain>
        <selector>selector1</selector>
        <result>pass</result>
      </dkim>
      <spf>
        <domain>example.com</domain>
        <result>pass</result>
      </spf>
    </auth_results>
  </record>
</feedback>

Parsed JSON Structure

{
  metadata: {
    orgName: "google.com",
    email: "...",
    reportId: "...",
    dateRange: { begin: Date, end: Date }
  },
  policy: {
    domain: "example.com",
    adkim: "relaxed",    // expanded from the XML value 'r' (relaxed) or 's' (strict)
    aspf: "relaxed",     // expanded from the XML value 'r' (relaxed) or 's' (strict)
    policy: "quarantine",
    subdomainPolicy: "quarantine",
    percentage: 100
  },
  records: [
    {
      sourceIp: "192.0.2.1",
      count: 10,
      policyEvaluated: {
        disposition: "none",
        dkim: "pass",
        spf: "pass",
        reasons: [{ type: "forwarded", comment: "..." }]  // Disposition override
      },
      identifiers: {
        headerFrom: "example.com",
        envelopeFrom: "bounce.example.com",
        envelopeTo: "gmail.com"
      },
      authResults: {
        dkim: [{ domain, selector, result }],
        spf: [{ domain, scope, result }]
      },
      // Alignment analysis (computed by parser)
      alignment: {
        dkimAligned: true,         // DKIM domain aligns with header_from
        spfAligned: true,          // SPF domain aligns with header_from
        dkimPassed: true,          // At least one DKIM auth passed
        spfPassed: true,           // SPF auth passed
        overallAligned: true,      // DMARC alignment requirement met
        headerEnvelopeMismatch: false,
        primaryFailureReason: null // "no_dkim_aligned", "spf_not_aligned", etc.
      },
      // Classification (computed post-enrichment)
      classification: {
        label: "Likely Misconfiguration",  // or "Likely Spoof", "Unknown"
        signals: ["known_esp", "partial_auth", "aligned_domain"],
        robustness: "high"         // "high", "medium", "low"
      },
      // Provider info (computed from IP enrichment)
      provider: {
        name: "Google",
        matchType: "asn"           // "asn", "hostname", "org"
      }
    }
  ],
  summary: {
    totalMessages: 10,
    passedDkim: 10,
    failedDkim: 0,
    passedSpf: 10,
    failedSpf: 0,
    passedBoth: 10,
    failedBoth: 0,
    quarantined: 0,
    rejected: 0,
    dkimPassRate: 100,
    spfPassRate: 100,
    overallPassRate: 100
  }
}
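The `alignment` fields above depend on comparing each authenticated domain with `headerFrom` under the published `adkim`/`aspf` mode. The sketch below illustrates the idea under a loud simplification: relaxed mode should compare organizational domains via the Public Suffix List, whereas `naiveOrgDomain` here just keeps the last two labels (wrong for suffixes such as `co.uk`). All function names are hypothetical, and only a subset of the `alignment` fields is computed.

```javascript
// Simplification: a real implementation would consult the Public Suffix List.
function naiveOrgDomain(domain) {
  return domain.toLowerCase().split(".").slice(-2).join(".");
}

// mode is "strict" or "relaxed", as in the parsed policy object.
function domainsAlign(authDomain, headerFrom, mode) {
  const a = authDomain.toLowerCase();
  const h = headerFrom.toLowerCase();
  if (mode === "strict") return a === h; // exact match required
  return naiveOrgDomain(a) === naiveOrgDomain(h);
}

// Hypothetical alignment analysis for one record.
function analyzeAlignment(record, policy) {
  const from = record.identifiers.headerFrom;
  const passed = (e) => e.result === "pass";
  const dkimOk = (e) => domainsAlign(e.domain, from, policy.adkim);
  const spfOk = (e) => domainsAlign(e.domain, from, policy.aspf);
  const { dkim, spf } = record.authResults;
  return {
    dkimAligned: dkim.some(dkimOk),
    spfAligned: spf.some(spfOk),
    dkimPassed: dkim.some(passed),
    spfPassed: spf.some(passed),
    // DMARC needs one mechanism that both passes and aligns.
    overallAligned: dkim.some((e) => passed(e) && dkimOk(e)) ||
                    spf.some((e) => passed(e) && spfOk(e)),
  };
}
```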

Security Considerations

Content Security Policy: Manifest V3 enforces strict CSP
Local Processing: All file parsing happens client-side
Minimal Permissions: Only requests necessary host permissions
External API: Only IP addresses are sent to ip-api.com over HTTPS (no email content)
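Minimal Data Storage: Reports are parsed in memory and no report history is kept; storage is used only to hand extracted data to the viewer tab (chrome.storage.local) and to cache IP lookups (chrome.storage.session)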
Sandboxed Context: Content scripts run in isolated worlds
XSS Prevention: All user-controlled data (domains, IPs, hostnames) is escaped before HTML rendering
Message Validation: Service worker validates sender origin and message payload structure
Input Validation: File data is validated (size limits, byte value checks) before processing
Service Worker Retry: Content scripts handle MV3 service worker lifecycle with retry logic and visual feedback (connecting/processing/success/error states)
Service Worker Keep-Alive: Uses chrome.alarms API with 4-minute interval to prevent idle termination, plus visibility-change health checks in content scripts
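Two of the items above, XSS prevention and message validation, lend themselves to a short illustration. This is a sketch under assumptions: the message shape (`action`, `filename`, `data` as an array of byte values) and the `MAX_FILE_BYTES` limit are hypothetical, while `sender.id` is the extension ID that Chrome attaches to messages received through `chrome.runtime.onMessage`.

```javascript
const HTML_ESCAPES = { "&": "&amp;", "<": "&lt;", ">": "&gt;", '"': "&quot;", "'": "&#39;" };

// Escape report-controlled text (domains, IPs, hostnames) before inserting it into HTML.
function escapeHtml(value) {
  return String(value).replace(/[&<>"']/g, (ch) => HTML_ESCAPES[ch]);
}

const MAX_FILE_BYTES = 10 * 1024 * 1024; // assumed size limit, not from the source

// Hypothetical validator for a processAttachment message.
function isValidProcessMessage(message, sender, extensionId) {
  // Accept only messages that originate from this extension.
  if (!sender || sender.id !== extensionId) return false;
  if (!message || message.action !== "processAttachment") return false;
  if (typeof message.filename !== "string" || message.filename.length === 0) return false;
  const data = message.data;
  if (!Array.isArray(data) || data.length === 0 || data.length > MAX_FILE_BYTES) return false;
  // Every element must be a byte value in the range 0 to 255.
  return data.every((b) => Number.isInteger(b) && b >= 0 && b <= 255);
}
```

In a service worker, the validator would run at the top of the `chrome.runtime.onMessage` listener, with `chrome.runtime.id` passed as `extensionId`.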

Browser Compatibility

Chrome 88+ (Manifest V3 support; the chrome.storage.session cache requires Chrome 102+)
Edge 88+ (Chromium-based)
Firefox: Not supported (different extension API)
