
billing-data-aggregator

A modern Go-based batch service that aggregates billing data from the Octopus data warehouse and exports the results to S3. It is a clean rewrite of ed4-bi-batch-boil with improved architecture, better error handling, and enhanced observability.

Overview

Purpose: Daily aggregation of financial billing data for downstream BI/analytics processes.

Tech Stack:

  • Language: Go 1.24+
  • Database: PostgreSQL (Octopus DWH)
  • Storage: AWS S3
  • Runtime: AWS Batch (Docker containers)
  • Deployment: Terraform + FluxCD
  • CI/CD: GitHub Actions

Data Sources:

  • Tripica: Billing charges, account balances, dunning processes, write-offs
  • Bookkeeper: SAP bookings, journal entries, tax calculations

Quick Start

Local Development

# Clone and setup
git clone https://github.com/enercity/billing-data-aggregator.git
cd billing-data-aggregator
go mod download

# Run tests
go test ./...

# Build
go build -o dist/billing-data-aggregator ./cmd/aggregator

# Configure environment
export BDA_CLIENT_ID=enercity
export BDA_ENVIRONMENT=dev
export BDA_DB_HOST=localhost
export BDA_DB_PASSWORD=your-secret-password
export BDA_S3_BUCKET=billing-exports-dev

# Run
./dist/billing-data-aggregator

Docker

docker build -t billing-data-aggregator:local .

docker run --rm \
  -e BDA_CLIENT_ID=enercity \
  -e BDA_ENVIRONMENT=dev \
  -e BDA_DB_HOST=host.docker.internal \
  -e BDA_DB_PASSWORD=secret \
  -e BDA_S3_BUCKET=billing-exports-dev \
  billing-data-aggregator:local

Architecture

High-Level Data Flow

┌──────────────────────────────────────────────────────────────────┐
│                    Octopus Data Warehouse (PostgreSQL)           │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐               │
│  │ Tripica     │  │ Bookkeeper  │  │ Dunning     │               │
│  │ Vault       │  │ Vault       │  │ Vault       │               │
│  └──────┬──────┘  └──────┬──────┘  └──────┬──────┘               │
│         │                 │                 │                    │
│         └─────────────────┴─────────────────┘                    │
│                           ↓                                      │
│              ┌────────────────────────────┐                      │
│              │  billing-data-aggregator   │                      │
│              │  (Go Application)          │                      │
│              │                            │                      │
│              │  1. Init Scripts (setup)   │                      │
│              │  2. Processors (transform) │                      │
│              │  3. Export (CSV)           │                      │
│              │  4. Archive Scripts        │                      │
│              └────────────┬───────────────┘                      │
│                           ↓                                      │
│              ┌────────────────────────────┐                      │
│              │  report_oibl Schema        │                      │
│              │  - tripica_results         │                      │
│              │  - bookkeeper_results      │                      │
│              └────────────────────────────┘                      │
└──────────────────────────────────────────────────────────────────┘
                            ↓
                 ┌──────────────────────┐
                 │   AWS S3 Bucket      │
                 │   CSV Files          │
                 │   client/environment/│
                 └──────────────────────┘

Component Architecture

cmd/aggregator/main.go
    ↓
internal/config/          → Environment variable configuration
    ↓
internal/database/        → Connection pooling, script execution
    ↓
internal/processors/      → Business logic orchestration
    ├── tripica.go       → Tripica data processing
    └── bookkeeper.go    → Bookkeeper data processing
    ↓
internal/export/          → Result export
    ├── csv.go           → CSV file generation (chunked)
    └── s3.go            → S3 upload with retry logic
    ↓
scripts/                  → SQL transformation scripts
    ├── init/            → Data preparation (system-specific)
    └── archive/         → Export queries (final results)
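
For orientation, here is a hedged sketch of how cmd/aggregator/main.go might wire these layers together. It only uses the package functions shown in the Code Examples section below; the exact control flow of the real entry point may differ.

package main

import (
    "context"
    "log"

    "github.com/enercity/billing-data-aggregator/internal/config"
    "github.com/enercity/billing-data-aggregator/internal/database"
    "github.com/enercity/billing-data-aggregator/internal/processors"
)

func main() {
    ctx := context.Background()

    // 1. Load configuration from BDA_* environment variables.
    cfg, err := config.Load()
    if err != nil {
        log.Fatal(err)
    }

    // 2. Open the pooled database connection.
    db, err := database.NewConnection(cfg.ConnectionString(),
        cfg.DBMaxConnections, cfg.DBMaxIdleConns, cfg.DBConnMaxIdleTime)
    if err != nil {
        log.Fatal(err)
    }
    defer db.Close()

    // 3. Run the init scripts, then the per-system processors.
    executor := database.NewScriptExecutor(db, cfg.IgnoreSystems)
    if err := executor.ExecuteScriptsInDir(ctx, "scripts/init"); err != nil {
        log.Fatal(err)
    }
    for _, system := range cfg.Systems {
        var p processors.Processor
        switch system {
        case "tripica":
            p = processors.NewTripicaProcessor(db, executor, "scripts")
        case "bookkeeper":
            p = processors.NewBookkeeperProcessor(db, executor, "scripts")
        default:
            log.Fatalf("unknown system: %s", system)
        }
        if err := p.Process(ctx); err != nil {
            log.Fatalf("processor %s failed: %v", p.Name(), err)
        }
    }

    // 4. CSV export and S3 upload follow; see the Code Examples section.
}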

Configuration

All configuration is provided via environment variables with the BDA_ prefix:

Core Settings

# Required
BDA_CLIENT_ID=enercity              # Client identifier
BDA_ENVIRONMENT=prod                # Environment (dev/stage/prod)
BDA_DB_HOST=octopus.db.example.com  # PostgreSQL host
BDA_DB_PASSWORD=xxxxx               # Database password
BDA_S3_BUCKET=billing-exports       # S3 destination bucket

Database Settings

BDA_DB_PORT=5432                    # Default: 5432
BDA_DB_NAME=octopus                 # Default: octopus
BDA_DB_USER=billing_aggregator      # Default: billing_aggregator
BDA_DB_MAX_CONNS=4                  # Default: 4
BDA_DB_MAX_IDLE=0                   # Default: 0 (unlimited)
BDA_DB_MINUTES_IDLE=5               # Default: 5

Processing Settings

BDA_SYSTEMS=tripica,bookkeeper      # Default: tripica,bookkeeper
BDA_IGNORE_SYSTEMS=                 # Systems to skip (optional)
BDA_MAX_ROW_SIZE_FILE=1000000       # Rows per CSV file (default: 1M)
BDA_LOG_LEVEL=info                  # debug|info|warn|error

AWS Settings

BDA_S3_REGION=eu-central-1          # Default: eu-central-1
BDA_S3_URL=                         # Optional: Custom S3 endpoint
BDA_S3_ACCESS_KEY=                  # Optional: Explicit AWS credentials
BDA_S3_SECRET_ACCESS_KEY=           # Optional: Explicit AWS credentials

Complete Configuration Reference

| Variable | Required | Default | Description |
|----------|----------|---------|-------------|
| BDA_CLIENT_ID | Yes | - | Client identifier (enercity, etc.) |
| BDA_ENVIRONMENT | Yes | auto-detect | Environment: dev, stage, prod |
| BDA_LOG_LEVEL | No | info | Log level: debug, info, warn, error |
| BDA_DB_HOST | Yes | - | PostgreSQL hostname |
| BDA_DB_PORT | No | 5432 | PostgreSQL port |
| BDA_DB_NAME | No | octopus | Database name |
| BDA_DB_USER | No | billing_aggregator | Database username |
| BDA_DB_PASSWORD | Yes | - | Database password |
| BDA_DB_MAX_CONNS | No | 4 | Maximum concurrent connections |
| BDA_DB_MAX_IDLE | No | 0 | Max idle connections (0=unlimited) |
| BDA_DB_MINUTES_IDLE | No | 5 | Idle connection timeout (minutes) |
| BDA_S3_BUCKET | Yes | - | S3 bucket for CSV exports |
| BDA_S3_REGION | No | eu-central-1 | AWS region |
| BDA_S3_URL | No | - | Custom S3 endpoint (LocalStack, etc.) |
| BDA_S3_ACCESS_KEY | No | - | AWS access key (uses IAM if empty) |
| BDA_S3_SECRET_ACCESS_KEY | No | - | AWS secret key (uses IAM if empty) |
| BDA_SYSTEMS | No | tripica,bookkeeper | Comma-separated systems to process |
| BDA_IGNORE_SYSTEMS | No | - | Comma-separated systems to skip |
| BDA_MAX_ROW_SIZE_FILE | No | 1000000 | Maximum rows per CSV file |
| BDA_SCRIPTS_DIR | No | /app/scripts | Base directory for SQL scripts |

Project Structure

billing-data-aggregator/
├── cmd/
│   └── aggregator/
│       └── main.go                 # Application entry point
│
├── internal/                       # Private application packages
│   ├── config/                     # Configuration management
│   │   ├── config.go              # Environment variable loading
│   │   └── config_test.go         # Configuration tests
│   │
│   ├── database/                   # Database layer
│   │   ├── connection.go          # Connection pooling & retry logic
│   │   ├── scripts.go             # SQL script execution engine
│   │   └── database_test.go       # Database tests
│   │
│   ├── processors/                 # Business logic processors
│   │   ├── processor.go           # Processor interface
│   │   ├── tripica.go             # Tripica data processing
│   │   ├── bookkeeper.go          # Bookkeeper data processing
│   │   └── processor_test.go      # Processor tests
│   │
│   ├── export/                     # Export functionality
│   │   ├── csv.go                 # CSV generation with chunking
│   │   ├── s3.go                  # S3 upload with retry
│   │   └── export_test.go         # Export tests
│   │
│   ├── history/                    # Historical data management
│   ├── validators/                 # Pre-execution validation
│   └── ...                         # Future packages
│
├── scripts/                        # SQL transformation scripts
│   ├── init/                       # Data preparation scripts
│   │   ├── tripica/               # Tripica-specific transformations
│   │   └── bookkeeper/            # Bookkeeper-specific transformations
│   ├── archive/                    # Export/archive queries
│   │   ├── customer/              # Customer data exports
│   │   └── ...
│   ├── history/                    # Historization scripts
│   └── prechecks/                  # Validation/precheck scripts
│
├── terraform/                      # Infrastructure as Code
│   ├── _init.tf                   # Terraform initialization
│   ├── batch.tf                   # AWS Batch configuration
│   ├── s3.tf                      # S3 bucket setup
│   ├── iam.tf                     # IAM roles & policies
│   └── variables.tf               # Terraform variables
│
├── test/                           # Integration tests
│   ├── fixtures/                  # Test data
│   └── integration_test.go        # Integration test suite
│
├── .github/
│   └── workflows/
│       └── ci.yml                 # CI/CD pipeline
│
├── .golangci.yml                   # Linter configuration
├── .goreleaser.yaml                # Release automation
├── Dockerfile                      # Container image definition
├── go.mod                          # Go module definition
├── go.sum                          # Dependency checksums
└── README.md                       # This file

Code Examples

Configuration Loading

package main

import (
    "log"

    "github.com/enercity/billing-data-aggregator/internal/config"
)

func main() {
    // Load configuration from environment variables
    cfg, err := config.Load()
    if err != nil {
        log.Fatal(err)
    }

    // Access configuration
    log.Printf("Client: %s", cfg.ClientID)
    log.Printf("Environment: %s", cfg.Environment)
    log.Printf("DB: %s", cfg.Database.Host)

    // Build the connection string handed to database.NewConnection (see below)
    connStr := cfg.ConnectionString()
    _ = connStr
}

Database Connection

import (
    "context"
    "fmt"

    "github.com/enercity/billing-data-aggregator/internal/database"
)

// Create connection with pooling and retry logic
db, err := database.NewConnection(
    cfg.ConnectionString(),
    cfg.DBMaxConnections,    // 4
    cfg.DBMaxIdleConns,      // 0
    cfg.DBConnMaxIdleTime,   // 5 minutes
)
if err != nil {
    return fmt.Errorf("database connection failed: %w", err)
}
defer db.Close()

// Execute query
rows, err := db.QueryContext(ctx, "SELECT * FROM customers LIMIT 10")

Script Execution

import (
    "fmt"

    "github.com/enercity/billing-data-aggregator/internal/database"
)

// Create script executor with system filtering
executor := database.NewScriptExecutor(db, cfg.IgnoreSystems)

// Execute all scripts in a directory
// Scripts are executed per system, alphabetically sorted
if err := executor.ExecuteScriptsInDir(ctx, "scripts/init"); err != nil {
    return fmt.Errorf("init scripts failed: %w", err)
}

// Directory structure:
// scripts/init/
//   tripica/
//     110-charges.sql
//     120-balances.sql
//   bookkeeper/
//     100-bookings.sql

Processor Usage

import (
    "fmt"

    "github.com/enercity/billing-data-aggregator/internal/processors"
)

// Run configured processors
for _, system := range cfg.Systems {
    var processor processors.Processor

    switch system {
    case "tripica":
        processor = processors.NewTripicaProcessor(db, executor, "scripts")
    case "bookkeeper":
        processor = processors.NewBookkeeperProcessor(db, executor, "scripts")
    }

    if err := processor.Process(ctx); err != nil {
        return fmt.Errorf("processor %s failed: %w", processor.Name(), err)
    }
}
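
The example above relies on only two methods of the processor type. The interface defined in internal/processors/processor.go is assumed to look roughly like this sketch:

import "context"

// Processor is the assumed shape of the interface; the real definition
// lives in internal/processors/processor.go. Only the two methods used
// above are shown.
type Processor interface {
    // Name returns the system name, e.g. "tripica" or "bookkeeper".
    Name() string
    // Process runs the system's transformation pipeline.
    Process(ctx context.Context) error
}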

CSV Export

import (
    "fmt"
    "log"

    "github.com/enercity/billing-data-aggregator/internal/export"
)

// Create CSV exporter with chunking
exporter := export.NewCSVExporter(
    db.DB(),
    "/tmp/exports",        // Output directory
    cfg.MaxRowSizeFile,    // 1,000,000 rows per file
)

// Export table to CSV files
files, err := exporter.ExportTable(ctx, "tripica_results", "tripica")
if err != nil {
    return fmt.Errorf("export failed: %w", err)
}

// Result: tripica_tripica_results_0000.csv, _0001.csv, etc.
log.Printf("Exported %d files", len(files))

S3 Upload

import (
    "fmt"

    "github.com/enercity/billing-data-aggregator/internal/export"
)

// Create S3 uploader
uploader, err := export.NewS3Uploader(
    ctx,
    cfg.S3.Region,                                    // eu-central-1
    cfg.S3.Bucket,                                    // billing-exports
    fmt.Sprintf("%s/%s", cfg.ClientID, cfg.Environment), // enercity/prod
)
if err != nil {
    return err
}

// Upload files with retry logic (3 attempts)
if err := uploader.UploadFiles(ctx, files); err != nil {
    return fmt.Errorf("S3 upload failed: %w", err)
}

// S3 path: s3://billing-exports/enercity/prod/tripica_results_0000.csv

Development

Testing

The project uses a comprehensive testing strategy with multiple approaches:

Test Structure

  • Unit Tests: Testing individual components with mocks
  • Table-Driven Tests: Multiple scenarios in a single test
  • BDD/Gherkin Tests: Behavior-driven tests in German
  • Integration Tests: End-to-end testing with real dependencies

Quick Test Commands

# Run all tests
make test

# Run only unit tests
make test-unit

# Run BDD tests
make test-bdd

# Generate coverage report
make test-coverage
open coverage.html

# Run tests with race detector
go test -race ./...

# Run specific package
go test ./internal/config/... -v

# Watch mode (auto-rerun on changes)
make watch-test

Unit Tests

Unit tests use testify for assertions:

package config_test

import (
    "os"
    "testing"

    "github.com/enercity/billing-data-aggregator/internal/config"
    "github.com/stretchr/testify/assert"
    "github.com/stretchr/testify/require"
)

func TestLoad(t *testing.T) {
    // Setup
    os.Setenv("BDA_CLIENT_ID", "test-client")
    defer os.Unsetenv("BDA_CLIENT_ID")

    // Execute
    cfg, err := config.Load()

    // Assert
    require.NoError(t, err)
    assert.Equal(t, "test-client", cfg.ClientID)
    assert.Equal(t, 5432, cfg.Database.Port)
}

Table-Driven Tests

For testing multiple scenarios efficiently:

func TestValidate(t *testing.T) {
    tests := []struct {
        name    string
        cfg     *Config
        wantErr bool
        errMsg  string
    }{
        {
            name: "valid config",
            cfg: &Config{
                ClientID: "enercity",
                Database: DBConfig{Host: "localhost"},
                S3: S3Config{Bucket: "my-bucket"},
            },
            wantErr: false,
        },
        {
            name: "missing client ID",
            cfg: &Config{
                Database: DBConfig{Host: "localhost"},
            },
            wantErr: true,
            errMsg: "CLIENT_ID is required",
        },
    }

    for _, tt := range tests {
        t.Run(tt.name, func(t *testing.T) {
            err := tt.cfg.Validate()
            if tt.wantErr {
                assert.Error(t, err)
                assert.Contains(t, err.Error(), tt.errMsg)
            } else {
                assert.NoError(t, err)
            }
        })
    }
}

BDD/Gherkin Tests

Behavior-driven tests in German using godog:

Feature File (features/configuration.feature):

# language: de
Funktionalität: Konfiguration

  Szenario: Erfolgreiche Konfiguration laden
    Angenommen die Umgebung ist sauber
    Und die folgenden Umgebungsvariablen sind gesetzt:
      | Variable         | Wert              |
      | BDA_CLIENT_ID    | enercity          |
      | BDA_ENVIRONMENT  | dev               |
      | BDA_DB_HOST      | localhost         |
      | BDA_DB_PASSWORD  | secret            |
      | BDA_S3_BUCKET    | test-bucket       |
    Wenn ich die Konfiguration lade
    Dann sollte die Konfiguration erfolgreich geladen werden
    Und die Client-ID sollte "enercity" sein
    Und der Datenbankhost sollte "localhost" sein

Step Definitions (test/bdd_test.go):

func (c *ConfigurationSteps) ichDieKonfigurationLade() error {
    c.cfg, c.err = config.Load()
    return nil
}

func (c *ConfigurationSteps) sollteDieKonfigurationErfolgreichGeladenWerden() error {
    if c.err != nil {
        return fmt.Errorf("expected no error, got: %v", c.err)
    }
    if c.cfg == nil {
        return fmt.Errorf("expected config to be loaded, got nil")
    }
    return nil
}
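
The step functions are wired to the German step texts via a godog scenario initializer. The following is a minimal sketch; the step expressions are assumptions rather than copies of the real registration in test/bdd_test.go.

import "github.com/cucumber/godog"

// InitializeScenario registers the German step texts with their Go
// implementations. The expressions below are illustrative.
func InitializeScenario(ctx *godog.ScenarioContext) {
    steps := &ConfigurationSteps{}

    ctx.Step(`^ich die Konfiguration lade$`, steps.ichDieKonfigurationLade)
    ctx.Step(`^sollte die Konfiguration erfolgreich geladen werden$`,
        steps.sollteDieKonfigurationErfolgreichGeladenWerden)
}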

Running BDD Tests:

# Via Makefile
make test-bdd

# Direct with godog
godog run features/

# Specific feature
godog run features/configuration.feature

# With tags
godog run --tags=@unit features/

Test Coverage

The project maintains high test coverage:

# Generate coverage report
go test ./... -coverprofile=coverage.out
go tool cover -func=coverage.out

# HTML report
go tool cover -html=coverage.out -o coverage.html

# Via Makefile (opens browser)
make test-coverage

Coverage Targets:

  • internal/config: 90%+
  • internal/database: 80%+
  • internal/export: 85%+
  • internal/processors: 75%+

Integration Tests

Integration tests require a PostgreSQL database:

# Start test database with Docker
docker run -d \
  --name billing-test-db \
  -e POSTGRES_PASSWORD=test \
  -p 5432:5432 \
  postgres:15

# Run integration tests
export BDA_DB_HOST=localhost
export BDA_DB_PASSWORD=test
go test ./test/... -v

# Cleanup
docker stop billing-test-db
docker rm billing-test-db

Test Fixtures

Test data is stored in test/fixtures/:

test/fixtures/
├── sql/
│   ├── schema.sql              # Test database schema
│   └── seed.sql                # Test data
├── csv/
│   └── sample_export.csv       # Sample CSV data
└── config/
    └── test.env                # Test environment variables

Continuous Integration

Tests run automatically in GitHub Actions:

  • Unit Tests: On every push/PR
  • BDD Tests: On every push/PR (with continue-on-error: true)
  • Coverage Report: Uploaded as artifact
  • Test Summary: Displayed in PR comments

Test Best Practices

  1. Keep tests isolated: Use t.Parallel() where possible (see the sketch after this list)
  2. Use table-driven tests: For multiple scenarios
  3. Mock external dependencies: Database, S3, etc.
  4. Test error paths: Not just happy paths
  5. Use descriptive names: TestExportTable_WithLargeDataset_ShouldChunk
  6. Clean up resources: Use defer for cleanup
  7. Test concurrency: Use -race detector
  8. Keep tests fast: Mock slow operations
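
A minimal sketch combining practices 1, 2, and 5: a parallel, table-driven test. The parseSystems helper is hypothetical and exists only for this illustration.

package config_test

import (
    "strings"
    "testing"

    "github.com/stretchr/testify/assert"
)

// parseSystems is a hypothetical helper used only for this example.
func parseSystems(s string) []string {
    if s == "" {
        return nil
    }
    return strings.Split(s, ",")
}

func TestParseSystems(t *testing.T) {
    t.Parallel() // run alongside other parallel top-level tests

    tests := []struct {
        name  string
        input string
        want  []string
    }{
        {name: "two systems", input: "tripica,bookkeeper", want: []string{"tripica", "bookkeeper"}},
        {name: "empty input", input: "", want: nil},
    }

    for _, tt := range tests {
        t.Run(tt.name, func(t *testing.T) {
            t.Parallel() // subtests run in parallel too
            assert.Equal(t, tt.want, parseSystems(tt.input))
        })
    }
}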

Example: Complete Test

func TestCSVExporter_Export(t *testing.T) {
    // Setup: Create temporary directory
    tmpDir := t.TempDir()

    // Setup: Mock database
    db, mock, err := sqlmock.New()
    require.NoError(t, err)
    defer db.Close()

    // Setup: Define expected query and result
    rows := sqlmock.NewRows([]string{"id", "name", "amount"}).
        AddRow(1, "Customer A", 100.50).
        AddRow(2, "Customer B", 200.75)

    mock.ExpectQuery("SELECT .* FROM customers").WillReturnRows(rows)

    // Execute: Create exporter and export
    exporter := export.NewCSVExporter(db, tmpDir, 1000000)
    files, err := exporter.ExportTable(context.Background(), "customers", "test")

    // Assert: No errors
    require.NoError(t, err)
    assert.Len(t, files, 1)

    // Assert: File exists and has correct content
    content, err := os.ReadFile(files[0])
    require.NoError(t, err)
    assert.Contains(t, string(content), "Customer A")
    assert.Contains(t, string(content), "100.50")

    // Assert: All expectations met
    assert.NoError(t, mock.ExpectationsWereMet())
}

Testing Tips

Running Specific Tests:

# By name pattern
go test -run TestCSVExporter ./...

# By package
go test ./internal/export/

# Verbose output
go test -v ./...

# Show test names only
go test -v ./... | grep -E "^(PASS|FAIL|---)"

Debugging Tests:

# Print test output
go test -v ./... 2>&1 | tee test.log

# Run with debugger (dlv)
dlv test ./internal/config -- -test.run TestLoad

# Show coverage per function
go test -coverprofile=coverage.out ./...
go tool cover -func=coverage.out

Prerequisites

  • Go: 1.24 or later
  • Docker: For local testing
  • PostgreSQL: For database tests (optional)
  • golangci-lint: For linting
  • pre-commit: For Git hooks (optional)

Setup

# Install dependencies
go mod download

# Install development tools
go install github.com/golangci/golangci-lint/cmd/golangci-lint@latest

# Install pre-commit hooks (optional)
pip install pre-commit
pre-commit install

Linting

# Run all linters
golangci-lint run

# Run specific linter
golangci-lint run --disable-all --enable=errcheck

# Auto-fix issues
golangci-lint run --fix

Building

# Development build
go build -o dist/billing-data-aggregator ./cmd/aggregator

# Production build with optimizations
go build -ldflags="-s -w" -o dist/billing-data-aggregator ./cmd/aggregator

# Cross-compile for Linux
GOOS=linux GOARCH=amd64 go build -o dist/billing-data-aggregator ./cmd/aggregator

# Build Docker image
docker build -t billing-data-aggregator:$(git describe --tags) .

CI/CD Pipeline

Workflow Structure

The project uses three GitHub Actions workflows:

1. Documentation Workflow (.github/workflows/docs.yml)

Trigger: Push to main branch + manual dispatch

Purpose: Automatically generates and deploys project documentation to GitHub Pages

Steps:

  • Generates package documentation with go doc
  • Copies HTML templates from .github/templates/
  • Replaces placeholders (repo name, commit SHA, timestamp)
  • Deploys to GitHub Pages

Output: https://<username>.github.io/<repo>/

Example:

name: Documentation
on:
  push:
    branches: [main]
  workflow_dispatch:

jobs:
  build-docs:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-go@v5
        with:
          go-version: "1.24"
      - name: Generate Documentation
        run: |
          mkdir -p gh_pages
          go doc -all ./... > gh_pages/packages.txt
          cp .github/templates/*.html gh_pages/
      - uses: actions/deploy-pages@v4

2. Test Workflow (.github/workflows/test.yml)

Trigger: Every push/PR on all branches + manual dispatch

Purpose: Runs comprehensive test suite (unit + BDD tests)

Jobs:

  1. Unit Tests:

    • Runs all unit tests with go test
    • Generates coverage report
    • Uploads coverage as artifact (30 days retention)
    • Uses continue-on-error: true (non-blocking)
  2. BDD Tests:

    • Runs Gherkin/godog feature tests
    • Tests all feature files in features/
    • Uses continue-on-error: true (non-blocking)
  3. Test Summary:

    • Downloads coverage artifact
    • Creates summary in GitHub UI
    • Shows pass/fail status per test suite

Example Output:

## 📊 Test Zusammenfassung

| Test Suite | Status |
|------------|--------|
| Unit Tests | ✅ Passed |
| BDD Tests  | ⚠️ Completed with issues |

## 📈 Coverage
total: (statements) 87.3%

Why continue-on-error: true? Tests are informational during development. They don't block the workflow, allowing CI to complete even with test failures. This is useful during active development and BDD step implementation.

3. CI/CD Workflow (.github/workflows/ci.yml)

Trigger: All branches + tags

Purpose: Build, test, and deploy application

Stages:

  1. Code Quality (PRs only):

    • Commit message validation
    • Pre-commit hooks (formatting, linting)
  2. Build & Test (all branches):

    • Go version detection from go.mod
    • Unit tests
    • Binary compilation for Linux/amd64
    • Artifact upload
  3. Docker (all branches, push on tags):

    • Docker image build
    • Trivy security scan
    • ECR push (conditional)
  4. Infrastructure (iac/* tags only):

    • Terraform validation
    • Terraform plan
    • Terraform apply (manual approval)

Tag Strategy

| Tag Pattern | Trigger | Example |
|-------------|---------|---------|
| dev_* | Development deployment | dev_1.0.0 |
| stage_* | Staging deployment | stage_1.0.0 |
| prod_* | Production deployment | prod_1.0.0 |
| v* | Versioned release | v1.0.0 |
| iac/v* | Infrastructure-only update | iac/v1.2.0 |

Creating a Release

# Tag a development release
git tag dev_1.0.0
git push origin dev_1.0.0

# Tag a production release
git tag prod_1.0.0
git push origin prod_1.0.0

# Tag infrastructure update
git tag iac/v1.0.0
git push origin iac/v1.0.0

Deployment

Overview

The billing-data-aggregator uses a GitOps approach with FluxCD and Terraform:

  1. Source Code: GitHub Repository
  2. Container Images: AWS ECR (pushed by GitHub Actions)
  3. Infrastructure: Terraform (managed by FluxCD Terraform Controller)
  4. Execution: AWS Batch (scheduled via EventBridge)

Architecture

GitHub Repository
    ├── terraform/           → Terraform Code
    ├── flux/               → FluxCD Manifests
    ├── Dockerfile          → Container Definition
    └── .github/workflows/  → CI/CD Pipelines
        ↓
GitHub Actions (CI/CD)
    ├── Build & Test
    ├── Docker Build
    └── ECR Push (on tag)
        ↓
AWS ECR
    ├── iac/enercity/billing-data-aggregator  (Terraform as OCI)
    └── billing-data-aggregator               (Container Image)
        ↓
FluxCD (Kubernetes)
    ├── OCIRepository (watches ECR for Terraform updates)
    └── Terraform Resource (applies infrastructure)
        ↓
AWS Batch
    ├── Job Definition (Container + Resources)
    ├── Job Queue (Execution Queue)
    └── Compute Environment (EC2 Instances)
        ↓
EventBridge Schedule
    └── Daily 02:00 UTC → Submit Batch Job
        ↓
CloudWatch Logs
    └── /aws/batch/billing-data-aggregator

AWS Batch Runtime

The application runs as an AWS Batch job:

  • Schedule: Daily at 02:00 UTC (04:00 CET / 03:00 CEST)
  • Compute: EC2 (via Launch Template)
  • Memory: 2048 MB (adjustable in Terraform variables)
  • vCPUs: 1 (adjustable in Terraform variables)
  • Timeout: 2 hours (configured in Batch Job Definition)
  • Retries: 2 attempts on failure (exponential backoff)

Runtime Environment:

  • Container Image from ECR
  • Environment variables injected by Terraform
  • Secrets loaded from AWS Systems Manager Parameter Store
  • Logs streamed to CloudWatch Logs
  • S3 access via IAM Role

Terraform Infrastructure

Infrastructure is managed via Terraform in terraform/ directory.

Resources Created:

| Resource | Purpose | Configuration |
|----------|---------|---------------|
| AWS Batch Compute Environment | EC2 instances for job execution | Uses external Launch Template |
| AWS Batch Job Queue | Job submission queue | Priority 1, ENABLED state |
| AWS Batch Job Definition | Container configuration | Image, CPU, Memory, Env Vars |
| EventBridge Rule | Daily schedule trigger | Cron: cron(0 2 * * ? *) |
| IAM Role (Events) | EventBridge → Batch permissions | batch:SubmitJob |
| IAM Role (IRSA) | Kubernetes ServiceAccount | EKS migration support |
| CloudWatch Log Group | Job execution logs | /aws/batch/billing-data-aggregator |

Local Terraform Execution:

cd terraform/

# Initialize
terraform init

# Plan (with variables)
terraform plan \
  -var="batch_container_image=367771023052.dkr.ecr.eu-central-1.amazonaws.com/billing-data-aggregator:prod_1.0.0" \
  -var='batch_ce_subnet_ids=["subnet-xxx","subnet-yyy"]' \
  -var='batch_ce_security_group_ids=["sg-xxx"]' \
  -var="batch_launch_template_name=batch-launch-template-enercity-prod"

# Apply
terraform apply

Note: In production, Terraform is executed by the FluxCD Terraform Controller, not manually.

FluxCD Deployment

Deployment is handled by the FluxCD Terraform Controller (GitOps approach).

FluxCD Structure (flux/ directory):

flux/
├── app/
│   ├── kustomization.yaml      # FluxCD resource loader
│   ├── components.yaml         # Namespace definition
│   └── terraform.yaml          # Terraform Controller config
├── environment/
│   ├── billing-data-aggregator.yaml  # Environment integration
│   └── _versions.yaml          # Version management
└── README.md                   # FluxCD documentation

Workflow:

  1. Tag Terraform Code: git tag iac/v1.0.0 && git push origin iac/v1.0.0
  2. CI/CD Builds OCI Image: GitHub Actions packages Terraform as OCI artifact
  3. Push to ECR: OCI image pushed to iac/enercity/billing-data-aggregator
  4. FluxCD Detects Update: OCIRepository polls ECR for new versions
  5. Terraform Controller: Automatically runs terraform apply
  6. AWS Resources Updated: Batch Job Definition, Schedule, etc.

Version Management (flux/environment/_versions.yaml):

apiVersion: v1
kind: ConfigMap
metadata:
  name: service-versions
  namespace: flux-system
data:
  # Terraform version (semantic versioning)
  version_billing_data_aggregator_tf: "~ 1.0.0" # Accept 1.0.x

  # Container image tag
  container_image_tag: "prod_1.0.0"

Terraform Variables Injection (from FluxCD):

# flux/app/terraform.yaml
vars:
  - name: batch_container_image
    value: "367771023052.dkr.ecr.eu-central-1.amazonaws.com/billing-data-aggregator:${container_image_tag}"
  - name: batch_ce_subnet_ids
    valueFrom:
      kind: ConfigMap
      name: init
      key: subnet_private_ids
  - name: batch_env
    value: |
      {
        "BDA_CLIENT_ID": "${clientId}",
        "BDA_ENVIRONMENT": "${environment}"
      }

Integration with fluxcd-environment Repository:

The flux/ directory contents are referenced in the central FluxCD environment repository:

# In fluxcd-environment repository
flux-apps/service-stacks/billing-data-aggregator/
├── kustomization.yaml
├── components.yaml
└── terraform.yaml

Monitoring Deployment:

# Check Terraform Resource status
kubectl get terraform billing-data-aggregator -n flux-system

# View Terraform logs
kubectl logs -n flux-system \
  -l infra.contrib.fluxcd.io/terraform=billing-data-aggregator \
  --tail=100 --follow

# Check Terraform plan
kubectl describe terraform billing-data-aggregator -n flux-system

# View outputs
kubectl get secret billing-data-aggregator-tf-outputs -n flux-system -o yaml

Deployment Workflow

Development Deployment

# 1. Develop and test locally
make test
make lint

# 2. Commit changes
git add .
git commit -m "feat: add new feature"
git push

# 3. Tag for development
git tag dev_1.0.0
git push origin dev_1.0.0

# 4. GitHub Actions builds and pushes to ECR
# 5. Update FluxCD version
# In flux/environment/_versions.yaml
container_image_tag: "dev_1.0.0"

# 6. Commit and push
git commit -am "chore: update dev version"
git push

Production Deployment

# 1. Tag container image for production
git tag prod_1.0.0
git push origin prod_1.0.0

# 2. Tag Terraform infrastructure
git tag iac/v1.0.0
git push origin iac/v1.0.0

# 3. Update versions in FluxCD
# In flux/environment/_versions.yaml
version_billing_data_aggregator_tf: "~ 1.0.0"
container_image_tag: "prod_1.0.0"

# 4. Commit and push
git commit -am "chore: production release 1.0.0"
git push

# 5. FluxCD automatically applies changes
# 6. Monitor deployment
kubectl logs -n flux-system -l app.kubernetes.io/name=billing-data-aggregator --follow

# 7. Verify Batch Job
aws batch describe-job-definitions \
  --job-definition-name billing-data-aggregator-enercity-prod \
  --status ACTIVE

Rollback

# 1. Revert version in _versions.yaml
version_billing_data_aggregator_tf: "1.0.0"  # Previous version
container_image_tag: "prod_0.9.0"

# 2. Commit and push
git commit -am "chore: rollback to 0.9.0"
git push

# 3. FluxCD automatically applies rollback
# 4. Verify
kubectl get terraform billing-data-aggregator -n flux-system

Manual Job Execution

While jobs are scheduled automatically, you can trigger them manually:

# Submit job manually
aws batch submit-job \
  --job-name "billing-data-aggregator-manual-$(date +%s)" \
  --job-queue billing-data-aggregator-enercity-prod-queue \
  --job-definition billing-data-aggregator-enercity-prod

# Check job status
JOB_ID="<job-id-from-previous-command>"
aws batch describe-jobs --jobs $JOB_ID

# View logs
aws logs tail /aws/batch/billing-data-aggregator --follow

Environment-Specific Configuration

Configuration per client and environment is managed in terraform/configuration.tf:

locals {
  configuration = {
    default = {
      batch_enabled    = true
      schedule_enabled = true
    }
    enercity = {
      prod = {
        batch_enabled    = true
        schedule_enabled = true  # Daily automatic execution
      }
      stage = {
        batch_enabled    = true
        schedule_enabled = true  # Daily automatic execution
      }
    }
    lynqtech = {
      dev = {
        batch_enabled    = true
        schedule_enabled = false  # Manual execution only
      }
    }
  }
}

Troubleshooting Deployment

Terraform fails to apply:

# Check Terraform Controller status
kubectl describe terraform billing-data-aggregator -n flux-system

# View runner pod logs
kubectl logs -n flux-system \
  -l infra.contrib.fluxcd.io/terraform=billing-data-aggregator

# Check Terraform plan
kubectl get terraform billing-data-aggregator -n flux-system -o yaml

Container image not found:

# Verify ECR image exists
aws ecr describe-images \
  --repository-name billing-data-aggregator \
  --image-ids imageTag=prod_1.0.0

# Check ECR authentication
aws ecr get-login-password | docker login \
  --username AWS \
  --password-stdin 367771023052.dkr.ecr.eu-central-1.amazonaws.com

Schedule not triggering:

# Check EventBridge rule
aws events describe-rule \
  --name billing-data-aggregator-enercity-prod-schedule

# Enable rule if disabled
aws events enable-rule \
  --name billing-data-aggregator-enercity-prod-schedule

# Check rule targets
aws events list-targets-by-rule \
  --rule billing-data-aggregator-enercity-prod-schedule

Infrastructure Documentation

Detailed infrastructure documentation:

  • Terraform: See terraform/README.md
  • FluxCD: See flux/README.md
  • AWS Batch: See AWS Console or Terraform outputs

Security Considerations

Secrets Management:

  • Database passwords: AWS Systems Manager Parameter Store
  • AWS credentials: IAM Role (no hardcoded keys)
  • Container registry: ECR with IAM authentication

Network Isolation:

  • Batch compute in private subnets
  • Security groups restrict traffic
  • S3 access via VPC endpoint (optional)

Audit & Compliance:

  • CloudWatch Logs retention: 30 days (configurable)
  • CloudTrail logs all API calls
  • S3 bucket encryption: AES-256
  • Terraform state encryption: S3 server-side

Related Documentation

The deployment chain is documented across several places:

  1. Terraform code lives in the terraform/ directory
  2. A FluxCD HelmRelease triggers the Terraform run
  3. The Terraform Controller applies the infrastructure
  4. The AWS Batch job definition is updated with the new image

See the datalynq repository for the FluxCD configuration.

Monitoring & Observability

Logging

Structured JSON logging with zerolog:

{
  "level": "info",
  "service": "billing-data-aggregator",
  "client_id": "enercity",
  "environment": "prod",
  "batch_job_id": "abc-123",
  "message": "Processing tripica data",
  "timestamp": "2025-11-27T10:30:00Z"
}

Log levels:

  • DEBUG: Detailed execution flow, SQL queries
  • INFO: Normal operations, progress updates
  • WARN: Non-critical issues, retries
  • ERROR: Critical failures requiring attention
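
A minimal sketch of how such a logger could be constructed with zerolog, assuming the field names from the sample log line above; the actual setup in the service may differ.

package main

import (
    "os"

    "github.com/rs/zerolog"
)

// newLogger builds a zerolog logger that emits JSON records shaped like the
// sample above. The field names mirror that sample; this is illustrative only.
func newLogger(level, clientID, environment string) zerolog.Logger {
    if level == "" {
        level = "info"
    }
    lvl, err := zerolog.ParseLevel(level)
    if err != nil {
        lvl = zerolog.InfoLevel // fall back to info on unknown levels
    }

    return zerolog.New(os.Stdout).
        Level(lvl).
        With().
        Timestamp().
        Str("service", "billing-data-aggregator").
        Str("client_id", clientID).
        Str("environment", environment).
        Logger()
}

func main() {
    log := newLogger(os.Getenv("BDA_LOG_LEVEL"), os.Getenv("BDA_CLIENT_ID"), os.Getenv("BDA_ENVIRONMENT"))
    log.Info().Msg("Processing tripica data")
}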

Metrics

AWS CloudWatch metrics:

  • Job execution duration
  • Success/failure rate
  • Rows processed
  • CSV files generated
  • S3 upload size
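
If these metrics are published by the application itself (an assumption; they may equally be derived from Batch and S3 service metrics), the publishing code could look roughly like this sketch using the AWS SDK for Go v2, with a hypothetical namespace and metric name.

package main

import (
    "context"

    "github.com/aws/aws-sdk-go-v2/aws"
    awsconfig "github.com/aws/aws-sdk-go-v2/config"
    "github.com/aws/aws-sdk-go-v2/service/cloudwatch"
    "github.com/aws/aws-sdk-go-v2/service/cloudwatch/types"
)

// publishRowsProcessed pushes one custom metric to CloudWatch. The namespace
// and metric name are hypothetical and chosen only for this sketch.
func publishRowsProcessed(ctx context.Context, rows float64) error {
    cfg, err := awsconfig.LoadDefaultConfig(ctx)
    if err != nil {
        return err
    }

    client := cloudwatch.NewFromConfig(cfg)
    _, err = client.PutMetricData(ctx, &cloudwatch.PutMetricDataInput{
        Namespace: aws.String("BillingDataAggregator"),
        MetricData: []types.MetricDatum{
            {
                MetricName: aws.String("RowsProcessed"),
                Value:      aws.Float64(rows),
                Unit:       types.StandardUnitCount,
            },
        },
    })
    return err
}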

Alerts

CloudWatch alarms for:

  • Job failures (SNS notification)
  • Execution timeout
  • High error rate
  • S3 upload failures

Troubleshooting

Common Issues

Database connection timeout:

# Check network connectivity
psql -h $BDA_DB_HOST -U $BDA_DB_USER -d $BDA_DB_NAME

# Verify credentials
export BDA_DB_PASSWORD=xxxxx

S3 upload permission denied:

# Check IAM role permissions
aws sts get-caller-identity
aws s3 ls s3://$BDA_S3_BUCKET/

# Verify bucket policy
aws s3api get-bucket-policy --bucket $BDA_S3_BUCKET

Script execution failure:

# Enable debug logging
export BDA_LOG_LEVEL=debug

# Check script syntax
psql -f scripts/init/tripica/110-charges.sql

Debug Mode

Enable verbose logging:

export BDA_LOG_LEVEL=debug
./dist/billing-data-aggregator 2>&1 | tee debug.log

Health Checks

# Test database connectivity
./dist/billing-data-aggregator --health-check

# Validate configuration
./dist/billing-data-aggregator --validate-config

# Dry-run (no S3 upload)
export BDA_DRY_RUN=true
./dist/billing-data-aggregator

Migration from ed4-bi-batch-boil

This project replaces the legacy ed4-bi-batch-boil service.

Key Improvements

  • Go instead of Node.js: Better performance, type safety
  • Modular architecture: Clear separation of concerns
  • Retry logic: Automatic recovery from transient failures
  • Chunked CSV export: Memory-efficient large data handling
  • Structured logging: Better observability
  • Comprehensive tests: Higher code quality (unit + BDD)
  • Modern CI/CD: GitHub Actions workflows (test + docs + deploy)
  • IaC: Full Terraform infrastructure
  • GitHub Pages: Auto-generated documentation

Migration Checklist

  • Update environment variables (BDA_ prefix)
  • Migrate SQL scripts to new structure
  • Update Terraform configuration
  • Configure GitHub Actions secrets
  • Enable GitHub Pages in repository settings
  • Test with development environment
  • Update monitoring dashboards
  • Schedule parallel runs (old + new)
  • Validate data consistency
  • Decommission old service

Documentation

Project Documentation

All documentation is maintained in this repository:

  • README.md: This file - comprehensive project overview
  • GitHub Pages: Auto-generated API documentation (updated on main)
  • Feature Files: BDD specifications in features/ (German)
  • GoDoc Comments: Inline code documentation
  • Examples: See "Code Examples" section above

Accessing Documentation

GitHub Pages (auto-generated):

# View online after first workflow run
open https://<username>.github.io/<repo>/

# Local preview
go install golang.org/x/tools/cmd/godoc@latest
godoc -http=:6060
open http://localhost:6060/pkg/github.com/enercity/billing-data-aggregator/

Package Documentation:

# All packages
go doc -all ./...

# Specific package
go doc ./internal/config

# Specific function
go doc ./internal/config.Load

GoDoc Examples

All public functions include GoDoc comments following Google Go Style:

// Load reads configuration from environment variables with the BDA_ prefix.
// It returns an error if required variables are missing or invalid.
//
// Required environment variables:
//   - BDA_CLIENT_ID: Client identifier (e.g., "enercity")
//   - BDA_ENVIRONMENT: Environment name (dev/stage/prod)
//   - BDA_DB_HOST: PostgreSQL hostname
//   - BDA_DB_PASSWORD: Database password
//   - BDA_S3_BUCKET: S3 bucket for exports
//
// Example:
//
//	os.Setenv("BDA_CLIENT_ID", "enercity")
//	os.Setenv("BDA_ENVIRONMENT", "prod")
//	os.Setenv("BDA_DB_HOST", "db.example.com")
//	os.Setenv("BDA_DB_PASSWORD", "secret")
//	os.Setenv("BDA_S3_BUCKET", "billing-exports")
//
//	cfg, err := config.Load()
//	if err != nil {
//		log.Fatal(err)
//	}
//	fmt.Println(cfg.ClientID) // Output: enercity
func Load() (*Config, error) {
    // Implementation
}

Additional Resources

License

Proprietary - Enercity AG

Maintainers

Contributing

  1. Create feature branch: git checkout -b feature/my-feature
  2. Make changes and add tests
  3. Run linters: golangci-lint run
  4. Run tests: go test ./...
  5. Commit with conventional commits: feat(export): add CSV compression
  6. Create Pull Request
  7. Wait for CI checks and review
  8. Merge to main

Changelog

See CHANGELOG.md for release notes.
