This repository is for CleanApp (http://cleanapp.io) backend development. CleanApp is a global observability network that turns real-world + digital problems into structured signals and automatically routes them to the organizations that can fix them.
If you want to understand CleanApp as a system, start here:
WHY → THEORY → INVARIANTS → ARCHITECTURE
These four files remain the canonical philosophy and system-design backbone.
For implementation reality and fast navigation, use these supplements to the canonical docs above:
- Current live system map: `docs/architecture/current-system-map.md`
- Shared domain vocabulary: `docs/architecture/domain-model.md`
- Key product and architecture decisions: `docs/decisions/decision-log.md`
- Fast onboarding index for agents and new engineers: `docs/agent-context/overview.md`
Topical implementation guides:
- Cases: `docs/cleanapp-cases.md`
- CleanApp Wire: `docs/cleanapp-wire.md`
- Security hardening: `docs/security-hardening-2026-03-11.md`
- Contact discovery: `docs/case-contact-discovery-2026-03-11.md`
We use a root Makefile as the primary entry point for common tasks:

```
make help
make gitleaks
make ci-analyzer
make hooks
```

Legacy root-level scripts/configs have been moved out of the repository root to reduce ambiguity/clutter:
- Local compose files: `conf/compose/`
- Nginx configs: `conf/nginx/`
- One-off/legacy scripts: `scripts/legacy/`
CleanApp also publishes a thin API client CLI for external developers and agents.
- npm package: `@cleanapp/cli`
- installed command: `cleanapp`
- default output: JSON (agent-friendly)
- optional human-readable mode: `--human`
Install:

```
npm i -g @cleanapp/cli
```

Quick start (env-var mode):

```
export CLEANAPP_API_URL="https://live.cleanapp.io"
export CLEANAPP_API_TOKEN="cleanapp_fk_live_..."
cleanapp auth whoami
cleanapp submit --title "Broken login" --desc "Users stuck on callback" --source-type web
cleanapp bulk-submit --file reports.ndjson
cleanapp status --report-id 123456
```

Interactive setup (human-friendly):

```
cleanapp init
```

Safety/debug flags available on all commands:
- `--dry-run` (print request payload, send nothing)
- `--trace` (HTTP trace to stderr with secret redaction)

Full CLI docs and examples: `cli/cleanapp/README.md`
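A safe first run can combine the quick-start environment variables with the safety flags above. This is a sketch, not authoritative usage: the token value is a placeholder, and the CLI calls are guarded so the snippet is a no-op when `cleanapp` is not on PATH.

```shell
# Hypothetical smoke test: verify auth, then rehearse a submit without
# sending anything (--dry-run) while tracing HTTP to stderr (--trace).
export CLEANAPP_API_URL="https://live.cleanapp.io"
export CLEANAPP_API_TOKEN="cleanapp_fk_live_placeholder"  # replace with a real key

# Guarded: only runs when the CLI is actually installed.
if command -v cleanapp >/dev/null 2>&1; then
  cleanapp auth whoami
  cleanapp submit \
    --title "Broken login" \
    --desc "Users stuck on callback" \
    --source-type web \
    --dry-run --trace
fi
```

Because `--dry-run` prints the request payload without sending it, this is a low-risk way to confirm the token and payload shape before real submissions.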
CleanApp Wire is the canonical machine-to-machine ingestion surface layered on top of the existing fetcher/quarantine pipeline.
It is intended for external agents, scrapers, watcher swarms, and internal automations that need:
- one stable envelope
- one-time API key issuance
- idempotent retries by `source_id`
- quarantine-first trust lanes for new agents
- receipts + status lookup without exposing raw internal tables
Primary endpoints:
POST /api/v1/agents/register
GET /api/v1/agents/me
GET /api/v1/agents/reputation/{agent_id}
POST /api/v1/agent-reports:submit
POST /api/v1/agent-reports:batchSubmit
GET /api/v1/agent-reports/receipts/{receipt_id}
GET /api/v1/agent-reports/status/{source_id}
GET /api/v1/openapi.yaml
GET /api/v1/docs
Behavior:
- New keys default to tier `0` and land in the quarantine/shadow lane.
- Quarantined reports are stored and analyzed but not publicly published by default.
- Promotion to public visibility remains an internal/admin action.
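The idempotent-retry behavior can be sketched with `curl`. The base URL and the envelope field names below are illustrative assumptions; `GET /api/v1/openapi.yaml` is the authoritative schema.

```shell
# Illustrative Wire submission; envelope field names are assumed, not authoritative.
SOURCE_ID="my-scraper:item-42"   # retries with the same source_id are idempotent
PAYLOAD='{"source_id":"'"$SOURCE_ID"'","title":"Broken streetlight","description":"Out for 3 days"}'

# Guarded: only sends when a real key is configured in the environment.
if [ -n "${CLEANAPP_API_TOKEN:-}" ]; then
  curl -sS -X POST "https://live.cleanapp.io/api/v1/agent-reports:submit" \
    -H "Authorization: Bearer ${CLEANAPP_API_TOKEN}" \
    -H "Content-Type: application/json" \
    -d "$PAYLOAD"
fi
```

A retried `POST` with the same `source_id` should return the same receipt rather than creating a duplicate report, which is what makes crash-and-retry loops in scrapers safe.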
OpenAPI specs:
- embedded service copy: `report-listener/handlers/openapi/cleanapp-wire.v1.yaml`
- top-level tooling copy: `openapi/cleanapp-wire.v1.yaml`
There are three environments:
- `local`: a local machine outside the cloud
- `dev`: a development machine in the cloud
- `prod`: a production machine in the cloud
- Make sure your local machine has Docker installed: https://docs.docker.com/engine/install/
- Make sure you're prepared to work with Google Cloud:
  - You have the necessary access to Google Cloud services. Ask project admins for it.
  - You have the gcloud command-line interface installed: https://cloud.google.com/sdk/docs/install
  - You are successfully logged in to gcloud: https://cloud.google.com/sdk/gcloud/reference/auth/login
- Build docker images on your local machine.
- Deploy services on the cloud or local machine.
- Modify the Docker image version if necessary: open `docker_backend/.version` and set the desired value of `BUILD_VERSION`.
- Run the build script from the `docker_backend` directory:
  ```
  cd docker_backend && ./build_image.sh
  ```
- Modify the Docker image version if necessary: open `docker_pipelines/.version` and set the desired value of `BUILD_VERSION`.
- Run the build script from the `docker_pipelines` directory:
  ```
  cd docker_pipelines && ./build_image.sh
  ```
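The version-bump step for either image can be sketched as below, using a scratch copy of the `.version` file for illustration; in the repo you would edit `docker_backend/.version` or `docker_pipelines/.version` directly and then run the build script.

```shell
# Sketch of pinning BUILD_VERSION before a build (scratch file, example version).
workdir=$(mktemp -d)
echo "BUILD_VERSION=2.1.7" > "$workdir/.version"   # set the desired version
. "$workdir/.version"                              # source it: exposes BUILD_VERSION
echo "would build image version $BUILD_VERSION"
# In the repo: cd docker_backend && ./build_image.sh
```

Because `.version` is a plain `KEY=value` file, sourcing it is enough for any tooling that needs the current version string.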
The deployment process includes:
- pulling Docker images and running four services:
  - cleanapp backend service
  - cleanapp referral service
  - MySQL database
  - cleanapp web service
- adding Google Cloud Scheduler jobs for the following processes:
  - referrals redeem
  - tokens disbursement
Pre-requisites:
- Linux (Debian/Ubuntu/...); this is tested on a Google Cloud Ubuntu VPS instance.
- Make sure gcloud is present on the cloud machine; it should be pre-installed by Google Cloud.
  - For installing on your local machine, make sure you have gcloud installed.
- Log in to the target machine.
  - On GCloud, go to the dashboard, pick the instance, and click SSH.
- Get `setup.sh` into the current directory, e.g. using:
  ```
  curl https://raw.githubusercontent.com/cleanappio/cleanapp_back_end_v2/main/setup/setup.sh > setup.sh && sudo chmod a+x setup.sh
  ```
- Run:
  ```
  ./setup.sh
  ```
It should be up and running now.
- Stopping:
  ```
  ./down.sh
  ```
- Restarting after a stop:
  ```
  ./up.sh
  ```
- Stopping with deletion of the database:
  ```
  sudo docker-compose down -v
  ```
- Refreshing images to the newly built versions:
  - Stop services.
  - Delete loaded images (`docker images` and `docker image` commands; you may need the `-f` flag).
  - If you need a different label or prefix, edit the `docker-compose.yaml` file.
  - (preferable) Load new images using `sudo docker pull`.
  - Restart services.
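The refresh steps above can be wrapped into a single function; the image name is a placeholder, not the real label (check `docker-compose.yaml` for the actual image names and tags).

```shell
# Refresh cycle from the steps above; the image name is illustrative only.
refresh_images() {
  sudo docker-compose down                  # stop services (volumes preserved)
  sudo docker pull cleanapp/backend:latest  # (preferable) load the new image
  sudo docker-compose up -d                 # restart services
}
# Run refresh_images on the target machine after building new images.
```

Note that plain `docker-compose down` keeps volumes, so the database survives a refresh; only `down -v` deletes it.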
- API server exposes port 8080.
- APP server exposes port 3000.
- MySQL DB uses port 3306 but currently does not expose it externally. Expose it if you want to connect to it from outside.
Caveat: the Google Cloud UI is not stable, so the instructions below may become obsolete. This is the status as of January 2024.
On the account level you need to create firewall rules "allow-8080" and "allow-3000"
Dashboard -> VPC Network -> Firewall, look at VPC Firewall Rules.
This page lists the available rules. At the top of the page (not near the table!) there is a "Create Firewall Rule" button.
- Name: allow-8080
- Description: Allow port 8080.
- Target tags: allow-8080
- Source filters, IP ranges: 0.0.0.0/0
- Protocols and ports: tcp:8080
- It's OK to leave the rest at their defaults.
Create another rule for port 3000 in the same way.
You are almost done. Now in Compute Engine > VM Instances select the one you want to use. Pick Edit at the top. Go to network tags and add "allow-8080" and "allow-3000". Save.
You are ready to deploy on this VM.
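If you prefer the CLI to the (unstable) Cloud Console UI, the same rules can be created with `gcloud`. The instance name and zone below are examples, not canonical values; substitute your own.

```shell
# CLI equivalent of the firewall setup above (instance/zone are examples).
create_cleanapp_firewall() {
  for port in 8080 3000; do
    gcloud compute firewall-rules create "allow-${port}" \
      --description="Allow port ${port}." \
      --allow="tcp:${port}" \
      --source-ranges="0.0.0.0/0" \
      --target-tags="allow-${port}"
  done
  # Attach the network tags to the target VM (name/zone are placeholders).
  gcloud compute instances add-tags cleanapp-1 \
    --tags=allow-8080,allow-3000 --zone=us-central1-a
}
```

This pairs each rule with a matching network tag, mirroring the UI steps: the rule only applies to instances carrying its tag.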
From outside try:
- http://dev.api.cleanapp.io:8080/help
- http://dev.api.cleanapp.io:8090/help
- http://dev.app.cleanapp.io:3000/help
In each case you should get a plain short welcome message with the CleanApp API/APP version. Remove the dev. prefix for the prod instance.
We picked
- E2 Low cost, day-to-day computing
- US-Central1 Iowa
- e2-medium (2 vCPU, 1 core, 4 GB memory)
- 10Gb Disk
- ubuntu-2004-focal-v20231101 (Canonical, Ubuntu 20.04 LTS, amd64 focal image built on 2023-11-01)
- HTTP/HTTPS allowed.
Currently we have three secrets per environment:
- MYSQL_APP_PASSWORD_<env>
- MYSQL_READER_PASSWORD_<env>
- MYSQL_ROOT_PASSWORD_<env>
where `<env>` is `LOCAL`, `DEV`, or `PROD`.
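Assuming these secrets live in Google Secret Manager (an assumption; ask project admins where they are actually stored), reading one from the CLI would look like this:

```shell
# Hypothetical secret lookup; assumes Secret Manager holds these values.
read_env_secret() {
  env="$1"   # LOCAL, DEV, or PROD
  gcloud secrets versions access latest --secret="MYSQL_APP_PASSWORD_${env}"
}
# Example: read_env_secret DEV
```

Keeping the `<env>` suffix out of scripts and passing it as a parameter makes the same tooling work across all three environments.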
- cleanapp-1 Dev instance, http://dev.api.cleanapp.io / http://dev.app.cleanapp.io point to this instance (external IP 34.132.121.53).
- cleanapp-prod Prod instance, http://api.cleanapp.io / http://app.cleanapp.io point to this instance (TODO: Create the machine and edit DNS)
- (update: December 15, 2025) The 10Gb disk needs to be adjusted upwards as web-scraping needs increase (e.g., Bluesky and upcoming RedditReader deployments); see Architecture.md.
| Service | Host Port | Container Port | Domain/Proxy |
|---|---|---|---|
| cleanapp_service (main API) | 8080 | 8080 | api.cleanapp.io:8080 |
| cleanapp_pipelines | 8090 | 8090 | api.cleanapp.io:8090 |
| cleanapp_web (legacy) | 3000 | 3000 | - |
| cleanapp_frontend | 3001 | 3000 | cleanapp.io (nginx) |
| cleanapp_frontend_embedded | 3002 | 3000 | - |
| Service | Host Port | Container Port | Notes |
|---|---|---|---|
| report_listener | 9081 | 8080 | Primary report API (live.cleanapp.io) |
| report_listener_v4 | 9097 | 8080 | Rust-based v4 API |
| report_analyze_pipeline | 9082 | 8080 | AI analysis pipeline |
| report_processor | 9087 | 8080 | Report processing |
| report_renderer_service | 9093 | 8080 | Image rendering |
| report_tags_service | 9098 | 8080 | Tag management |
| report_ownership_service | 9096 (prod), 9090 (dev) | 8080 | Ownership tracking |
| Service | Host Port | Container Port |
|---|---|---|
| auth_service | 9084 | 8080 |
| customer_service | 9080 | 8080 |
| gdpr_process_service | 9091 | 8080 |
| voice_assistant_service | 9092 | 8080 |
| Service | Host Port | Container Port |
|---|---|---|
| areas_service | 9086 | 8080 |
| devconnect_2025_areas | 9094 | 8080 |
| edge_city_areas | 9095 | 8080 |
| new_york_areas | 9088 | 8080 |
| montenegro_areas | 9083 | 8080 |
| red_bull_dashboard | 9085 | 8080 |
| Service | Host Port | Notes |
|---|---|---|
| cleanapp_db (MySQL) | 3306 | Primary database |
| cleanapp_rabbitmq | 5672, 15672 | Message queue |
- `bluesky_indexer` - Indexes Bluesky posts
- `bluesky_analyzer` - Analyzes posts with Gemini AI
- `bluesky_submitter` - Submits analyzed posts to report_listener
- `bluesky_now` - Real-time Bluesky stream
- `news_indexer_twitter` - Twitter/X indexing
- `replier_twitter` - Twitter reply bot
- `email_fetcher` - Email processing
For source changes, deploy backend services to production with the canonical source-build-and-pin path:

```
make deploy-prod-source HOST=deployer@34.122.15.16 SOURCE_SERVICES="report-listener customer-service"
```

For already-built `:prod` tags only, use:

```
make deploy-prod HOST=deployer@34.122.15.16
```

The hardened Go services no longer perform schema mutation at service boot. Use the explicit migration entrypoints instead:

```
./scripts/db/run_go_service_migrations.sh
```

This runs:
- `auth-service/cmd/migrate`
- `customer-service/cmd/migrate`
- `report-listener/cmd/migrate`
- `report-analyze-pipeline/cmd/migrate`
- `report-processor/cmd/migrate`
- `gdpr-process-service/cmd/migrate`
- `areas-service/cmd/migrate`
- `email-service/cmd/migrate`
- `report-ownership-service/cmd/migrate`
On fresh environments, run migrations before starting these services.
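A rough sketch of what the migration script does, assuming each listed service exposes a runnable `cmd/migrate`; `scripts/db/run_go_service_migrations.sh` remains the authoritative entrypoint and may differ in detail.

```shell
# Illustrative loop over the per-service migrate entrypoints listed above.
run_migrations() {
  for svc in auth-service customer-service report-listener \
             report-analyze-pipeline report-processor gdpr-process-service \
             areas-service email-service report-ownership-service; do
    echo "migrating: $svc"
    (cd "$svc" && go run ./cmd/migrate) || return 1   # stop on first failure
  done
}
```

Failing fast on the first broken migration keeps the schema in a known state instead of leaving later services half-migrated.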
For production code changes, the canonical release path is now source-build-and-pin on the prod VM:

```
make deploy-prod-source HOST=deployer@34.122.15.16 SOURCE_SERVICES="report-listener customer-service"
```

This will:
- Stage the exact git commit to the prod VM
- Build the selected services from that staged source on the VM
- Promote those freshly built image versions to `:prod`
- Run explicit Go migrations from the same staged source
- Resolve the pulled images to immutable digests
- Deploy via `platform_blueprint/deploy/prod/vm/deploy_with_digests.sh`
- Preserve a timestamped pinned manifest for rollback

`./build_image.sh -e prod` is deprecated and intentionally blocked because it only re-tagged an existing image and did not guarantee a fresh source build.

For already-built `:prod` tags, use the pull-only pinned deploy path instead:

```
make deploy-prod HOST=deployer@34.122.15.16
```

For the dev environment:

```
./setup.sh -e dev --ssh-keyfile ~/.ssh/id_ed25519
```

Builds and deploys just the frontend without touching other services:
```
cd cleanapp-frontend
./fastFEdeploy.sh -e prod
```

Takes ~7 minutes. Suitable for:
- UI/styling changes
- Component updates
- Configuration changes

Includes embedded frontend:

```
cd cleanapp-frontend
./build_images.sh -e prod --ssh-keyfile ~/.ssh/id_ed25519
```

Each microservice can still be built independently for dev with `./build_image.sh -e dev`, but production deployments should now go through the single source-build-and-pin flow:
```
make deploy-prod-source HOST=deployer@34.122.15.16 SOURCE_SERVICES="<service-directory>"
```

Examples:

```
make deploy-prod-source HOST=deployer@34.122.15.16 SOURCE_SERVICES="report-listener"
make deploy-prod-source HOST=deployer@34.122.15.16 SOURCE_SERVICES="report-analyze-pipeline report-tags"
```

| Service | Directory | Notes |
|---|---|---|
| report-listener | `report-listener/` | Main Go API for reports |
| report-listener-v4 | `report-listener-v4/` | Rust v4 API |
| auth-service | `auth-service/` | Authentication |
| report-analyze-pipeline | `report-analyze-pipeline/` | AI analysis |
| news-indexer-bluesky | `news-indexer-bluesky/` | Bluesky indexer/analyzer/submitter |
| email-service-v3 | `email-service-v3/` | Email notifications |
| report-processor | `report-processor/` | Report processing |
| areas-service | `areas-service/` | Geo-areas API |
| customer-service | `customer-service/` | Customer management |
| face-detector | `face-detector/` | Privacy face detection |
The Bluesky pipeline consists of 4 services:
bluesky_indexer → bluesky_analyzer → bluesky_submitter → report_listener
↓
bluesky_now (real-time stream)
```
cd news-indexer-bluesky
./build_images.sh -e prod
```

```
docker ps | grep bluesky
docker logs cleanapp_bluesky_indexer --tail 20
docker logs cleanapp_bluesky_analyzer --tail 20
docker logs cleanapp_bluesky_submitter --tail 20
```

```
docker start cleanapp_bluesky_indexer cleanapp_bluesky_analyzer cleanapp_bluesky_submitter
```

```
# All running containers
docker ps --format "table {{.Names}}\t{{.Status}}"

# Service logs
docker logs <container_name> --tail 50

# Database connection
docker exec -it cleanapp_db mysql -u server -p cleanapp
```

```
docker stop <container_name>
docker rm <container_name>
```

```
HOST=deployer@34.122.15.16 SERVICES="<service_name>" ./platform_blueprint/deploy/prod/vm/deploy_with_digests.sh
```

```
make deploy-prod HOST=deployer@34.122.15.16
```

| Domain | Backend |
|---|---|
| cleanapp.io | :3001 (frontend) |
| live.cleanapp.io | :9081 (report_listener) |
| auth.cleanapp.io | :9084 (auth_service) |
| processing.cleanapp.io | :9087 (report_processor) |
| areas.cleanapp.io | :9086 (areas_service) |
| email.cleanapp.io | email-service |
| renderer.cleanapp.io | :9093 (report_renderer) |
Each service has a `.version` file containing `BUILD_VERSION=x.y.z`.
To bump the version before building:

```
# Check current version
cat .version

# The build script auto-increments the minor version for dev builds
./build_image.sh -e dev
```
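The auto-increment behavior can be sketched in plain shell on a scratch file. This illustrates the minor-version bump the text describes; the real logic lives in `build_image.sh` and may differ.

```shell
# Illustrative minor-version bump on a scratch .version file.
f=$(mktemp)
echo "BUILD_VERSION=2.13.5" > "$f"            # example starting version
ver=$(sed -n 's/^BUILD_VERSION=//p' "$f")     # -> 2.13.5
major=${ver%%.*}                              # 2
rest=${ver#*.}                                # 13.5
minor=${rest%%.*}                             # 13
patch=${rest#*.}                              # 5
echo "BUILD_VERSION=$major.$((minor + 1)).$patch" > "$f"
cat "$f"                                      # BUILD_VERSION=2.14.5
```

Since `.version` is a single `KEY=value` line, parameter expansion is enough; no extra tooling is needed on the build machine.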