Root Sustainability – AI Engineer Technical Assessment

Hey there 👋

First off, we put together this assignment especially for this interview, so if you think things are unclear, don't hesitate to ask us questions.

This assessment represents a slice of the address matching problem we face at Root Sustainability.

The system works with messy input data and needs to:

Find plausible address matches using external geocoding data
Score match quality in a meaningful and interpretable way
Surface uncertainty and data-quality issues
Support both automated and human-review workflows

You will extend a backend that:

Talks to the Mapbox geocoding API to find candidate address matches
Computes a match score between the original address and the best matched address
Exposes a small HTTP API that the provided React frontend consumes

The repository is organized as:

frontend/ – React app (Vite + TypeScript) to visualise and manage addresses
backend/ – FastAPI starter backend that you will extend
data/ – Test data to validate your solution
README.md – this file

1. Assessment goals

This assignment is not about building the perfect model. It's about showing how you reason about an applied AI problem and translate that reasoning into working software.

You will:

Build an address similarity function returning a score between 0.0 and 1.0
Do some experimentation and document your reasoning
Integrate your solution into the existing frontend/backend setup

Approximate time budget: 2–3 hours, if you run out of time, focus on 2.1/2.2

2. Assignment

2.1 Wire up Mapbox properly

Get and configure a Mapbox access token
(Mapbox requires a credit card even for the free tier; if you prefer not to, contact us and we’ll provide a token.)
Explore the API and decide how to select the “best” match
Implement your solution in backend/mapbox_client.py

2.2 Research and improve the address similarity function

We have provided a baseline implementation of the address similarity function in backend/similarity.py.

The address similarity function should:

Return a value in [0.0, 1.0]
Represent whether two addresses likely point to the same real-world entity
Be reasonably robust to:
- Language differences
- Spelling variations
- Capitalization
- Formatting differences

We would like you to:

Explore multiple approaches qualitatively
Compare them quantitatively using the dataset in data/addresses.csv
Pick one final approach, explain why, and implement it in backend/similarity.py
Document what you tried, what you chose, and what you would explore next in EXPERIMENTS.md

We care at least as much about your reasoning as about the final score.

2.3 Polish & explore

If you have time left, feel free to:

Do some API or code polishing
Play with your solution via the frontend

Pay attention to how the system behaves in practice:

Latency, failures, or confusing behaviour
Ambiguous input or over-confident scores

We’ll discuss any limitations you might have noticed during the interview.

3. Backend (FastAPI starter)

A minimal FastAPI backend starter is provided in backend/. It includes:

Data models for addresses
An SQLite database and SQLAlchemy ORM model
Endpoint skeletons matching the contract below
A naive Mapbox integration and similarity baseline

3.1 Data model

An address record has the following shape:

{
  "id": 1,
  "address": "Parijs, Frankrijk",
  "matched_address": "Paris, France",
  "match_score": 0.98
}

Where:

address – raw input address as entered by a user
matched_address – the best match returned by Mapbox
match_score – a float in [0, 1] indicating how likely these refer to the same real-world entity
(1.0 = clearly the same, 0.0 = clearly not)

3.2 API contract

Your backend should implement the following endpoints:

List all addresses
- GET /addresses
- Response: 200 OK with Address[]
Create an address
- POST /addresses
- Body:
```
{
  "address": "string"
}
```
- Behaviour:
  - Call Mapbox to find the best match
  - Calculate a similarity score
  - Store the record
  - Return 201 Created (or 200 OK) with the full Address object
Get a single address
- GET /addresses/{id}
- Response: 200 OK with Address
- 404 if not found
Update an address
- POST /addresses/{id}
- Body:
```
{
  "address": "string"
}
```
- Behaviour:
  - Recalculate Mapbox match and similarity score
  - Store the updated record
  - Return 201 Created (or 200 OK) with the full Address object
Refresh scores of one or more addresses
- POST /addresses/refresh
- Body:
```
{
  "ids": [1, 2, 3]
}
```
- Behaviour:
  - Recalculate matches and scores
  - If ids is null, refresh all records
  - Return 200 OK

3.3 Running the backend

From the repository root:

cd backend
pip install -r requirements.txt
uvicorn main:app --reload --port 8000

API:

4. Frontend (React)

A small React + TypeScript frontend is provided in frontend/. It allows you to:

View all addresses in a table
Select rows via checkboxes
Add a new address
Inspect a single address in detail and update it
Refresh scores for selected or all addresses

4.1 Running the frontend

cd frontend
npm install
npm run dev

5. Deliverables

Send us a link to a Git repository on the day before your interview containing:

Your backend implementation
Any notebooks or scripts used during experimentation
Your EXPERIMENTS.md

Good luck! 🔥

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
backend		backend
data		data
frontend		frontend
.env.template		.env.template
.gitignore		.gitignore
.mise.toml		.mise.toml
Experiments.md		Experiments.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Root Sustainability – AI Engineer Technical Assessment

1. Assessment goals

2. Assignment

2.1 Wire up Mapbox properly

2.2 Research and improve the address similarity function

2.3 Polish & explore

3. Backend (FastAPI starter)

3.1 Data model

3.2 API contract

3.3 Running the backend

4. Frontend (React)

4.1 Running the frontend

5. Deliverables

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Root Sustainability – AI Engineer Technical Assessment

1. Assessment goals

2. Assignment

2.1 Wire up Mapbox properly

2.2 Research and improve the address similarity function

2.3 Polish & explore

3. Backend (FastAPI starter)

3.1 Data model

3.2 API contract

3.3 Running the backend

4. Frontend (React)

4.1 Running the frontend

5. Deliverables

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages