New Architecture - Pipeline & Sandbox Pool & Database #167

ArthurCRodrigues · 2026-01-28T10:44:13Z

I'll add some bullet points and quick explanations here; for more details, check #151.

Context

We were running into some nasty problems when trying to run the project as a web API. Static variables, state and speed were major concerns, and it was simply impossible to run it as a service.

Also, the previous architecture relied on an orchestrated grading workflow, which left the orchestration file (autograder_facade.py) extremely coupled. Adding a new step meant adding orchestration logic to the grading process.

When it came to executing student code remotely, we were spinning up containers for every request with no proper control, and container startup was taking most of the request time.

Finally, we wanted to be able to store grading packages, so that teachers send them only once and each submission keeps only a reference to the assignment configuration.

Solution

So, in this PR, we introduce an architecture that follows a pipeline pattern: each step knows that it takes part in a grading process, so steps are not simple service providers anymore (they are choreographed, not orchestrated). This makes it much easier to include more steps or adjust the order in which steps are executed within the pipeline.
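To make the choreography idea concrete, here is a minimal sketch, not the code in this PR. The names `Execution`, `Step`, `CheckFilesStep`, `GradeStep` and `Pipeline` are hypothetical stand-ins, and the scoring rule is invented for illustration. The point is that the pipeline only iterates over its steps; each step reads what earlier steps left in the shared execution object and records its own result.

```python
from dataclasses import dataclass, field
from typing import Protocol


@dataclass
class Execution:
    """Hypothetical context object handed from one step to the next."""
    submission: str
    results: dict[str, object] = field(default_factory=dict)


class Step(Protocol):
    """Anything with a name and an execute() method can be a step."""
    name: str

    def execute(self, execution: Execution) -> Execution: ...


class CheckFilesStep:
    name = "check_files"

    def execute(self, execution: Execution) -> Execution:
        # Illustrative rule: an empty submission fails the check.
        execution.results[self.name] = bool(execution.submission.strip())
        return execution


class GradeStep:
    name = "grade"

    def execute(self, execution: Execution) -> Execution:
        # The step reads the earlier result itself; no orchestrator passes it in.
        files_ok = execution.results.get("check_files", True)
        execution.results[self.name] = 100.0 if files_ok else 0.0
        return execution


class Pipeline:
    def __init__(self, steps):
        self._steps = tuple(steps)

    def run(self, submission: str) -> Execution:
        execution = Execution(submission)
        for step in self._steps:  # no per-step branching lives here
            execution = step.execute(execution)
        return execution
```

Under these assumptions, adding or reordering a step only changes the list passed to `Pipeline`, for example `Pipeline([CheckFilesStep(), GradeStep()]).run("print('hi')")`.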

Second, we introduce a robust sandbox management sub-system that's responsible for handling all sandbox containers. It uses an optimization technique that starts containers on application startup and keeps them "warm" and ready to receive code. This gives us control over container usage within the system and also removes container startup time from requests, since the containers are already up.

Pipeline Architecture

An AutograderPipeline is an instance of a grading recipe based on the grading configuration. It contains all the necessary steps (and associated data) to grade the assignment configured by the teacher. One pipeline may include the setup step that checks for required files while another may not; it all depends on what the teacher configured.

What's cool about the AutograderPipeline is that it's really just the recipe: you can cook as many submissions with it as you want. It is a stateless representation of a specific grading workflow that can be executed for any submission. As you can see, this fits well with the goal of storing grading packages and keeping only references to them for later grading.
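The "recipe" idea can be sketched as follows. This is an illustration under assumptions, not the PR's implementation: `build_pipeline`, `require_files`, `grade`, `run` and the `required_files` config key are all hypothetical. The pipeline is built once from the configuration, the required-files step is included only when the teacher asked for it, and every run starts from a fresh state, so nothing leaks between submissions.

```python
from typing import Callable

# A step takes the current state and returns a new one (nothing is mutated).
StepFn = Callable[[dict], dict]


def require_files(required: tuple) -> StepFn:
    """Build a setup step that records which required files are missing."""
    def step(state: dict) -> dict:
        missing = [name for name in required if name not in state["files"]]
        return {**state, "missing": missing}
    return step


def grade(state: dict) -> dict:
    # Illustrative scoring rule only: any missing file means zero.
    score = 0.0 if state.get("missing") else 100.0
    return {**state, "score": score}


def build_pipeline(config: dict) -> tuple:
    """Turn a grading configuration into an immutable sequence of steps."""
    steps = []
    if config.get("required_files"):  # optional setup step
        steps.append(require_files(tuple(config["required_files"])))
    steps.append(grade)
    return tuple(steps)


def run(pipeline: tuple, files: dict) -> dict:
    state = {"files": files}  # fresh state for every submission
    for step in pipeline:
        state = step(state)
    return state
```

With this shape, one stored pipeline can grade any number of submissions: `pipeline = build_pipeline({"required_files": ["index.html"]})` followed by repeated `run(pipeline, files)` calls.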

Finally, we fixed the static variable problems by following proper coding practices and no longer using static member variables.
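For readers unfamiliar with why static members hurt a long-running service, here is a small generic Python illustration. Both classes are hypothetical and not taken from the codebase. A mutable class-level attribute is shared by every instance, so in a web API it would be shared by every request; an attribute created in `__init__` is not.

```python
class StaticGrader:
    # Class-level (static) list: one object shared by all instances.
    results = []

    def grade(self, score):
        self.results.append(score)
        return list(self.results)


class InstanceGrader:
    def __init__(self):
        # Instance-level list: each grader starts with its own empty list.
        self.results = []

    def grade(self, score):
        self.results.append(score)
        return list(self.results)
```

Two separate `StaticGrader()` objects see each other's scores, while two `InstanceGrader()` objects do not.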

Sandbox Management

Since this is a draft PR, I haven't implemented this part yet. I'm still doing the research, and the key points are:

We'll use gVisor for enhanced isolation.
The Sandbox Management sub-system will work as a background process.
We'll keep 2 containers running for each language by default (Java, Python, C, C++, JS).
We'll add features for scaling under increased workload.
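Since the sub-system is not implemented yet, the following is only a sketch of how a warm pool could behave, under loud assumptions: `Sandbox` is a plain stand-in object rather than a real container, `SandboxPool`, `acquire` and `idle_count` are hypothetical names, and gVisor, the background process, and scaling are not modeled at all. It only shows the "start 2 per language at startup, borrow and return" idea using the standard library's `queue.Queue`.

```python
import queue
from contextlib import contextmanager
from dataclasses import dataclass
from itertools import count

LANGUAGES = ("java", "python", "c", "c++", "js")
DEFAULT_POOL_SIZE = 2  # default from the list above


@dataclass
class Sandbox:
    """Stand-in for a running container; real code would hold a container handle."""
    id: int
    language: str


class SandboxPool:
    def __init__(self, languages=LANGUAGES, size=DEFAULT_POOL_SIZE):
        ids = count(1)
        self._idle = {}
        for language in languages:
            idle = queue.Queue()
            for _ in range(size):
                # In a real system this is where a container would be started.
                idle.put(Sandbox(next(ids), language))
            self._idle[language] = idle

    @contextmanager
    def acquire(self, language, timeout=None):
        # Blocks until a sandbox is free; raises queue.Empty if timeout expires.
        sandbox = self._idle[language].get(timeout=timeout)
        try:
            yield sandbox
        finally:
            # Always hand the sandbox back, even if grading raised.
            self._idle[language].put(sandbox)

    def idle_count(self, language):
        return self._idle[language].qsize()
```

A request would then wrap its work in `with pool.acquire("python") as sandbox: ...`, which caps concurrent usage per language at the pool size instead of starting a container per request.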

ArthurCRodrigues and others added 30 commits December 23, 2025 09:59

feat: first glance of pipeline architecture

92cbc95

feat: first glance of pipeline architecture

6e1e93e

feat: criteria tree printer

6367028

feat: criteria tree parser

ef0463f

fix: add missing subject_weight field

e0c2963

feat: first glance of pipeline architecture

19391e1

feat: add criteria schema and documentation for grading pipeline

b1715c8

feat: refactor import paths for models in various modules

8fcc9e4

feat: add Pydantic models for criteria configuration validation

1cc7b01

feat: update criteria tree models with embedded test functions

04244c7

Merge branch 'pipeline-architecture' into grader-refactor

046e9ec

feat: tree grading

fc7f6af

config grading WIP

fix: removed grade_from_config

0a18e1d

fix: update imports and typings

a901d7e

Apply suggestions from code review

7a0ec32

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

fix: tests import

aed70a1

fix: subject and category configs

3c0eaf6

fix: test config

c1677f3

fix: wrong factor calc

b15aad5

fix: add missing list appending

e422bc5

fix: removed rounding at __balance_subject_weights

15b5baf

fix: parsing subjects_weight to category

15b690f

Merge pull request #150 from webtech-network/grader-refactor

fd47081

refactor: change step order for more coherence

0ada0be

Merge branch 'pipeline-architecture' of https://github.com/webtech-network/autograder into pipeline-architecture

17e5822

# Conflicts: # autograder/autograder.py

feat: add Pydantic models for criteria configuration validation

29a922b

refactor: remove submission_id from grading process for cleaner implementation

a4b0410

feat: implement feedback generation in feedback step

20e7954

refactor: update result_tree attribute to be optional in grading result

73da418

feat: implement score setting in ExporterStep with error handling

38a07ec

ArthurCRodrigues added 22 commits January 19, 2026 19:47

refactor: Clean reporters and create reporter_service.py

e3da947

feat: add Submission and SubmissionFile dataclasses for handling submissions

920790f

fix: update run method to specify input_data type as Submission

73ff5f0

refactor: remove submission_id parameter from GradeStep initializer

e27157d

refactor: add comments to clarify pipeline step functionality in autograder

161a523

refactor: streamline template loading by removing legacy methods and simplifying imports

c8196d1

refactor: remove unused main execution blocks and legacy code from multiple files

eeee05c

refactor: delete current tests

d0b30f8

Revert "refactor: delete current tests"

80467c8

This reverts commit d0b30f8.

refactor: adding placeholders for better debugging

c08ad35

feat: add HTML grading pipeline test script

7a19be0

refactor: remove criteria_config.py

8f9375c

refactor: update configuration model imports and field names for clarity

d310456

refactor: update configuration model imports and field names for clarity

47dcc07

refactor: update parameters type in TestResult to Optional for better clarity

2d0d84a

feat: implement full pipeline test for BuildTreeStep and GradeStep

5c3e1a9

feat: introduce PipelineExecution class to manage pipeline execution and step results

cbb6250

feat: update step execution methods to use PipelineExecution as input type

0521686

refactor: remove unused parameters from autograder pipeline configuration

f83e06b

feat: enhance pipeline execution flow with PipelineExecution management

893d278

feat: update load_template_step to use PipelineExecution in execute method

1c939ef

feat: enhance feedback_step to generate feedback using grading result

c5cbced

ArthurCRodrigues assigned jaoppb and ArthurCRodrigues Jan 28, 2026

ArthurCRodrigues linked an issue Jan 28, 2026 that may be closed by this pull request

New Pipeline Architecture #151

Open

3 participants
