Confection 🍬

A Cellular Automata Approach to Disease Modeling
Ishika Tulsian, Yali Sommer, and Zack Amiton

[Video Demo]

We arrived at our project idea from two different angles: an interest in the cellular automata and extensions on the Game of Life (e.g. HighLife, Brian's Brain) on the one hand, and epidemiological models like disease models like SIR on the other.

These dual interest led us to create a series of cellular models, pairing Temporal Forge with a custom visualizer to explore the evolution of various toy "diseases" under varied configurations and introducing multiple complicating factors (death, vaccinations, recovery windows, ...).

Our goal with this project was largely exploratory rather than entering aiming to solve a specific sub-problem or find a "single" answer. Instead, as we investigated, we devised various hypotheses related to our diseases; as a sample:

Can we find configurations which result in cyclic / "glider"-like behavior?
Can we find "common cold"-like configurations which infect every cell of our world, but do not ultimately result in deaths?
Can we find traces where vaccinated cells provide "herd immunity" to neighboring cells, stopping an infection from spreading?

...and so forth!

Model Design: Decisions & Limitations

We initially began our model with a base ruleset, a slight modification of the classic Game of Life rules:

Susceptibility: All cells, by default, are "susceptible," and are infected if they have 2+ infected neighbors
Incubation: Infected cells remain infected if they have 3+ infected neighbors
Recovery: Infected cells recover if they have 2 or fewer infected neighbors
Limited Immunity: Recovered cells return to susceptibility after a single timestep

With these considerations, we decided to use Temporal Forge as a means to model extended traces, creating a Simulation sig to track the evolving state of a given trace, as well as a Configuration sig to either manually or programmatically define starting conditions for our various simulations (where Configuration directly determines the initial state for Simulation, via an initState predicate).

As our model builds on cellular automata, our "simulation" space is an n-by-n board, with Simulation maintaining a suite of set Int -> Int which correspond with row-column indices, where, e.g. (0 -> 0) in Simulation.recovered would denote that cell (0, 0) is recovered in the current state.

Simulation contains base sets infected, susceptible, and recovered, which we expanded to form our core sets, also including dead and vaccinated cells. Additionally, two manager sets are used (incubation mapping to infected, and immunity mapping to recovered) to track the number of states that cells have existed in another state, while a single derived set (protected) is used simply to surface information to our visualizer — perhaps a touch inelegant in overusing sets, but it works!

The state engine of our model is a family of "timestep" predicates, living in confection-core.frg which serve to simulate single iterations for a given configuration. We separated these up into distinct functions to enforce introduce different epidemiological constraints. Some key examples include:

timestep: only uses the "base" ruleset, operating on infected and recovered cells
deadTimestep: which introduces the notion of cells dying after remaining infected for 2 timesteps (tracked via incubation)
vaxTimestep: introduces vaccinated cells, which provide herd immunity, "protecting" nearby cells adjacent to at least 2 vaccinated cells
immunityTimestep: introduces temporary immunity to recovered cells (tracked via bounceback)
...and more!

The current iteration of our model relies on an integer bitwidth of 3 (restricting to an 8-by-8 grid), down from the default of 4 (and hence, a 16-by-16 grid); we found that the runtime of instances (particulaly when considering our test suite) made the larger grid size frustrating, if not infeasible — however, we acknowledge that the arithmetic-heavy approach to our modeling choices (e.g. our neighbor computations) may have also exacerbated this. Additionally, the amount of work needed to shift back is relatively minor (simply increasing the bitwidth should be sufficient, given that none of our timestep function conditions rely on the expanded range of numbers).

One issue we did encounter at both bitwidths was overflow considerations. While most cellular automata assume an infinite grid, the limited space of easily-representable indices when using Int -> Int for row-column identification meant that our grid size was limited by bitwidth. Hence, in order to create more interesting simulations, we used toroidal boundaries (a common restriction) such that border cells neighbor cells on the opposite side.

However, we did encounter wraparound issues, where a grid-based simulation means that any given cell has precisely 8 neighbors, assuming rule evolution rulies on Moore neighborhood (which ours does). Hence, rather than using inequalities to check neighbors, we needed to check for presense / absence of specific values (since 8 neighbors for bitwidth 4 would be treated as -7, or as 0 under bitwidth 3). This also meant needing to modify our helpers to explicitly check whether a set was empty rather than simply its cardinality for bitwidth 3, as otherwise our evolution rules would not be able to distinguish having a full neighborhood from having no neighborhood.

An additional limitation of having a bitwidth of 3 was our inability to explicity define the amount of infected cells in a given state of a configuration. For some traces, such as fast death, we could have made interesting observations by evaluating initial configurations with differing counts of infected cells (i.e., can 5 infected cells kill an entire board just as fast as 10?), but as any value over 3 is off limits, we could not define these cases.

Another limitation around from our use of Temporal Forge, as lasso traces aren't necessarily ideal for cellular automata modeling. For example, the Game of Life is famous for exhibiting chaotic, emergent behavior; while our model follows a different suite of rules, it too evolves rather chaotically. We decided to use a timestamp system that allowed us to add a "cutoff" point to our configuration, after which any timestep being used would simply do nothing (and force an "explicit" lasso point, even in traces that didn't necessarily exhibit periodic behavior). While we later modified this via partial traces, the underlying intuition remains the same.

Further, as our investigation was largely "depth"-focused (i.e. iteratively adding rules which added new rules to the same ruleset, rather than experimenting with variants on the same rule), our hypotheses were primarily of the form "can we achieve X under this ruleset?" rather than "can we find a ruleset that achieves X?" Were we continue to exploring this model / expand our work in this domain, searching for novel rulesets in addition to configurations would be a potentially more interesting guiding approach. We could, for example, attempt to synthesize diseases to finding the "deadliest" disease that still exhibits certain conditions, experimenting with infectiousness, different herd immunity conditions, and so forth. These variants are achievable explicitly by creating new timestep funtions, but as our focus was on how new rules affected the behavior of our simulations, we did not end up working as broadly as we could have.

Interpreting Results

We created a custom visualizer to support stepping through the iterations of a given simulation, which maps the members of each set to its corresponding, colored grid cell:

Grey cells correspond with susceptible cells
Red cells correspond with infected cells
Green cells correspond with recovered cells
Black cells correspond with dead (these should remain present after appearing in a simulation)
Blue cells correspond with vaccinated cells (these should remain fixed thorughout a simulation)
- Blue-Outlined cells correspond with protected cells, which cannot be infected

Additionally, a label at the top of the visualizer notes the current timestep, and buttons at the bottom allow for time to be progressed or reset.

An instance of the model under a given run finds a simulation that meets all the provided criteria: for example, if the condition being searched for was a model that lasts at least some number of iterations without any cells dying, then a satisfying example would show the evolution of that desired model. Conversely, an unsat result would suggest that, for the given criteria being searched for, no such model exists.

Goals & Findings

Our goal from the outset was not to prove a specific theorem or simulate a real-world disease, but rather to explore the expressive capacity of cellular automata-based disease models using formal methods. Specifically, we sought to understand the kinds of dynamic, emergent behaviors we could model from a relatively simple and deterministic rule set. As we iterated on our model, we were able to achieve all our foundational and target goals we set out in our project proposal, as well as implement the additional dead and vaccinated states which were described in our reach goal. These extensions allowed us to experiment with a much richer space of epidemiological behaviors, going beyond simple infection-recovery loops. Over the course of the project, we designed and successfully generated a diverse suite of traces, including but not limited to:

Gliders: Guiding Questions: Could we find an initial configuration that led to self propagating infection clusters reminiscent of GoL gliders? If so, is there a way we get use vaccinated states to stop the spread of infection through such a glider?

Relevant Traces: workingGliderSmallTrace, workingGliderBigTrace, gliderVaxWallTrace

We were able to find traces where the shape of infection spread wouldn't change but instead just move vertically forever. workingGliderSmallTrace and workingGliderBigTrace were implemented using our timestep predicate, and gliderVaxWallTrace was implemented using vaxTimestep to allow for vaccinated and protected states.

Interestingly, we see that only 4 cells need to be vaccinated in order to provide a 'herd immunity' effect to their surrounding cells, stopping the infection glider from persisting!
Oscillators: Guiding Questions: Can we generate configurations that repeat indefinitely without reaching a fixed point? Can we find a system where there is a periodic infection pattern?

Relevant Traces: checkerboardTrace, oscillatorTrace

Using checkerboardTrace we were able to find a period-3 oscillator following the timestep ruleset!
Finite-Length Traces: Guiding Questions: Can we construct a disease that evolves and fully resolves (no infected cells remain) within a bounded number of timesteps?

Relevant Traces: finiteTrace1, finiteTrace2, finiteTrace3, novelTrace

By reducing our search space by making the board 8 x 8, we were able to use novelTrace to find initial configurations that would satisfy the end goal of eventually { no Simulation.infected }. We also created our own seeds that satisfied these conditions in finiteTrace1, finiteTrace2 and finiteTrace3. Here are some examples:

Once we found these finite traces, we decided to see if these initial configurations would hold as finite traces for a ruleset that mimics a more infectious disease (described by timestepMoreInfectious) as well as if it would recover the population quicker for a less infectious disease (described by recoveryTimestep where cells stay recovered longer). We found that for the more infectious disease, the same configuration would lead to the entire population being infected, while for the less infectious, the population recovery rate was higher.
Fast Death Traces: Guiding Questions: Is there some notion of lethal/deathly infectious disease? Can we find minimal seed infections that wipe out the entire population as fast as possible?

Relevant Traces: fastDeathTrace

Within the logic for fastDeathTrace, we enforce that
```
eventually { 
    no Simulation.infected
    no Simulation.susceptible 
    no Simulation.recovered
}
```
which helps us find traces where the entire population dies. fastDeathTrace uses deadTimestep to incorporate the dead state ruleset. Here are some traces it found:

Testing

All tests are in the con.tests.frg file. Our test suite focuses on each of the core elements of our system: neighbor computation / toroidal boundary conditions, wellformedness, initialization, and crucially, our timesteps. We do so using mainly property-based tests and using specific seed configurations to test for timestep properties. We could only really test properties and the behavior of our system using assert statements since we implemented our model in Temporal Forge.

Due Credit

Andrew Wagner + Tim's Forge Game of Life implementation was invaluable as a reference point for our initial explorations — we built out our conway.frg based on this!
Andrew's senior thesis on synthesizing initial configuration served as a great source of inspiration (and is just a very cool read!), though we mostly heeded his warnings and stayed away from non-determinism :)
Brian (of Brian's Brain) for his brain (which we also implemented in confection-old.frg)

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
conway		conway
media		media
README.md		README.md
con-visualizer.js		con-visualizer.js
con.tests.frg		con.tests.frg
confection-core.frg		confection-core.frg
confection-traces.frg		confection-traces.frg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Confection 🍬

Model Design: Decisions & Limitations

Interpreting Results

Goals & Findings

Testing

Due Credit

Collaborators and Sharing

About

Uh oh!

Releases

Packages

Languages

ishika2603/confection

Folders and files

Latest commit

History

Repository files navigation

Confection 🍬

Model Design: Decisions & Limitations

Interpreting Results

Goals & Findings

Testing

Due Credit

Collaborators and Sharing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages