A worked example showing how to transform sparse data into quantifiable uncertainty models for early-stage design decision-making.
- The Data
- 1. Interval Analysis
- 2. Probability Theory
- 3. Evidence Theory (Dempster-Shafer Theory)
- References
Consider an early-stage conceptual design of an electric vehicle system employing lithium-ion battery (LIB) electric propulsion. This encompasses various applications, including electric ground vehicles, aircraft, and maritime systems. To assess the lifecycle Global Warming Potential (GWP) of the battery system, we focus on the specific energy-based metric (GWP per unit of battery energy capacity, in $kg CO_2e/kWh$). The literature reports the following values:

| GWP Value ($kg CO_2e/kWh$) | Type | Source |
|---|---|---|
| 60-93.2 | Range | (A) Abdelbaky et al.1 |
| 60-150 | Range | (B) Amarakoon et al.2 |
| 120.5-172.9 | Range | (C) André & Hajek3 |
| 170.5 | Scalar | (D) Liberacki et al.4 |
| 115 | Scalar | (E) Pollet et al.5 |
| 72.9 | Scalar | (F) Pontika et al.6 |
The discrepancy among sources introduces epistemic uncertainty, defined as uncertainty due to incomplete knowledge of the system. Crucially, the true lifecycle GWP of the future battery system is a fixed value, albeit unknown at this stage. This distinguishes it from aleatory uncertainty, which arises from inherent variability. The following sections present three approaches to quantify this epistemic uncertainty, translating the sparse literature data into formal uncertainty metrics suitable for integration into decision-making frameworks (e.g., design optimization, lifecycle assessment)7.
Interval analysis provides a fundamental approach to handling non-stochastic uncertainty. It assumes knowledge of a value's bounds — its minimum and maximum — but no information about the distribution of values within those bounds. While we know the range spanned by the data, we have no insight into how values accumulate or cluster within that interval.
We treat every data point as a set: each reported range becomes an interval $[a_i, b_i]$, and each scalar becomes a degenerate interval $[x, x]$. The combined result is the hull of all sources, $[\min_i a_i, \max_i b_i]$.

Using the provided Python script (interval_analysis.py), this results in $[60.0, 172.9]$ $kg CO_2e/kWh$.
Figure 1 illustrates the resulting interval (shown in red) as the union of all individual data ranges, along with the distribution of each data point across the total span (y-axis). Interval analysis provides a robust safety envelope by introducing no additional assumptions about the data. However, this approach is inherently conservative, as it disregards any potential clustering of values within specific sub-ranges.
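The interval step can be sketched in a few lines. This is an illustrative stand-in for interval_analysis.py, not the original script; the source labels and tuple layout are assumptions for readability.

```python
# Each source as (lower, upper); scalars become degenerate intervals [x, x].
sources = {
    "A": (60.0, 93.2),
    "B": (60.0, 150.0),
    "C": (120.5, 172.9),
    "D": (170.5, 170.5),
    "E": (115.0, 115.0),
    "F": (72.9, 72.9),
}

# The combined interval is the hull of all sources:
# [minimum of all lower bounds, maximum of all upper bounds].
lower = min(a for a, _ in sources.values())
upper = max(b for _, b in sources.values())
print(f"GWP interval: [{lower}, {upper}] kg CO2e/kWh")  # [60.0, 172.9]
```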
While Interval Analysis provides the absolute boundaries of our data, it treats all values within those boundaries as equally unknown. However, in decision-making we often assume that there is a central tendency, meaning the true value is more likely to be near the average of reported values than at the extreme edges. To model this, we can use Probability Theory. We discuss two approaches to handle uncertainty with Probability Theory.
Since we are dealing with sparse literature data, we treat each source as a separate probability distribution and aggregate them into a single, equally weighted Mixture Model. In this approach, we make the following assumptions for each literature source:

- The Mean ($\mu_i$): for a range $[a, b]$, the midpoint $(a+b)/2$; for a scalar, the reported value itself.
- The Standard Deviation ($\sigma_i$): for a range, we interpret $[a, b]$ as a 95% confidence interval, giving $\sigma_i = (b-a)/3.92$.
- For scalar values, we assign a nominal relative uncertainty of 5% ($\sigma_i = 0.05\,\mu_i$), since a single reported value still carries model uncertainty.

We combine all six components into an equally weighted normal mixture:

$$f(x) = \frac{1}{6}\sum_{i=1}^{6}\mathcal{N}\!\left(x;\ \mu_i, \sigma_i^2\right)$$

Using the provided Python script (probability_analysis_normal.py), we derive the following statistical model for the Battery GWP (Figure 2):
- Mean ($\mu_{normal}$): 114.45 $kg CO_2e/kWh$. The "consensus" average of all literature.
- Standard Deviation ($\sigma_{normal}$): 37.25 $kg CO_2e/kWh$. A measure of how much the studies disagree.
- 95% CI Lower Bound (2.5% quantile): 65.26 $kg CO_2e/kWh$. The optimistic boundary.
- 95% CI Upper Bound (97.5% quantile): 179.61 $kg CO_2e/kWh$. The conservative boundary.
A large standard deviation (approximately 33% relative to the mean) is a quantitative signal of high epistemic uncertainty. It tells the designer that the literature is significantly divided on the true GWP of the battery system. Notably, the normal distribution extends beyond the physically plausible bounds implied by the raw data, assigning small but non-zero probability to values outside the original reported range. These tails reflect the model's assumption that values near — but outside — the observed extremes remain possible, though with diminishing likelihood.
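The mixture statistics above can be reproduced analytically, without sampling. The sketch below is a stand-in for probability_analysis_normal.py; the 95%-interval reading of ranges and the 5% relative sigma on scalars are inferred assumptions, but they reproduce the reported mean and standard deviation.

```python
import math

# Ranges (sources A, B, C) and scalars (sources D, E, F) from the data table.
ranges = [(60.0, 93.2), (60.0, 150.0), (120.5, 172.9)]
scalars = [170.5, 115.0, 72.9]

# Assumption: each range is a 95% confidence interval -> sigma = width / 3.92.
components = [((a + b) / 2, (b - a) / 3.92) for a, b in ranges]
# Assumption: scalars carry a nominal 5% relative standard deviation.
components += [(x, 0.05 * x) for x in scalars]

w = 1 / len(components)  # equal weights
mu = sum(w * m for m, _ in components)
# Mixture variance = mean of component variances + variance of component means.
var = sum(w * (s**2 + (m - mu)**2) for m, s in components)
print(f"mean = {mu:.2f}, std = {math.sqrt(var):.2f}")  # mean = 114.45, std = 37.25
```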
While the Normal distribution assumes a central tendency (that the middle of a range is more likely), the Uniform Distribution is more conservative. It assumes that for any given study, every value within the reported range is equally likely. This approach is often preferred when we have no evidence to suggest the true value is in the center of a range rather than at the boundaries.
For each range $[a, b]$, the Cumulative Distribution Function (CDF) is a linear ramp:

$$F_i(x) = \begin{cases} 0 & x < a \\ \dfrac{x - a}{b - a} & a \le x \le b \\ 1 & x > b \end{cases}$$

For scalar values, the CDF is a unit step at the reported value. As before, all six sources are combined with equal weights of 1/6.
Using the provided Python script (probability_analysis_uniform.py), we derive the following statistical model (Figure 3):
- Mean ($\mu_{uniform}$): 114.45 $kg CO_2e/kWh$
- Standard Deviation ($\sigma_{uniform}$): 37.47 $kg CO_2e/kWh$
- 95% CI Lower (2.5%): 63.64 $kg CO_2e/kWh$
- 95% CI Upper (97.5%): 170.53 $kg CO_2e/kWh$
While the Normal distribution creates smooth S-curves, the Uniform distribution results in a piecewise linear CDF. The curve is composed of two distinct geometric features that correspond directly to our data: the linear ramps represent the ranges (sources A, B, C), with a constant rate of accumulating probability across those intervals, while the vertical steps represent the scalars (sources D, E, F), where each jump marks a "point of agreement" at which 1/6 (≈16.7%) of the total probability is concentrated at a single, precise value.
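The piecewise linear CDF can be written directly from that description. This is an illustrative stand-in for probability_analysis_uniform.py, under the same equal-weight assumption.

```python
# Ranges (A, B, C) contribute linear ramps; scalars (D, E, F) contribute steps.
ranges = [(60.0, 93.2), (60.0, 150.0), (120.5, 172.9)]
scalars = [170.5, 115.0, 72.9]

def mixture_cdf(x: float) -> float:
    """Equal-weight mixture CDF over all six sources."""
    total = 0.0
    for a, b in ranges:
        # Linear ramp on [a, b], clamped to [0, 1] outside the interval.
        total += min(max((x - a) / (b - a), 0.0), 1.0)
    for v in scalars:
        # Unit step: the scalar's full 1/6 mass sits at a single value.
        total += 1.0 if x >= v else 0.0
    return total / (len(ranges) + len(scalars))

print(mixture_cdf(60.0), mixture_cdf(115.0), mixture_cdf(172.9))
```

Evaluating the CDF at 115, for example, picks up the full mass of source A, the partial ramp of source B, and the steps of sources E and F.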
While Probability Theory forces us to distribute likelihood across a range (even if we don't know the shape), Evidence Theory (also called Dempster-Shafer theory) allows us to measure uncertainty through Belief and Plausibility. In this model, we assign a Basic Belief Assignment (BBA), denoted as $m$, to each body of evidence; with six equally credible sources, each reported interval or scalar receives a mass of $m_i = 1/6$. From these masses, two cumulative measures are derived:
- Cumulative Belief Function (CBF, red): the conservative lower bound. For a given GWP value x, the Belief Bel(x) is the sum of masses of all sources whose entire reported interval lies below x, i.e., the evidence that strictly supports the proposition. It is our "guaranteed" certainty.
- Cumulative Plausibility Function (CPF, blue): the optimistic upper bound. For a given GWP value x, the Plausibility Pl(x) is the sum of masses of all sources whose interval at least partly lies below x, i.e., the evidence that has not yet been ruled out.
Instead of a single CDF curve, Evidence Theory produces two bounding curves that create a Probability Box (P-Box), illustrating how belief and plausibility define a probability interval as lower and upper bounds. Using the provided Python script (evidence_theory.py), we visualize the literature data (Figure 4):
The gray shaded area between the Blue (Plausibility) and Red (Belief) lines is a direct measurement of our lack of knowledge (ignorance). Where the lines are far apart (e.g., between 75 and 120), the literature is either vague or conflicting. Where they pinch together (e.g. at the scalar points), our certainty is higher because the sources provided precise values. A risk-averse designer would look at the Belief curve (Red) to see what can be proven, while a risk-tolerant designer might look at the Plausibility curve (Blue) to see what is possible.
We notice that at approximately 115 $kg CO_2e/kWh$ the two curves pinch together: this is the scalar reported by source E, and it also falls inside the range of source B, making it a point of relative consensus in the literature.
In Evidence Theory, we don't get a single mean or standard deviation as in Probability Theory, because the theory is designed to avoid committing to a single guess. Instead, we obtain an interval of possible means, bounded by the Lower Expected Value ($E_{lower} = \sum_i m_i a_i$) and the Upper Expected Value ($E_{upper} = \sum_i m_i b_i$), computed from the endpoints $a_i$ and $b_i$ of each focal element.

For this dataset, it is the interval [99.82, 129.08] $kg CO_2e/kWh$. This tells the decision-maker: "Based on the evidence, the average GWP is somewhere in this range, and our ignorance prevents us from being more precise."
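The Belief, Plausibility, and expected-value interval follow directly from the focal elements and their masses. This sketch is a stand-in for evidence_theory.py, assuming equal masses of 1/6 and scalars treated as degenerate intervals.

```python
# Six focal elements with equal mass; scalars as degenerate intervals [x, x].
focal = [(60.0, 93.2), (60.0, 150.0), (120.5, 172.9),
         (170.5, 170.5), (115.0, 115.0), (72.9, 72.9)]
m = 1 / len(focal)

def belief(x: float) -> float:
    """Cumulative Belief: mass of sources whose entire interval lies below x."""
    return sum(m for a, b in focal if b <= x)

def plausibility(x: float) -> float:
    """Cumulative Plausibility: mass of sources whose interval starts below x."""
    return sum(m for a, b in focal if a <= x)

# Expected-value interval from the interval endpoints.
e_lower = sum(m * a for a, _ in focal)
e_upper = sum(m * b for _, b in focal)
print(f"Expected value interval: [{e_lower:.2f}, {e_upper:.2f}]")  # [99.82, 129.08]
```

The gap `plausibility(x) - belief(x)` at any x is exactly the gray ignorance band in Figure 4.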
| Method | Estimated Mean / Range | Key Takeaway |
|---|---|---|
| Interval | [60.0, 172.9] | Maximum possible bounds; no assumptions about internal distribution. |
| Probability (Normal) | 114.45 ± 37.25 | Most likely central value with normally distributed tails extending beyond observed data. |
| Probability (Uniform) | 114.45 (95% CI: 63.6–170.5) | Equal likelihood across all reported ranges; piecewise linear CDF. |
| Evidence (D-S) | Expected value: [99.8, 129.1] | Quantifies ignorance via Belief-Plausibility gap; consensus point at 115. |
Footnotes
1. Mohammad Abdelbaky, Lilian Schwich, João Henriques, Bernd Friedrich, Jef R. Peeters, and Wim Dewulf. Global warming potential of lithium-ion battery cell production: Determining influential primary and secondary raw material supply routes. Cleaner Logistics and Supply Chain, 9:100130, December 2023. ↩
2. Shanika Amarakoon, Jay Smith, and Brian Segal. Application of Life-Cycle Assessment to Nanoscale Technology: Lithium-ion Batteries for Electric Vehicles, April 2013. ↩
3. Nicolas André and Manfred Hajek. Robust Environmental Life Cycle Assessment of Electric VTOL Concepts for Urban Air Mobility. In AIAA Aviation 2019 Forum, Dallas, Texas, June 2019. American Institute of Aeronautics and Astronautics. ↩
4. Adam Liberacki, Barbara Trincone, Gabriella Duca, Luigi Aldieri, Concetto Paolo Vinci, and Fabio Carlucci. The Environmental Life Cycle Costs (ELCC) of Urban Air Mobility (UAM) as an input for sustainable urban mobility. Journal of Cleaner Production, 389:136009, February 2023. ↩
5. Félix Pollet, Florent Lutz, Thomas Planès, Scott Delbecq, and Marc Budinger. A generic life cycle assessment tool for overall aircraft design. Applied Energy, 399:126514, December 2025. ↩
6. Evangelia Pontika, Panagiotis Laskaridis, and Phillip J. Ansell. Technology exploration of zero-emission regional aircraft: Why, what, when and how? August 2025. ↩
7. The theoretical framework for uncertainty modeling in this work is based on: Wen Yao, Xiaoqian Chen, Wencai Luo, Michel Van Tooren, and Jian Guo. Review of uncertainty-based multidisciplinary design optimization methods for aerospace vehicles. Progress in Aerospace Sciences, 47(6):450–479, August 2011. ↩



