This project implements a Reinforcement Learning (RL) approach to control a building's radiator system, balancing energy savings and occupant comfort. The goal is to minimize electricity costs (taking off-peak pricing into account) while maintaining comfortable temperatures when occupants are home. Although this particular problem could be solved with classical statistical approaches, the project is meant as a template for more complex behavior (strong non-linearity, randomness, etc.). Overall, it is a nice project for having some fun with RL.
- Simulated Environment: Custom linear physical model (conductance + capacity) for building thermal dynamics.
- RL Agents: Rule-based baseline and a DQN (Deep Q-Network) implemented in PyTorch.
- Discrete Action Space: Radiator power levels.
- Reward Function: Weighted sum of electricity cost and comfort (temperature deviation).
- Data: Synthetic data for algorithm testing; real-world data (MeteoSwiss) for training.
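The linear conductance + capacity model mentioned above can be sketched as follows. This is an illustrative assumption, not the project's actual code: the function name, parameter values, and forward-Euler integration are all hypothetical.

```python
# Minimal sketch of a linear conductance + capacity thermal model.
# All names and parameter values here are illustrative assumptions.

def step_temperature(t_in, t_out, heater_power_w, dt_s=900.0,
                     conductance_w_per_k=150.0, capacity_j_per_k=5e6):
    """Advance indoor temperature by one time step (forward Euler).

    dT/dt = (P_heater - U * (T_in - T_out)) / C
    """
    heat_flow_w = heater_power_w - conductance_w_per_k * (t_in - t_out)
    return t_in + heat_flow_w / capacity_j_per_k * dt_s

# Example: cold outdoor air, heater off -> indoor temperature drops.
t_next = step_temperature(t_in=20.0, t_out=0.0, heater_power_w=0.0)
```

A single conductance and a single capacity give a first-order lag, which is enough to make the cost/comfort trade-off non-trivial while staying fast to simulate.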
The agent's observation (state) includes:
- Current indoor temperature
- Outdoor temperature
- Radiator state
- Occupant presence
- Time of day (for pricing)
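The observation components above could be packed into a flat numeric vector before being fed to an agent. The following dataclass is a hypothetical sketch; the field names and scaling are assumptions, not the project's implementation.

```python
from dataclasses import dataclass

@dataclass
class Observation:
    """One environment observation (field names are illustrative)."""
    indoor_temp_c: float
    outdoor_temp_c: float
    radiator_level: int      # index into the discrete power levels
    occupant_present: bool
    hour_of_day: int         # 0-23, used to look up the electricity price

    def as_vector(self):
        # Flat numeric vector suitable as input to a Q-network.
        return [self.indoor_temp_c, self.outdoor_temp_c,
                float(self.radiator_level), float(self.occupant_present),
                self.hour_of_day / 23.0]

obs = Observation(20.5, 3.0, 2, True, 18)
```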
- Discrete radiator power levels (e.g., 0%, 33%, 66%, 100%)
- Cost Term: Penalizes high electricity usage, scaled by real-time pricing.
- Comfort Term: Penalizes deviation from the desired temperature range.
- Total Reward: Weighted sum of cost and comfort terms.
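As a rough sketch, the weighted reward might look like the function below. The weights, time step, and comfort band are assumptions chosen for illustration, not the project's tuned values.

```python
# Hedged sketch of the weighted cost + comfort reward.
# Weights, time step, and the comfort band are illustrative assumptions.

def reward(power_kw, price_per_kwh, indoor_temp_c, occupant_present,
           dt_h=0.25, comfort_low=19.0, comfort_high=22.0,
           w_cost=1.0, w_comfort=0.5):
    cost = power_kw * dt_h * price_per_kwh  # electricity cost this step
    # Comfort penalty only matters while someone is home.
    if occupant_present:
        deviation = max(comfort_low - indoor_temp_c,
                        indoor_temp_c - comfort_high, 0.0)
    else:
        deviation = 0.0
    return -(w_cost * cost + w_comfort * deviation)

r = reward(power_kw=2.0, price_per_kwh=0.25, indoor_temp_c=18.0,
           occupant_present=True)
```

Gating the comfort term on presence is what lets the agent pre-heat cheaply during off-peak hours instead of holding the setpoint around the clock.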
- Weather Data: Sourced from MeteoSwiss (real-world) and synthetic datasets (testing).
- Electricity Pricing: Simulated off-peak/peak pricing (averaged for training).
- Occupant Presence: Simulated presence ranges.
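A minimal generator for the synthetic data described above might look like this. The sinusoidal temperature profile, the two-tier peak/off-peak prices, and the fixed presence window are all assumptions made for the sketch.

```python
import math
import random

def synthetic_day(seed=0, steps_per_day=96):
    """Generate one synthetic day of outdoor temperature, price and presence.

    Purely illustrative: the sinusoidal temperature, two-tier pricing,
    and morning/evening presence window are assumptions.
    """
    rng = random.Random(seed)
    day = []
    for k in range(steps_per_day):
        hour = 24.0 * k / steps_per_day
        # Coldest around 04:00, warmest around 16:00, plus noise.
        t_out = (5.0 + 6.0 * math.sin(2 * math.pi * (hour - 10.0) / 24.0)
                 + rng.gauss(0.0, 0.5))
        price = 0.30 if 7 <= hour < 22 else 0.15   # peak vs off-peak
        present = hour < 8 or hour >= 18           # home mornings/evenings
        day.append((hour, t_out, price, present))
    return day

day = synthetic_day()
```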
- Rule-Based Baseline: Simple heuristic (e.g., turn the radiator on when the temperature drops below a threshold).
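Such a heuristic can be sketched as a thermostat with hysteresis, so the radiator does not chatter on and off around a single threshold. The thresholds and the power-level indices below are illustrative assumptions.

```python
# Hypothetical rule-based baseline with hysteresis to avoid rapid
# on/off switching (thresholds and level indices are illustrative).

def rule_based_action(indoor_temp_c, currently_on,
                      on_below=19.5, off_above=21.0):
    """Return a power level index: 0 = off, 3 = full power."""
    if indoor_temp_c < on_below:
        return 3                       # too cold: full power
    if indoor_temp_c > off_above:
        return 0                       # warm enough: off
    return 3 if currently_on else 0    # inside the band: keep last state

a = rule_based_action(18.0, currently_on=False)   # -> 3 (heat on)
```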
- DQN Agent:
  - Network: PyTorch implementation.
  - Training: Offline (pre-collected data) or online (interaction with the environment).
  - Hyperparameters: Learning rate, discount factor, exploration rate (ε-greedy).
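The ε-greedy exploration mentioned above is typically paired with a decay schedule: explore heavily early in training, then mostly exploit. The linear schedule and values below are assumptions for illustration, not the project's settings.

```python
import random

def epsilon_by_step(step, eps_start=1.0, eps_end=0.05, decay_steps=10_000):
    """Linear ε decay (schedule shape and values are assumptions)."""
    frac = min(step / decay_steps, 1.0)
    return eps_start + frac * (eps_end - eps_start)

def select_action(q_values, step, rng=random):
    """ε-greedy over the discrete radiator power levels."""
    if rng.random() < epsilon_by_step(step):
        return rng.randrange(len(q_values))                         # explore
    return max(range(len(q_values)), key=q_values.__getitem__)      # exploit
```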
- Total Reward: Sum of rewards over a 24-hour period.
- Total Cost: Sum of electricity costs over a 24-hour period.
- Comfort Metrics: Average deviation from the desired temperature.
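The metrics above can be computed from logged per-step data at the end of an episode. The tuple layout and target temperature below are assumptions made for the sketch.

```python
# Illustrative computation of per-episode metrics from logged steps.
# Each step is (reward, cost, indoor_temp_c); names are assumptions.

def episode_metrics(steps, target_temp_c=20.5):
    total_reward = sum(r for r, _, _ in steps)
    total_cost = sum(c for _, c, _ in steps)
    avg_deviation = sum(abs(t - target_temp_c)
                        for _, _, t in steps) / len(steps)
    return total_reward, total_cost, avg_deviation

metrics = episode_metrics([(-0.5, 0.2, 20.0), (-0.1, 0.0, 21.0)])
```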
- Rule-based agent (serves as the baseline for comparison).
- Python 3.8+
- PyTorch
- Gymnasium
- Poetry (for dependency management)
- Clone the repository:

        git clone https://github.com/mp-mech-ai/radiator-rl

- Install dependencies:

        poetry install

- Download weather data from MeteoSwiss and place it in `data/weather/`.
- Run the DQN training script:

        python dqn_training.py
- Long Training Time: DQN requires extensive interaction with the environment.
- Localization: Model is currently trained for a single location.
- Temperature Assumptions: Simplified linear model may not capture all real-world dynamics.
- Scalability: Extend to multiple locations with diverse weather patterns.
- Advanced Algorithms: Implement PPO for continuous power control.
- Real-World Deployment: Test on physical hardware or more complex simulators.