VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection

Official code repository for VETime (https://arxiv.org/abs/2602.16681), implemented in PyTorch. VETime proposes a novel time-series anomaly detection framework that unifies temporal and visual modalities through fine-grained alignment and dynamic fusion, achieving state-of-the-art zero-shot localization performance with lower computational overhead than existing vision-based approaches.

📄 Overview

Time-series anomaly detection (TSAD) requires identifying both immediate Point Anomalies and long-range Context Anomalies. However, existing foundation models face a fundamental trade-off: 1D temporal models provide fine-grained pointwise localization but lack a global contextual perspective, while 2D vision-based models capture global patterns but suffer from information bottlenecks due to a lack of temporal alignment and coarse-grained pointwise detection. To resolve this dilemma, we propose VETime , the first TSAD framework that unifies temporal and visual modalities through fine-grained visual-temporal alignment and dynamic fusion.

VETime introduces a Reversible Image Conversion and a Patch-Level Temporal Alignment module to establish a shared visual-temporal timeline, preserving discriminative details while maintaining temporal sensitivity. Furthermore, we design an Anomaly Window Contrastive Learning mechanism and a Task-Adaptive Multi-Modal Fusion to adaptively integrate the complementary perceptual strengths of both modalities. Extensive experiments demonstrate that VETime significantly outperforms state-of-the-art models in zero-shot scenarios, achieving superior localization precision with lower computational overhead than current vision-based approaches.

📁 Project Structure

VETime/
├── train.py                  # Main training script (with Accelerate support)
├── Test_TSB.py               # TSB-AD benchmark evaluation and inference
├── model/
│   ├── VETime.py             # VETIME main model architecture
│   ├── VTS_module.py         # Vision-Time Series fusion module
│   ├── Vision_encoder/       # Vision backbone (MAE, ViT)
│   └── TS_encoder/           # Time series encoder
├── dataset/
│   ├── dataloader.py         # Data loaders and collate functions
│   ├── pre_image.py          # Time series to image conversion utilities
│   └── TSB-AD/               # TSB-AD benchmark datasets
├── loss/
│   └── loss.py               # Contrastive loss, etc.
├── evaluation/
│   ├── metrics.py            # Comprehensive anomaly detection metrics
│   └── basic_metrics.py      # Basic metric implementations
└── requirements.txt          # Dependencies list

📦 Installation

Requirements

Python 3.8+ (Tested on 3.11)
PyTorch 2.3.0+ with CUDA
CUDA 12.1 (Recommended)

Installation Steps

# 1. Clone the repository
git clone https://github.com/yyyangcoder/VETime.git
cd VETime

# 2. Create conda environment
conda create -n VETime python=3.11
conda activate VETime

# 3. Install PyTorch (adjust according to your CUDA version)
conda install pytorch==2.3.0 torchvision==0.18.0 torchaudio==2.3.0 pytorch-cuda=12.1 -c pytorch -c nvidia

# 4. Install project dependencies
pip install -r requirements.txt

# 5. Install TSB-AD package
cd dataset/TSB-AD && pip install -e .

🚀 Quick Start

Download Pre-trained Model Checkpoints

Download the pre-trained model checkpoints from Hugging Face:

huggingface-cli download yyyang0/VETime-checkpoints --local-dir ./checkpoints

Download TSB-AD Datasets

This project uses the TSB-AD (Time Series Benchmark for Anomaly Detection) datasets:

Download Links:

Dataset Structure

After downloading and extracting, place the datasets in the following directory structure:

./dataset/TSB-AD/Datasets/
├── TSB-AD-U/          # Univariate datasets
├── TSB-AD-M/          # Multivariate datasets
└── File_List/         # Evaluation split files

📈 Evaluate Model

Evaluation Metrics

VETime employs the following comprehensive metrics for evaluation:

Metric	Description
VUS-PR	Volume Under Surface (Precision-Recall)
Affiliation Metrics	Event-based evaluation metrics
F1-T	A range-based metric that evaluates anomaly detection performance by considering the temporal context of anomalies
Standard-F1	Standard F1 score

Evaluate on TSB-AD benchmark

python Test_TSB.py \
    --model_name VETime \
    --dataset_test_dir ./dataset/TSB-AD/Datasets/TSB-AD-U \
    --file_list ./dataset/TSB-AD/Datasets/File_List/TSB-AD-U.csv

📄 License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

📚 Citation

If you find VETime useful in your research, please consider citing our paper:

@article{yang2026vetime,
  title={VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection},
  author={Yingyuan Yang and Tian Lan and Yifei Gao and Yimeng Lu and Wenjun He and Meng Wang and Chenghao Liu and Chen Zhang},
  journal={arXiv preprint arXiv:2602.16681},
  year={2026}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection

📊 Table of Contents

📄 Overview

📁 Project Structure

📦 Installation

Requirements

Installation Steps

🚀 Quick Start

Download Pre-trained Model Checkpoints

Download TSB-AD Datasets

Dataset Structure

📈 Evaluate Model

Evaluation Metrics

Evaluate on TSB-AD benchmark

📄 License

📚 Citation

🔗 References

Related Papers

Related Repositories

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection

📊 Table of Contents

📄 Overview

📁 Project Structure

📦 Installation

Requirements

Installation Steps

🚀 Quick Start

Download Pre-trained Model Checkpoints

Download TSB-AD Datasets

Dataset Structure

📈 Evaluate Model

Evaluation Metrics

Evaluate on TSB-AD benchmark

📄 License

📚 Citation

🔗 References

Related Papers

Related Repositories