CircuitSense: A Hierarchical Circuit System Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process

Welcome to the official repo of CircuitSense, a comprehensive visual circuit benchmark that evaluates models' capabilities in symbolic reasoning and visual mathematical derivation. We introduce a hierarchical synthetic generation pipeline consisting of a grid-based schematic generator and a block diagram generator with auto-derived symbolic equation labels. This repository contains the code for that pipeline.

For further information, please refer to our preprint.

[🌐 Webpage] [🤗 Hugging Face Dataset] [📑 Paper]

💥 News

[2025.10.17] 🚀 CircuitSense (full version) was accepted to ICLR 2026 🇧🇷!
[2025.10.17] 🚀 CircuitSense was accepted to the NeurIPS 2025 MATH-AI workshop (The 5th Workshop on Mathematical Reasoning and AI) and will be presented at the San Diego Convention Center on December 6th.
[2025.09.29] 🔥 We release the code for our hierarchical synthetic generation pipeline.
[2025.09.25] The arXiv paper is online.

About CircuitSense

We introduce CircuitSense, a comprehensive benchmark of 8,006 problems for evaluating visual-to-mathematical reasoning in circuit understanding. It combines curated questions with synthetic problems focused on symbolic equation derivation. Our hierarchical synthetic generation pipeline produces novel circuits across six levels with guaranteed ground-truth symbolic equations, enabling rigorous evaluation. Our extensive evaluation on perception, analysis, and design tasks shows that models demonstrate adequate perception (85%+ for closed-source models) but fail catastrophically at symbolic mathematical modeling (below 19%). This mathematical weakness directly undermines their design capabilities.

Requirements:

# System deps (LaTeX + ngspice)
sudo apt-get install -y texlive-full
sudo apt-get install -y libngspice0-dev ngspice

# Python deps
pip install -r requirements_lcapy.txt
pip install PyMuPDF PySpice readchar httpx
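
Before running the pipeline, it can help to confirm that the system dependencies are reachable. The following is a minimal sketch, assuming the pipeline relies on the `pdflatex` and `ngspice` executables installed by the packages above; the tool list and the helper name `missing_tools` are illustrative and not part of this repository.

```python
import shutil

# Executables assumed to come from the packages above:
# pdflatex (texlive-full) and ngspice (ngspice). Adjust if your setup differs.
REQUIRED_TOOLS = ("pdflatex", "ngspice")


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the names in `tools` that are not found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing executables:", ", ".join(missing))
    else:
        print("All required executables found.")
```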

Quickstart

Run the end-to-end pipeline via the CLI wrapper in main.py.

# Example 1: default generation + visualization
PYTHONPATH=. python main.py \
  --note grid_v11_240831 \
  --gen_num 50 \
  --num_proc 4

# Example 2: generate symbolic circuits and derive equations
PYTHONPATH=. python main.py \
  --note symbolic_circuits \
  --gen_num 30 \
  --symbolic \
  --derive_equations \
  --show_sample_equations

# Example 3: build the symbolic equations dataset from the Example 2 output
PYTHONPATH=. python scripts/create_symbolic_equations_dataset.py \
  --input_dir datasets/symbolic_circuits

Key outputs are written under datasets/<note>/, including labels.json and, when enabled, symbolic_equations.json.
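
To check what a run actually produced, the two JSON files can be inspected without knowing their internal layout. This is a sketch under stated assumptions: the helper name `summarize_outputs` is hypothetical, and it only reports the top-level container type and size because the schema of these files is not described here.

```python
import json
from pathlib import Path


def summarize_outputs(note, root="datasets"):
    """Report, for each expected JSON output, whether it exists and its top-level shape."""
    summary = {}
    for name in ("labels.json", "symbolic_equations.json"):
        path = Path(root) / note / name
        if not path.is_file():
            summary[name] = None  # not generated, or the step was not enabled
            continue
        with path.open(encoding="utf-8") as fh:
            data = json.load(fh)
        # Only the top-level container is inspected; no schema is assumed.
        size = len(data) if isinstance(data, (list, dict)) else None
        summary[name] = (type(data).__name__, size)
    return summary
```

For instance, `summarize_outputs("symbolic_circuits")` would return `None` for `symbolic_equations.json` if `--derive_equations` was not passed.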

CLI options (from main.py)

--note: Dataset name used for the data JSON and output directory (required)
Generation:
- --circuit_note: version/note used by generator (default: v11)
- --gen_num: number of circuits to generate (default: 50)
- --num_proc: processes for generation (default: 1)
- --symbolic: generate symbolic circuits
- --simple_circuits: generate simpler circuits
- --integrator: enforce one integrator op-amp per circuit
- --rlc: generate RLC networks (one AC source and at least one reactive component)
- --no-meas: hide all probe drawings except those required
Analysis:
- --derive_equations: run Lcapy-based symbolic derivation
- --max_equations: maximum circuits to analyze (default: 20)
- --show_sample_equations: print sample equations during derivation
- --generate_symbolic_questions: include symbolic TF questions in output
- --questions_only: only generate symbolic questions (sets the above accordingly)
- --max_components: skip circuits with more than this many components (default: 20)
- --fast_analysis: shorter timeouts for faster processing
Control:
- --skip_generation: skip generation (use existing data)
- --skip_visualization: skip visualization step
- --force: overwrite existing data without prompt

Pipeline details

Under the hood, main.py orchestrates:

Generation: ppm_construction/data_syn/generate.py produces ppm_construction/data_syn/data/<note>.json.
Visualization: ppm_construction/ft_vlm/data_process/get_datasets_from_json_data.py --note <note> transforms JSON into datasets/<note>/ with rendered images and labels.json.
Equation derivation: scripts/analyze_synthetic_circuits_robust.py reads datasets/<note>/labels.json and writes datasets/<note>/symbolic_equations.json (when enabled).
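
The three stages above each leave one artifact on disk, which makes it easy to see how far a given note has progressed (for example, before deciding whether to pass --skip_generation). A minimal sketch, assuming the paths listed above are relative to the repository root; the function names are hypothetical.

```python
from pathlib import Path


def stage_artifacts(note, repo_root="."):
    """Map each pipeline stage to the file it is described as producing."""
    root = Path(repo_root)
    return {
        "generation": root / "ppm_construction" / "data_syn" / "data" / f"{note}.json",
        "visualization": root / "datasets" / note / "labels.json",
        "equation_derivation": root / "datasets" / note / "symbolic_equations.json",
    }


def first_missing_stage(note, repo_root="."):
    """Return the first stage whose artifact is missing, or None if all exist."""
    for stage, path in stage_artifacts(note, repo_root).items():
        if not path.is_file():
            return stage
    return None
```

Note that a missing `symbolic_equations.json` does not necessarily indicate a failure, since equation derivation only runs when enabled.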

Legacy script usage (optional)

If you prefer calling scripts directly:

# 1) Generate netlists/LaTeX
bash ./ppm_construction/data_syn/scripts/run_gen.sh

# 2) Visualize circuit images with LaTeX
PYTHONPATH=. \
python ./ppm_construction/ft_vlm/data_process/get_datasets_from_json_data.py \
  --note grid_v11_240831

# 3) Derive equations
PYTHONPATH=. \
python scripts/analyze_synthetic_circuits_robust.py \
  --labels_file datasets/grid_v11_240831/labels.json \
  --output_file datasets/grid_v11_240831/symbolic_equations.json \
  --max_circuits 50

Evaluation

The evaluation script provided in the evaluation folder is for evaluating answers on the synthetically generated problems. The script accepts three command-line arguments:

python evaluation/benchmark_symbolic_equations.py [mode] [max_questions] [dataset_path]

The mode can be full, inference, or evaluation. Mode full runs the model on the questions and immediately grades the answers. Mode inference generates answers and saves them to .txt files in the question folders without grading them. Mode evaluation reads the responses saved by an earlier inference run and grades them.

python evaluation/benchmark_symbolic_equations.py full 10
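
Splitting a run into an inference pass followed by an evaluation pass can be scripted as two invocations. The sketch below only builds the command lines; `eval_command` and `two_phase_commands` are hypothetical helpers, and it assumes the positional order shown above with `dataset_path` optional.

```python
VALID_MODES = ("full", "inference", "evaluation")


def eval_command(mode, max_questions, dataset_path=None):
    """Build the argv for one run of the evaluation script."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    argv = [
        "python",
        "evaluation/benchmark_symbolic_equations.py",
        mode,
        str(max_questions),
    ]
    if dataset_path is not None:
        argv.append(str(dataset_path))
    return argv


def two_phase_commands(max_questions, dataset_path=None):
    """Return the inference command followed by the evaluation command."""
    return [
        eval_command("inference", max_questions, dataset_path),
        eval_command("evaluation", max_questions, dataset_path),
    ]
```

Running the two phases separately lets saved responses be re-graded later without querying the model again.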

Citation

If you find the provided code or our paper useful in your work, we kindly request that you cite our work.

@misc{akbari2025circuitsensehierarchicalcircuitbenchmark,
      title={CircuitSense: A Hierarchical Circuit System Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process}, 
      author={Arman Akbari and Jian Gao and Yifei Zou and Mei Yang and Jinru Duan and Dmitrii Torbunov and Yanzhi Wang and Yihui Ren and Xuan Zhang},
      year={2025},
      eprint={2509.22339},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2509.22339}, 
}

This repository is based on MAPS: Advancing Multi-modal Reasoning in Expert-level Physical Science.
