Training gets slower after loading model checkpoints #8

@10wook

Hi, it's me again.

Hope you missed me, haha.

Unfortunately, I am here with an issue.

I have finally succeeded in training your models in the eng_kor version.

I found that training on a dataset of 100,000 images for 700,000 iterations was not enough to reach the performance I want.

So I decided to keep training on the same dataset.

Then I noticed that the time per iteration (s/iter) had increased.

Before I loaded the checkpoint, it took about 2.46 s per iteration; now it takes at least 4.85 s per iteration.

Usually it takes more than 20 s per iteration now.

I also found that my GPU does not seem to be doing any work: nvidia-smi reports 0% GPU-Util even though the python process holds 21370MiB of GPU memory.

han@han-System-Product-Name:~$ nvidia-smi
Tue Apr 22 02:04:17 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.135                Driver Version: 550.135        CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 4090        Off |   00000000:01:00.0  On |                  Off |
| 35%   42C    P8             24W /  450W |   22360MiB /  24564MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A      1707      G   /usr/lib/xorg/Xorg                            552MiB |
|    0   N/A  N/A      1892      G   /opt/teamviewer/tv_bin/TeamViewer              20MiB |
|    0   N/A  N/A      1958      G   /usr/bin/gnome-shell                           90MiB |
|    0   N/A  N/A      4783      G   ...seed-version=20250417-180112.233000         43MiB |
|    0   N/A  N/A     37012      G   ...erProcess --variations-seed-version        249MiB |
|    0   N/A  N/A    543720      C   python                                      21370MiB |
+-----------------------------------------------------------------------------------------+

I really want to know why this is happening.

Any ideas? Please help me out.
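
A minimal sketch for narrowing this down, under the assumption that the slowdown comes from either the input pipeline or the training step itself: time the two parts of each iteration separately. `profile_iterations`, `batches`, and `step_fn` are hypothetical names, not part of this repository; `batches` stands for whatever iterable yields training batches, and `step_fn` for one forward/backward/update call.

```python
import time


def profile_iterations(batches, step_fn, num_iters=20):
    """Split per-iteration wall-clock time into data wait and step time.

    `batches` is any iterable of training batches and `step_fn` runs one
    training step on a batch. Both are placeholders (assumptions) for the
    real training loop, which this issue does not show.
    """
    data_time = 0.0
    step_time = 0.0
    done = 0
    it = iter(batches)
    for _ in range(num_iters):
        t0 = time.perf_counter()
        try:
            batch = next(it)       # time spent waiting for the next batch
        except StopIteration:
            break
        t1 = time.perf_counter()
        step_fn(batch)             # time spent in the training step
        t2 = time.perf_counter()
        data_time += t1 - t0
        step_time += t2 - t1
        done += 1
    if done == 0:
        return {"iters": 0, "data_s_per_iter": 0.0, "step_s_per_iter": 0.0}
    return {
        "iters": done,
        "data_s_per_iter": data_time / done,
        "step_s_per_iter": step_time / done,
    }
```

If `data_s_per_iter` dominates, the GPU is probably idle because it is waiting for input, which would be consistent with 0% GPU-Util. If the training code runs GPU work asynchronously (for example in PyTorch, which this issue does not confirm), `step_fn` should synchronize before returning, e.g. with `torch.cuda.synchronize()`, so that the step time is not under-reported.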
