Warning: data is not aligned! This can lead to a speed loss #77

@jadams777

When I run the inference example:

python inference.py \
    --data_root "./checkpoints/ditto_trt_Ampere_Plus" \
    --cfg_pkl "./checkpoints/ditto_cfg/v0.4_hubert_cfg_trt.pkl" \
    --audio_path "./example/audio.wav" \
    --source_path "./example/image.png" \
    --output_path "./tmp/result.mp4" 

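I get this warning from swscaler during the writer stage:

Warning: data is not aligned! This can lead to a speed loss

The run still completes and writes ./tmp/result.mp4. Here is the full output: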

In file included from /root/miniconda3/envs/ditto/lib/python3.10/site-packages/numpy/_core/include/numpy/ndarraytypes.h:1909,
                 from /root/miniconda3/envs/ditto/lib/python3.10/site-packages/numpy/_core/include/numpy/ndarrayobject.h:12,
                 from /root/miniconda3/envs/ditto/lib/python3.10/site-packages/numpy/_core/include/numpy/arrayobject.h:5,
                 from /root/.pyxbld/temp.linux-x86_64-cpython-310/workspace/ditto-talkinghead/core/utils/blend/blend.c:1259:
/root/miniconda3/envs/ditto/lib/python3.10/site-packages/numpy/_core/include/numpy/npy_1_7_deprecated_api.h:17:2: warning: #warning "Using deprecated NumPy API, disable it with " "#define NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION" [-Wcpp]
   17 | #warning "Using deprecated NumPy API, disable it with " \
      |  ^~~~~~~
==================== setup kwargs ====================
max_size <class 'int'> 1920
template_n_frames <class 'int'> -1
crop_scale <class 'float'> 2.3
crop_vx_ratio <class 'int'> 0
crop_vy_ratio <class 'float'> -0.125
crop_flag_do_rot <class 'bool'> True
smo_k_s <class 'int'> 13
emo <class 'numpy.ndarray'> (600, 8)
eye_f0_mode <class 'bool'> False
ch_info <class 'dict'>
overlap_v2 <class 'int'> 10
fix_kp_cond <class 'int'> 1
fix_kp_cond_dim <class 'list'> [0, 202]
sampling_timesteps <class 'int'> 50
online_mode <class 'bool'> False
v_min_max_for_clip <class 'numpy.ndarray'> (4, 265)
smo_k_d <class 'int'> 3
N_d <class 'int'> -1
use_d_keys <class 'NoneType'> None
relative_d <class 'bool'> True
drive_eye <class 'NoneType'> None
delta_eye_arr <class 'numpy.ndarray'> (15, 63)
delta_eye_open_n <class 'int'> 0
fade_type <class 'str'> d0
fade_out_keys <class 'list'> ['exp']
flag_stitching <class 'bool'> True
overall_ctrl_info <class 'dict'> {'delta_pitch': 2}
==================================================
dit: 6it [00:01,  3.99it/s]
writer: 1it [00:13, 13.18s/it][swscaler @ 0x66baa00] Warning: data is not aligned! This can lead to a speed loss
writer: 394it [00:36, 10.72it/s]
ffmpeg -loglevel error -y -i "./tmp/result.mp4.tmp.mp4" -i "./example/audio.wav" -map 0:v -map 1:a -c:v copy -c:a aac "./tmp/result.mp4"
./tmp/result.mp4
(ditto) root@d8b454fef605:/workspace/ditto-talkinghead# cd tmp
(ditto) root@d8b454fef605:/workspace/ditto-talkinghead/tmp# ls
result.mp4  result.mp4.tmp.mp4
(ditto) root@d8b454fef605:/workspace/ditto-talkinghead/tmp# runpodctl send result.mp4
Runpod config file not found, please run `runpodctl config` to create it
7240-theory-common-modem-13
Sending 'result.mp4' (5.0 MB)    
Code is: 7240-theory-common-modem-13
On the other computer run

runpodctl receive 7240-theory-common-modem-13

Sending (->174.65.156.232:57965)
result.mp4 100% |████████████████████| (5.2/5.2 MB, 72 MB/s)
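
For context, a rough sketch of what this warning usually points to. As far as I know, libswscale prints it when an input or output buffer address or row stride is not a multiple of 16 bytes. The sketch below only looks at the stride side, and it assumes tightly packed 3-byte-per-pixel frames (for example rgb24); that pixel layout, the 16-byte constant, and all helper names are illustrative assumptions, not something taken from the ditto code.

```python
# Sketch only: approximates the stride part of the alignment check that
# (to my understanding) triggers the swscaler warning. Everything here is
# an assumption for illustration, not ditto's or FFmpeg's actual code.
ALIGNMENT = 16  # bytes; assumed alignment that swscaler checks


def row_stride(width: int, bytes_per_pixel: int = 3) -> int:
    """Bytes per row for a tightly packed frame (no row padding)."""
    return width * bytes_per_pixel


def is_aligned(width: int, bytes_per_pixel: int = 3,
               alignment: int = ALIGNMENT) -> bool:
    """True if a packed row of this width is a multiple of `alignment` bytes."""
    return row_stride(width, bytes_per_pixel) % alignment == 0


def padded_width(width: int, bytes_per_pixel: int = 3,
                 alignment: int = ALIGNMENT) -> int:
    """Smallest width >= `width` whose packed row stride is aligned."""
    w = width
    while not is_aligned(w, bytes_per_pixel, alignment):
        w += 1
    return w


if __name__ == "__main__":
    # Hypothetical widths, not measured from the run above.
    for width in (1920, 1000):
        print(width, is_aligned(width), padded_width(width))
```

Under these assumptions, a 1920-pixel-wide frame has an aligned stride, while a 1000-pixel-wide frame does not and would need padding to 1008. If the source image width is the cause here, resizing or padding it to an aligned width may silence the warning, but I have not verified that against this repo.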
