How are multiple faces handled in preprocessing? #2

Hi and congratulations on your work!
I'm trying to reproduce your results, but I'm having trouble preprocessing the FF++ dataset with your code. I have computed the landmarks for each video as per the instructions. However, the `extract_faces.py` script fails with the error
`ValueError: operands could not be broadcast together with shapes (68,2) (136,2)` in [this line](https://github.com/ahaliassos/RealForensics/blob/990243bd3aee8fff4742045e110411d53b504f13/preprocessing/extract_faces.py#L94) of the `crop_patch` function. It seems to expect only one set of landmarks per frame. But since some frames in FF++ and other datasets contain multiple faces, how is this handled?
The code would run if, instead of a single numpy array, the landmarks for a frame were a list of one or more (68,2) items. However, since the landmarks are smoothed, this would produce very jittery movement, because the smoothing would mix faces at different locations.
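To make the shape problem concrete, here is a minimal sketch. It assumes the (136,2) array is simply two 68-point faces stacked along the first axis, which is my guess from the shapes in the error. The helper `split_faces` and the `mean_face` stand-in are hypothetical names and not part of the repository, and I don't know which exact operation at that line triggers the error.

```python
import numpy as np

def split_faces(landmarks, n_points=68):
    """Split stacked landmarks of shape (k * n_points, 2) into (k, n_points, 2).

    Hypothetical helper: assumes faces are concatenated along axis 0.
    """
    return landmarks.reshape(-1, n_points, 2)

# Synthetic data standing in for two faces detected in the same frame.
face_a = np.random.rand(68, 2) * 100        # face near the top-left
face_b = np.random.rand(68, 2) * 100 + 500  # face far away from face_a
stacked = np.concatenate([face_a, face_b], axis=0)  # shape (136, 2)

mean_face = np.zeros((68, 2))  # stand-in for any single-face reference array

try:
    stacked - mean_face  # element-wise op between (136, 2) and (68, 2)
except ValueError as err:
    print("broadcast failed:", err)

per_face = split_faces(stacked)  # shape (2, 68, 2), one entry per face
print(per_face.shape)
```

Splitting like this avoids the broadcast error, but it does not say which face is which across frames, so smoothing over time would still mix the two faces.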
How do you handle this case? Perhaps by tracking each face separately and writing one .avi video per face, stored as `vidname_{0}.avi, vidname_{1}.avi, vidname_{2}.avi`, etc.?
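A rough sketch of the per-face tracking I have in mind is below. It is a simple greedy nearest-centroid matcher, not the method used in this repository. The function `assign_tracks`, the `max_dist` threshold, and the input layout (one list of (68,2) arrays per frame) are all assumptions made for illustration.

```python
import numpy as np

def assign_tracks(frames, max_dist=50.0):
    """Greedy nearest-centroid tracking (illustrative only).

    frames: list over frames; each item is a list of (68, 2) landmark arrays.
    Returns a dict mapping track_id -> list of (frame_idx, landmarks).
    """
    tracks = {}       # track_id -> list of (frame_idx, landmarks)
    last_centre = {}  # track_id -> centroid of the most recent landmarks
    for t, faces in enumerate(frames):
        free = set(last_centre)  # tracks not yet matched in this frame
        for lm in faces:
            centre = lm.mean(axis=0)
            best = min(
                free,
                key=lambda k: np.linalg.norm(last_centre[k] - centre),
                default=None,
            )
            if best is None or np.linalg.norm(last_centre[best] - centre) > max_dist:
                best = len(tracks)  # no close track available: start a new one
                tracks[best] = []
            else:
                free.discard(best)  # each track takes at most one face per frame
            tracks[best].append((t, lm))
            last_centre[best] = centre
    return tracks

# Two synthetic faces whose detection order swaps in the second frame.
a = np.full((68, 2), 100.0)
b = np.full((68, 2), 500.0)
tracks = assign_tracks([[a, b], [b + 2, a + 1], [a + 2, b + 3]])

# One output name per track, following the naming idea above.
names = [f"vidname_{k}.avi" for k in sorted(tracks)]
print(names)
```

Each track could then be smoothed and cropped independently, so the smoothing never mixes landmarks from different faces.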
