This repository was archived by the owner on Oct 31, 2023. It is now read-only.
Hello,
Thank you for the great repo.
I've been trying to use this on a multi-view dataset and I'm having trouble getting the network to converge on good results.
The data I'm training on comes from ~20-30 synced cameras (depending on how many COLMAP finds during SfM) set up semi-evenly around a room. The cameras are static, but the scene is dynamic, albeit slow-moving. I modified the data loading to take a JSON that contains frames from each camera. When building the training set, I assumed that the order in which images are loaded is the order in which the model expects frames in time. Frames are picked sequentially from each camera, e.g., if there are 30 cameras and 150 frames, camera 1 will contribute frames 1, 31, 61, 91, etc.
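To make the ordering assumption concrete, here's a rough sketch of the interleaving I built (illustrative only, not the repo's actual loader; `N_CAMS` and `N_TIMESTEPS` are placeholders for my setup):

```python
# Hypothetical sketch of the frame-ordering assumption described above:
# training image i (0-based) is the frame captured by camera (i % N_CAMS)
# at timestep (i // N_CAMS), i.e. all cameras at t=0, then all at t=1, etc.
N_CAMS = 30
N_TIMESTEPS = 5  # e.g. 150 total images across 30 cameras

order = []  # global training order as (camera_index, timestep) pairs
for t in range(N_TIMESTEPS):
    for cam in range(N_CAMS):
        order.append((cam, t))

# Camera 0 (the first camera) then contributes training positions
# 0, 30, 60, 90, ... (1, 31, 61, 91, ... in 1-based numbering).
positions_cam0 = [i for i, (cam, _) in enumerate(order) if cam == 0]
print(positions_cam0)
```

If the model instead expects all frames from one camera before moving to the next, that would explain some of the artifacts I'm seeing, so confirming the expected ordering would help.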
I've gotten the network to run and train on the dataset, and the outputs are recognizable, but there are a lot of artifacts. Any help building intuition, or advice on how to improve the quality of the outputs, would be much appreciated.
Original image:

Outputs after 250k iterations: