Thanks for your work!
I do not load Whisper and leave all ASR inputs empty, then I try to run demo_video.py using only upload_vid, as follows:
video = "examples/videos/v_-EIsT868Trw.mp4"
text_input = "What is the woman doing?"
input_split = "0 3.1\n3 11.5\n11.5 24.2\n24.2 45"
chat_state = CONV_VISION_MS.copy()
upload_vid(video, text_input, chat_state, temperature=1.0, input_splits=input_split)
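For reference, this is how I read the input_splits string: one "start end" pair in seconds per line. The sketch below is only my assumption about the format, and parse_splits and find_overlaps are hypothetical helpers, not functions from the repository:

```python
def parse_splits(text):
    """Parse 'start end' pairs (in seconds), one per line, into float tuples."""
    segments = []
    for line_no, line in enumerate(text.strip().splitlines(), start=1):
        parts = line.split()
        if len(parts) != 2:
            raise ValueError(f"line {line_no}: expected 'start end', got {line!r}")
        start, end = float(parts[0]), float(parts[1])
        if end <= start:
            raise ValueError(f"line {line_no}: end {end} is not after start {start}")
        segments.append((start, end))
    return segments


def find_overlaps(segments):
    """Return index pairs (i, i + 1) where a segment starts before the previous one ends."""
    return [
        (i, i + 1)
        for i in range(len(segments) - 1)
        if segments[i + 1][0] < segments[i][1]
    ]


# Same string as in the call above.
input_split = "0 3.1\n3 11.5\n11.5 24.2\n24.2 45"
segments = parse_splits(input_split)
overlaps = find_overlaps(segments)
print(segments)
print(overlaps)
```

With my string, the first two segments overlap slightly (0 to 3.1 and 3 to 11.5). I do not know whether that matters to the model.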
But it fails:
1. I get this error:
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
2. If I set do_sample=False in "self.model.llama_model.generate", the error above goes away, but the "summary" becomes:
examples/videos/v_-EIsT868Trw.mp4 <unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk>
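The error message in item 1 names three conditions: inf, nan, or an element below zero. Below is a small sketch for checking which of them occurs, assuming the probabilities or logits can be exported to a flat list of floats (for a PyTorch tensor, something like tensor.flatten().tolist()). count_invalid is a hypothetical helper, not part of the repository:

```python
import math


def count_invalid(values):
    """Count NaN, infinite, and negative entries in a flat iterable of floats."""
    counts = {"nan": 0, "inf": 0, "negative": 0}
    for v in values:
        if math.isnan(v):
            counts["nan"] += 1
        elif math.isinf(v):
            # Covers both +inf and -inf.
            counts["inf"] += 1
        elif v < 0:
            counts["negative"] += 1
    return counts


# Made-up values for illustration only, not taken from the real model.
example = [0.2, float("nan"), 0.5, float("inf"), -0.1]
result = count_invalid(example)
print(result)
```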
I can't solve this problem. Could you help me test my own video without Whisper?