After several unsuccessful attempts at fine-tuning where the output was a still frame

Default model seems to output only noise or greenscreen about text-to-video-finetuning HOT 6 OPEN

patrickjonesdotca commented on August 15, 2024

Default model seems to output only noise or greenscreen

from text-to-video-finetuning.

Comments (6)

patrickjonesdotca commented on August 15, 2024

I also tried !python /content/Text-To-Video-Finetuning/inference.py --model /content/Text-To-Video-Finetuning/models/model_scope_diffusers --prompt "cat in a space suit" and had the same output

from text-to-video-finetuning.

ExponentialML commented on August 15, 2024

Hey there. After training, are you pointing to the trained model?

By default, it should be placed at the script root under ./outputs/train_<date>

from text-to-video-finetuning.

dvschultz commented on August 15, 2024

What are you trying to view the video in? I’ve found there’s something weird about the codec sometimes and it needs to be viewed in an application like VLC

from text-to-video-finetuning.

patrickjonesdotca commented on August 15, 2024

Hey there. After training, are you pointing to the trained model?

By default, it should be placed at the script root under ./outputs/train_<date>

Yes I did try the trained model. Trained two different ones in fact.
And then I thought I would do a sanity check and try to generate an image with the installed "base" model and filed this report.

Am I trying to generate an image correctly immediately after install with this line? !python /content/Text-To-Video-Finetuning/inference.py --model /content/Text-To-Video-Finetuning/models/model_scope_diffusers --prompt "cat in a space suit" because if that command is incorrect I've been on the wrong track.

from text-to-video-finetuning.

polyware-ai commented on August 15, 2024

If you have lots of videos you might need to train it for longer. How many steps did you train it and how many videos? 2500 is not enough if you are doing hundreds of videos with different prompts each.

from text-to-video-finetuning.

patrickjonesdotca commented on August 15, 2024

If you have lots of videos you might need to train it for longer. How many steps did you train it and how many videos? 2500 is not enough if you are doing hundreds of videos with different prompts each.

I was using images actually to train the model and there were about a dozen of them. I went the opposite way.

But, the problem as I see it is that one should be able to generate a clip with the inference model before running a training session. I ran into issues with that as well, hence this (possibly errant) bug report.

from text-to-video-finetuning.

Recommend Projects

Default model seems to output only noise or greenscreen about text-to-video-finetuning HOT 6 OPEN

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent