Describe the bug after pip install scre

same here with windows10 Edit: I solved this issue by <code class="n

runtime error when executing the default example about screenai HOT 9 OPEN

NuiMrme commented on August 22, 2024

runtime error when executing the default example

from screenai.

Comments (9)

Yingrjimsch commented on August 22, 2024 10

same here with windows10

Edit: I solved this issue by pip uninstall zetascale and reinstall with pip install zetascale In my case it installed an ancient version 0.9.xyz and after I installed the newest version 2.2.7 it worked

@kyegomez maybe it would be good to update the README example with the actual example from the example.py after solving this issue I got more issue because

there was no num_tokens defined
there was no max_seq_len defined
image and text were not initialized with the right dimensions

Another question I've got is, how did you choose num_tokens and max_seq_len?

from screenai.

github-actions commented on August 22, 2024

Hello there, thank you for opening an Issue ! 🙏🏻 The team was notified and they will get back to you asap.

from screenai.

DevChrisRoth commented on August 22, 2024

Got that same issue on a Mac M1

from screenai.

emarashliev commented on August 22, 2024

Same here Intel Mac

from screenai.

carlitose commented on August 22, 2024

Same with mac M2

from screenai.

zhaixiaowai commented on August 22, 2024

Same with windows11&wsl

from screenai.

github-actions commented on August 22, 2024

Stale issue message

from screenai.

MElmardi commented on August 22, 2024

Same with Linux Ubuntu 24 LTS

from screenai.

RokiRan commented on August 22, 2024

After my modifications, I got a working code, and I hope it solves your problem.

import torch
from screenai.main import ScreenAI

# 创建图像张量
image = torch.rand(1, 3, 224, 224)

# 创建 ScreenAI 模型的实例
model = ScreenAI(
    num_tokens=2000,
    max_seq_len=1024,
    patch_size=16,
    image_size=224,
    dim=512,
    depth=6,
    heads=8,
    vit_depth=4,
    multi_modal_encoder_depth=4,
    llm_decoder_depth=4,
    mm_encoder_ff_mult=4,
)

# 假设您的文本已经被转换为词索引，这里我们使用随机整数来模拟
# num_tokens 是您的词汇表大小，max_seq_len 是模型能够处理的最大序列长度
text_indices = torch.randint(0, model.num_tokens, (1, model.max_seq_len))

# 将文本索引张量转换为长整型张量
text = text_indices.long()

# 使用给定的文本和图像张量进行模型的正向传播
out = model(text, image)

# 打印输出张量的形状
print(out)

from screenai.

runtime error when executing the default example about screenai HOT 9 OPEN

Comments (9)

Related Issues (3)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent