🐛 Describe the bug I tried to compile my model and got the follow

wrap_fx_proxy_cls doesn't cover torch.cuda.Stream (torchdynamo issue, closed)

kehuanfeng commented on July 16, 2024

from torchdynamo.

Comments (3)

yanboliang commented on July 16, 2024

The forward of my model does contain an explicit CUDA memory copy operation, which is why it needs to retrieve the CUDA stream.

Your code indirectly calls these ops, which don't return a Tensor. I don't think it's hard to support these functions. I can take a look if you have a repro.

kehuanfeng commented on July 16, 2024

@yanboliang Thanks for the reply. You can use the following mini repro. Additionally, there should be a similar request for torch.cuda.device.

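import torch
import torch.nn as nn
import torch._dynamo as dynamo
# Imported in the original repro but not used below.
from torch.nn.parallel._functions import _get_stream

class CopyModel(nn.Module):
    def __init__(self):
        super().__init__()

    def forward(self, input, device):
        # Create a stream on the target device and run the host-to-device copy on it.
        self.copy_stream = torch.cuda.Stream(device)
        with torch.cuda.stream(self.copy_stream):
            return input.cuda()

# Eager run. Device index 1 means this needs at least two GPUs.
model = CopyModel()
output = model(torch.rand(1000), 1)
print(output.get_device())

# Same call through dynamo with the 'eager' backend; this is the path the issue is about.
opt_model = dynamo.optimize('eager')(model)
output = opt_model(torch.rand(1000), 1)
print(output.get_device())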
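
For anyone trying this on a machine with fewer than two GPUs, a guarded variant of the repro might look roughly like the sketch below. The helper names `pick_copy_device` and `run_repro` are made up for illustration and are not part of PyTorch. The sketch also passes the device to `.cuda()` explicitly, which is a small departure from the repro above, and whether the dynamo call succeeds depends on the PyTorch build in use.

```python
from typing import Optional


def pick_copy_device(device_count: int) -> Optional[int]:
    """Return the highest CUDA device index, or None when there is no GPU.

    Hypothetical helper for this sketch; not a PyTorch API.
    """
    return device_count - 1 if device_count > 0 else None


def run_repro() -> None:
    # torch is imported lazily so the helper above stays usable without it.
    import torch
    import torch.nn as nn
    import torch._dynamo as dynamo

    device = pick_copy_device(torch.cuda.device_count())
    if device is None:
        print("No CUDA device found; skipping.")
        return

    class CopyModel(nn.Module):
        def forward(self, x, device):
            # Do the host-to-device copy on a dedicated stream.
            copy_stream = torch.cuda.Stream(device)
            with torch.cuda.stream(copy_stream):
                # Assumption: pass the device explicitly instead of relying
                # on the stream context to select it.
                return x.cuda(device)

    model = CopyModel()
    print("eager:", model(torch.rand(1000), device).get_device())

    # Same call through dynamo with the 'eager' backend, as in the repro.
    opt_model = dynamo.optimize("eager")(model)
    print("dynamo:", opt_model(torch.rand(1000), device).get_device())
```

Calling `run_repro()` prints the device index from the eager and the dynamo run, so the two can be compared on whatever hardware is available.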

kehuanfeng commented on July 16, 2024

Closing this, as it works with the PyTorch nightly build.
