Comments (2)
I think the distinction here is "supported by software" (i.e. emulation) vs. "supported by hardware". `torch.cuda.is_bf16_supported()` returning False tells you that your GPU hardware has no native bf16 instructions, but software can easily emulate some bf16 operations by shifting the input values to the left and then running the computation in float32; it will just be slower.

Thanks for your answer! I tried bfloat16 mixed-precision training on a V100 GPU, and the time cost is almost the same as full fp32 training (even a little slower).
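The "shift left, compute in float32" emulation works because bfloat16 is simply the top 16 bits of an IEEE-754 float32 (same sign bit and 8-bit exponent, with the mantissa truncated to 7 bits). A minimal pure-Python sketch of that bit-level relationship, using only the standard `struct` module (the helper names here are illustrative, not a PyTorch API):

```python
import struct

def f32_to_bf16_bits(x: float) -> int:
    """Truncate a float32 to its top 16 bits: bfloat16 storage."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    return bits >> 16

def bf16_bits_to_f32(b: int) -> float:
    """Emulate a bf16 load: shift the 16 stored bits left back into a float32."""
    (x,) = struct.unpack("<f", struct.pack("<I", b << 16))
    return x

# 1.0 survives the round trip exactly (its mantissa fits in 7 bits).
assert bf16_bits_to_f32(f32_to_bf16_bits(1.0)) == 1.0

# 3.14159 loses low mantissa bits: the round trip is close, not exact.
approx = bf16_bits_to_f32(f32_to_bf16_bits(3.14159))
assert abs(approx - 3.14159) < 0.01
```

This also illustrates why the V100 result above is unsurprising: the emulated path still does all arithmetic in float32, so bf16 autocast on hardware without native bf16 units saves memory bandwidth at best and adds conversion overhead, rather than speeding up compute.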
Related Issues (20)
- [CUDA][Complex] `test_reference_numerics_large_jiterator_unary_cuda_complex64` broken after updating to `numpy >= 1.25.0` HOT 3
- DISABLED test_unused_output (__main__.TestAutogradWithCompiledAutograd) HOT 1
- DISABLED test_type_conversions (__main__.TestAutogradWithCompiledAutograd) HOT 1
- [NT] Implementing Multi-Head Attention with NestedTensors HOT 1
- torch.inference_mode documentation not available HOT 1
- MaxPool2D memory leakage on device MPS HOT 2
- DISABLED test_perfect_match_on_sequence_and_bool_attributes (__main__.TestFxToOnnx) HOT 2
- DISABLED test_inplace_grad_update (__main__.TestCompiledAutograd) HOT 1
- DISABLED test_var_mean_differentiable (__main__.TestAutogradWithCompiledAutograd) HOT 3
- torch.uniform_() is single-threaded on CPU HOT 1
- Strange behavior of randint using device=cuda
- DISABLED test_issue106555 (__main__.TestCompiledAutograd) HOT 2
- DISABLED test_variable_traverse (__main__.TestAutogradWithCompiledAutograd) HOT 3
- ROCm: `fatal error: aotriton/flash.h: No such file or directory` when building with `USE_ROCM=1`
- torch.Library can easily cause segfault on loading/unloading HOT 1
- [Inductor] [Distributed] DDP torch.compile model hangs on exit (python 3.8/3.9) HOT 1
- torch.no_grad() is not working for dynamo inductor backend HOT 2
- Improved strategy for dealing with deterministically flaky tests which are order sensitive HOT 2
- DISABLED test_bmm_multithreaded (__main__.TestTorch) HOT 1
- omp.h not found on macOS during install from source at github repo HOT 1