Git Product home page Git Product logo

tensorrt_tutorial's People

Contributors

litleo avatar moyanzitto avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tensorrt_tutorial's Issues

如何处理网络模型中的BatchNorm层?

我想用TensorRT来加速YOLO v2模型,但是YOLO v2模型中包含有BatchNorm层,程序会在ICudaEngine* engine = builder->buildCudaEngine(*network)这一步中断报错,官方文档中指的Batch Normalization可以用Scale层代替,具体应该怎么做呢?

tensorrt 7.2.2.3版本

对于大通道,比如1024个channels的网络,比如最简单的 单层网络

class conv_1024_1024_33_d1(torch.nn.Module):
def init(self):
self.in_channel = 1024
self.out_channel = 1024
self.kernel_w = 3
self.kernel_h = 3
super(conv_1024_1024_33_d1, self).init()
self.conv = torch.nn.Sequential(
OrderedDict([("conv1", torch.nn.Conv2d(self.in_channel, self.out_channel, self.kernel_w, 1, 1))])
)

def forward(self, x):
    return self.conv(x)

用这个pytorch转onnx,然后调用trt的execute接口跑出来的耗时结合这层的计算量计算出来的算力(模型计算量/耗时)大于平台标称算力,这是由于trt加速的原因还是什么?有大神知道的么?

cudnn int8 demo问题

cudnn 的卷积INT8加速,在demo中,他的这个代码有点小错误,cudnn cudnnConvolutionForward INT8输入要求是4的倍数,...需要怎么改动才能成功运行??

CUDA_R_32I was not declared

有没有碰到多CUDA_R_32I找不到的情况。
我的cuda 是8.0.27的

在编译的时候报了这个错误

还有CUDA_R_32I这个是在哪个文件里定义的知道吗

fatal error: cuda.h: 没有那个文件或目录

安装tensorrt3的时候出现这个问题,我按照网上的教程在bashrc里面添加:export PATH=/usr/local/cuda-9.0/bin:$PATH 环境变量,并source ~/.bashrc.
但是依旧报错,求帮助

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.