Git Product home page Git Product logo

Comments (13)

dyfcalid avatar dyfcalid commented on May 27, 2024 1

So far everything works fine with our tests, and we’ll test on other machines later to see if we can reproduce your problem.

from persformer_3dlane.

ChonghaoSima avatar ChonghaoSima commented on May 27, 2024

Hello, just try to reproduce your problem, how much memory do you have in your machine? And after how many epoch does it happen?

from persformer_3dlane.

liuzili97 avatar liuzili97 commented on May 27, 2024

thanks for your reply. The memory is 256GB, and the memory problem happens in the first epoch.

I can observe in htop that the memory usage is gradually increasing from around 20GB to 200+GB, and then the process crashes. It takes about 10~20 minutes from the start of training.

from persformer_3dlane.

ChonghaoSima avatar ChonghaoSima commented on May 27, 2024

could you provide more information about your machine? such as pytorch version, cuda version, python version, etc. We didn't go into a memory leak when we train on 4-3090 with 128GB memory.

from persformer_3dlane.

liuzili97 avatar liuzili97 commented on May 27, 2024

my python version, cuda version, and PyTorch version are 3.8.13, 11.1, and 1.8.1.

I will train the model on other machines later and see if the problem still exists.

from persformer_3dlane.

liuzili97 avatar liuzili97 commented on May 27, 2024

it seems the same problem still exists on other machines with python version 3.6.13.

from persformer_3dlane.

dyfcalid avatar dyfcalid commented on May 27, 2024

Maybe you can pull the latest code and have a try to see if the problem still exists.

from persformer_3dlane.

liuzili97 avatar liuzili97 commented on May 27, 2024

Maybe you can pull the latest code and have a try to see if the problem still exists.

Thanks, I pull the latest code but the problem still exists.

Maybe it is caused by some unexpected environment problem. I would close this issue. If anyone else encounters this issue in the future, we may re-open this issue again.

from persformer_3dlane.

nickle-fang avatar nickle-fang commented on May 27, 2024

@liuzili97 I have met the same problem and my environment settings are the same as yours. Have you solved the problem?

from persformer_3dlane.

liuzili97 avatar liuzili97 commented on May 27, 2024

@liuzili97 I have met the same problem and my environment settings are the same as yours. Have you solved the problem?

No, I haven't

from persformer_3dlane.

asadnorouzi avatar asadnorouzi commented on May 27, 2024

I also have the same issue! It consumes almost 99% of my system memory and crashes even before training starts (after loading dataset). I reported it in a separate issue here: #33

from persformer_3dlane.

casialixiaodong avatar casialixiaodong commented on May 27, 2024

could you provide more information about your machine? such as pytorch version, cuda version, python version, etc. We didn't go into a memory leak when we train on 4-3090 with 128GB memory.

Thanks for your perfect work. Would you like to tell me the gcc --version of your environment with your 4-3090? My Server is 8-3090+CUDA11.1+pytorch1.8.0+gcc version 10.3.0 (Ubuntu 10.3.0-1ubuntu120.10), but I can't solve the problem in ‘INSTALL.md’ section when "cd models/nms/ --> python setup.py install"

from persformer_3dlane.

ChonghaoSima avatar ChonghaoSima commented on May 27, 2024

from persformer_3dlane.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.