Comments (13)
So far everything works fine with our tests, and we’ll test on other machines later to see if we can reproduce your problem.
from persformer_3dlane.
Hello, we are trying to reproduce your problem. How much memory does your machine have? And after how many epochs does it happen?
Thanks for your reply. The machine has 256GB of memory, and the problem happens in the first epoch.
In htop I can see the memory usage gradually increasing from around 20GB to 200+GB, and then the process crashes. This takes about 10~20 minutes from the start of training.
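For anyone trying to narrow down a growth pattern like the one described above, it can help to log the resident memory of the training process at regular steps instead of watching htop. The following is a minimal sketch using only the standard library on a Unix-like system; `current_rss_mb` and `log_memory` are hypothetical helper names, not part of this repository.

```python
import os
import sys


def current_rss_mb() -> float:
    """Return this process's resident memory in MiB (best effort)."""
    try:
        # Linux: the second field of /proc/self/statm is the resident page count.
        with open("/proc/self/statm") as f:
            resident_pages = int(f.read().split()[1])
        return resident_pages * os.sysconf("SC_PAGE_SIZE") / 2**20
    except (OSError, ValueError, IndexError):
        # Fallback for other Unix systems: peak RSS instead of current RSS.
        # ru_maxrss is reported in KiB on Linux and in bytes on macOS.
        import resource

        peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
        return peak / 2**20 if sys.platform == "darwin" else peak / 1024


def log_memory(step: int, every: int = 100) -> None:
    """Print resident memory every `every` steps (hypothetical helper)."""
    if step % every == 0:
        print(f"step {step}: RSS = {current_rss_mb():.1f} MiB")


# Example: call log_memory(step) once per iteration of a training loop.
for step in range(3):
    log_memory(step, every=1)
```

Note that this only measures the calling process. If data loading runs in separate worker processes, their memory shows up in htop but not in this number, so a flat reading here alongside growing total usage would point toward the workers.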
Could you provide more information about your machine, such as the PyTorch, CUDA, and Python versions? We did not run into a memory leak when training on four 3090 GPUs with 128GB of memory.
My Python, CUDA, and PyTorch versions are 3.8.13, 11.1, and 1.8.1, respectively.
I will train the model on other machines later and see if the problem still exists.
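When comparing environments across machines, a small script that collects the relevant versions in one place avoids typos and omissions. This is a sketch only; `environment_report` is a hypothetical helper, and it assumes `gcc` is on the PATH if a compiler version is wanted. It avoids newer `subprocess` arguments so it also runs on Python 3.6.

```python
import platform
import subprocess


def environment_report() -> dict:
    """Collect Python, PyTorch, CUDA and gcc versions (hypothetical helper)."""
    info = {"python": platform.python_version()}

    try:
        import torch

        info["torch"] = torch.__version__
        # CUDA version PyTorch was built against; None for CPU-only builds.
        info["cuda (torch build)"] = torch.version.cuda
    except ImportError:
        info["torch"] = "not installed"

    try:
        result = subprocess.run(
            ["gcc", "--version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            universal_newlines=True,
            check=True,
        )
        lines = result.stdout.splitlines()
        info["gcc"] = lines[0] if lines else "unknown"
    except (OSError, subprocess.CalledProcessError):
        info["gcc"] = "not found"

    return info


for name, value in environment_report().items():
    print(f"{name}: {value}")
```

Pasting this output into the issue gives the maintainers the same fields they asked about, plus the compiler version, which matters for building the extension under models/nms/.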
It seems the same problem still exists on other machines with Python 3.6.13.
Maybe you can pull the latest code and try again to see if the problem still exists.
Thanks, I pulled the latest code but the problem still exists.
Maybe it is caused by some unexpected environment problem. I will close this issue for now. If anyone else encounters it in the future, we can re-open it.
@liuzili97 I have run into the same problem, and my environment settings are the same as yours. Have you solved it?
No, I haven't.
I have the same issue! It consumes almost 99% of my system memory and crashes even before training starts (after loading the dataset). I reported it in a separate issue here: #33
Thanks for your great work. Could you tell me the output of gcc --version in your environment with four 3090 GPUs? My server has eight 3090 GPUs, CUDA 11.1, PyTorch 1.8.0, and gcc version 10.3.0 (Ubuntu 10.3.0-1ubuntu120.10), but I can't get past the step in INSTALL.md where you run "cd models/nms/" and then "python setup.py install".