Git Product home page Git Product logo

oniroai / semantic-segmentation-with-mobilenetv3 Goto Github PK

View Code? Open in Web Editor NEW
59.0 4.0 14.0 55 KB

TensorFlow (Keras) implementation of MobileNetV3 and its segmentation head

License: GNU General Public License v3.0

Dockerfile 1.19% Makefile 0.60% Python 72.08% Jupyter Notebook 26.13%
deep-learning tensorflow mobilenetv3 segmentation computer-vision cnn neural-network cnn-segmentation semantic-segmentation person-segmentation

semantic-segmentation-with-mobilenetv3's People

Contributors

dependabot[bot] avatar nikolasent avatar voeykovroman avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

semantic-segmentation-with-mobilenetv3's Issues

Train with GPU

Thanks for the great work!
Your model can do inference with very high fps!

I am trying to train model with your train code. And I found the process is very slow and does not use GPU.
I read your code and I found the line where GPU is enabled is commented out.

    def fit(self, best_on_val=True):
        # with tf.device(self.device):
  1. Why did you comment out the GPU line?
  2. I have modified the line to enable the GPU, but the process doesn't become fast. (GPU usage is about 10% or so with RTX 2080Ti). Do you have any idea about it?

Question on the processing

Hi there! Amazing repo you have here. I already tried both training and inferencing but i have few question to ask.

Do you have the image prediction output for this repository that you already tried before?
Cuz i already tried both training and inferencing using this repo, but the result was really bad.
I trained with 356 Image-Mask data.

For _dataset.py, why there are a lot of complicated process like ImageTargetDataset, RandomConncatDataset etc.?
Why can't just use normal image loader practice like in keras or maybe just load images-mask using opencv and append them on list instead of appending their path?

in vis_dataset(), why i can only visualize the Image data and not the Target(Mask)? I tried but errors occurred. How can i know the images have correct corresponding mask?

in Testing Image section at train-mobilenet.ipynb, what exactly would be the output? When i tried, it don't even display/show the prediction mask. It outputs back the original test image. When i try to output "out_img" variable, noisy images were displayed.

Can't load pretrained model

Hi, thanks for the project. It's really interesting to play and learn from it. However I got ValueError: Shapes (1, 1, 24, 2) and (1, 1, 24, 1) are incompatible when trying to load the pretrained model. Seems like it has some problem while loading weights from the model. I also noticed that you mentioned the pretrained model was trained with TF2.0. Could that be the cause of this issue? I am with TF2.3 btw.

question about atrous convolution

I took a look at a bunch of segmentation based mv3 repos on github. Most including yours did not implement the atrous convolution on the final convolution block unlike in the paper. Any particular reason for this?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.