hands-on-computer-vision-with-tensorflow-2's Issues

Error in Chapter05 ch5_nb1_yolo_inference.ipynb

boxes, scores, classes, nums = yolo(input_img)


logging.info('detections:')
for i in range(nums[0]):
    print('\t{}, {}, {}'.format(class_names[int(classes[0][i])],
                                np.array(scores[0][i]),
                                np.array(boxes[0][i])))

prediction_img = draw_outputs(img.numpy(), (boxes, scores, classes, nums), class_names)
plt.figure(figsize=(10, 20))
plt.imshow(prediction_img)
File "D:/PyCharm/YOLO/main.py", line 31, in <module>
    for i in range(nums[0]):
TypeError: 'Tensor' object cannot be interpreted as an integer
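The traceback points at `range(nums[0])`: `range()` needs a Python integer, but `nums[0]` is a tensor (this typically happens when the code runs in graph mode or on a non-eager tensor). A minimal sketch of the likely fix, using a NumPy array to stand in for the tensor — unwrap the scalar with `int(...)` (or `.numpy()` on a TF eager tensor) before passing it to `range()`:

```python
import numpy as np

# Stand-in for the `nums` value returned by yolo(); on a TF eager
# tensor, int(nums[0]) or nums[0].numpy() performs the same unwrap.
nums = np.array([3])

# range() requires a Python int, so convert the scalar first.
detections = list(range(int(nums[0])))
print(detections)
```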

Training YoloV3

When training a TF 2.0 YoloV3 model, Chapter 5 mentions training with VGG16 or another pretrained network as a backbone after the head is deactivated. How do you format the y_true value, i.e. (batch_size, grid, grid, anchors, (x1, y1, x2, y2, obj, cls))? Batch size is obvious, but I wanted to see an example of how the rest would look and how I should format the data for use in a model.fit operation, for example. I have annotated the images and have the x and y values for the bounding boxes, and obviously the class, but I'm not sure how to supply the obj value (and all the other values when transitioning from the backbone to the YoloV3 head). Is obj (objectness) always going to be 1 for y_true, since there is always an object in the bounding box for y_true? Is the class simply the corresponding text label used in the coco.names file, or some other format? Is there a way to view a full example of the proposed implementation, i.e. VGG then YoloV3, for example?

loss= nan on running nb5 & nb6 from Chapter6

I have been trying to debug this for a few days to no avail.
I have been running the notebooks as they are.
I'm getting
Epoch 0/90: loss = nan; v-loss = nan; acc = 0.389; v-acc = 0.366; mIoU = 0.020; v-mIoU = 0.019
as output from training and can't figure out what the problem is.
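Without stepping through the notebook it's hard to say for sure, but a common source of loss = nan in segmentation training is log(0) when a predicted probability saturates to exactly zero. A hypothetical NumPy illustration (not the notebook's code) of the failure mode and an epsilon-clipping guard:

```python
import numpy as np

# A saturated per-pixel prediction: all mass on the wrong class.
probs = np.array([1.0, 0.0, 0.0])
target = np.array([0.0, 1.0, 0.0])   # one-hot ground truth

# Naive cross-entropy hits log(0) and the sum becomes nan.
with np.errstate(divide='ignore', invalid='ignore'):
    naive = -(target * np.log(probs)).sum()

# Clipping probabilities away from 0 keeps the loss finite.
eps = 1e-7
safe = -(target * np.log(np.clip(probs, eps, 1.0))).sum()

print(np.isnan(naive), np.isfinite(safe))
```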

Unable to prepare cityscape-dataset

Hi,
I have the same issue as previous "Unable to prepare cityscape-dataset #8" on Jun 29, 2019.

I have verified the environment variable has been set by:
import os
print(os.environ['CITYSCAPES_DATASET'])
It returned
C:\Users\c5911\tensorflow_datasets\cityscapes.

However, the two scripts
!csCreateTrainIdLabelImgs
and
!csViewer (run under a Jupyter notebook)
still returned:
ERROR: Did not find any files. Please consult the README.

I also ran csViewer at the command prompt; it likewise showed "The data was not found".

Under "C:\Users\c5911\tensorflow_datasets\cityscapes", there are two subfolders, getFine and leftImg8bit.

Any idea what could be wrong?

Thanks.
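One thing worth checking, given the folder names listed above: the cityscapesScripts tools expect subfolders named gtFine and leftImg8bit directly under CITYSCAPES_DATASET, and the ground-truth folder is spelled gtFine, not getFine — a misspelled folder would explain "Did not find any files". A small sanity-check sketch (the helper name is hypothetical):

```python
import os

def check_cityscapes_root(root):
    """Return which expected Cityscapes subfolders are missing under root."""
    expected = ('gtFine', 'leftImg8bit')
    return [sub for sub in expected
            if not os.path.isdir(os.path.join(root, sub))]

# Point this at the same path that CITYSCAPES_DATASET holds.
missing = check_cityscapes_root(os.environ.get('CITYSCAPES_DATASET', '.'))
print('missing subfolders:', missing)
```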

ground truth images

What is the shape of the ground-truth NumPy color map for each image? (Chapter 6, "Preparing data for smart car apps".)

Unable to prepare cityscape-dataset

Hi,

running the script (Mac, Jupyter notebook, Python 2), I am unable to prepare the dataset.
I get the message "ERROR: Did not find any files. Please consult the README.", which I cannot interpret.
Running the script step by step in the terminal gives the same problem.
Does anyone have an idea how to solve this?

Searching the internet, I only found this link:
mcordts/cityscapesScripts#73
However, their solution does not work for me.

Unfortunately, without this data, I cannot work further on this chapter of this nice book.

With kind regards

Ch6. nb5 build and train: fcn8s_model.fit, ValueError: Shapes (None, 512, 512, 19) and (None, 512, 512, 1) are incompatible

Hi
I am running ch6. nb5 build and train.
At fcn8s_model.fit, I get a ValueError: Shapes (None, 512, 512, 19) and (None, 512, 512, 1) are incompatible (raised from "C:\Users\c5911\anaconda3\lib\site-packages\tensorflow\python\framework\tensor_shape.py:1134 assert_is_compatible_with
raise ValueError("Shapes %s and %s are incompatible" % (self, other))").
fcn8s_model.summary() showed
conv2d_transpose_2 (Conv2DTrans (None, 512, 512, 19) 92435 add_1[0][0]

Total params: 14,983,177
Trainable params: 14,983,177
Non-trainable params: 0

Why are the shapes incompatible?

Thanks,
Gary

P.S.: I changed blurred from "True" to "False":
visual_val_dataset = cityscapes_input_fn(
    root_folder=CITYSCAPES_FOLDER, resize_to=image_size, batch_size=num_show,
    shuffle=True, num_epochs=1, augment=False, seed=random_seed, blurred=False)
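The shapes in the error suggest the model outputs 19 per-class scores per pixel, while the labels are single-channel integer class maps. One possible fix (an assumption, not necessarily the notebook's intended setup) is to compile with a sparse, integer-label loss such as tf.keras.losses.SparseCategoricalCrossentropy instead of a one-hot categorical loss. A NumPy sketch of how a sparse loss bridges exactly these two shapes:

```python
import numpy as np

# Shapes from the error: labels (batch, H, W, 1) hold integer class ids,
# predictions (batch, H, W, num_classes) hold per-class scores.
rng = np.random.default_rng(0)
num_classes = 19
labels = rng.integers(0, num_classes, size=(2, 4, 4, 1))
logits = rng.normal(size=(2, 4, 4, num_classes))

# Softmax over the class axis.
exp = np.exp(logits - logits.max(axis=-1, keepdims=True))
probs = exp / exp.sum(axis=-1, keepdims=True)

# Sparse categorical cross-entropy: pick the probability of the true
# class per pixel (no one-hot encoding of the labels needed).
picked = np.take_along_axis(probs, labels, axis=-1)  # shape (2, 4, 4, 1)
loss = -np.log(picked).mean()
print(round(float(loss), 3))
```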

how to use the trained model

Hello!

I ran your notebook for Chapter 8 and I would like to use the trained model on new videos. I am aware this may sound like a super basic question, but would you help me use the trained model? How do I save it for later use, and then how would I 'run' it on new videos?

Thanks for your help, and apologies for the basic question,

Veronica.
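A minimal sketch of the save-then-reload workflow, using a toy model rather than the chapter's actual one (the file name is hypothetical). For a video, you would decode frames (e.g. with OpenCV's cv2.VideoCapture), preprocess each frame to the model's input shape, and call predict on frames or batches of frames:

```python
import numpy as np
import tensorflow as tf

# Toy stand-in for a trained model; the chapter's model would be
# whatever model.fit produced in the notebook.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(8,)),
    tf.keras.layers.Dense(2, activation='softmax'),
])

# Save once after training, reload in a later session.
model.save('demo_model.h5')
restored = tf.keras.models.load_model('demo_model.h5')

# Stand-in for a batch of preprocessed video frames.
frame_batch = np.random.rand(4, 8).astype('float32')
preds = restored.predict(frame_batch, verbose=0)
print(preds.shape)
```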

YOLO V3 of Chapter05

I have downloaded the weight file (yolov3.weights) and tried to convert it following the instructions, but the convert.py program did not produce yolov3.tf. Instead, it produced the following files:

yolov3.tf.index (24KB),
yolov3.tf.data-00000-of-00002 (62KB), and
yolov3.tf.data-00001-of-00002 (237MB).

Each number in parentheses is the size of the corresponding file.

I tested the program using Docker (tensorflow:latest-gpu) on Ubuntu 18.04.

Just in case, I attach the screen capture of the log that is written by convert.py.

screen_capture_of_convert_weight.txt
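This looks like expected behavior rather than a failure: a TensorFlow checkpoint is a file prefix, not a single file, so "yolov3.tf" names the .index file plus the .data-* shards together, and loading code passes the same prefix (e.g. load_weights('yolov3.tf')). A small sketch showing the same file pattern with a toy variable and a hypothetical prefix demo.tf:

```python
import glob
import tensorflow as tf

# Saving a checkpoint under the prefix "demo.tf" produces
# demo.tf.index plus demo.tf.data-XXXXX-of-XXXXX shards --
# the same pattern convert.py wrote for yolov3.tf.
ckpt = tf.train.Checkpoint(v=tf.Variable([1.0, 2.0]))
ckpt.write('demo.tf')

files = sorted(glob.glob('demo.tf.*'))
print(files)

# Restoring uses the prefix, never an individual file.
ckpt.restore('demo.tf')
```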
