kanazawanaoaki / cook_eyes Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 2.0 69.73 MB

Implement the eyes of the cook

CMake 1.45% Python 0.84% Common Lisp 97.67% C++ 0.03%

cook_eyes's Introduction

Hi, I'm Naoaki

🌱 I'm a Ph.D. student at The University of Tokyo.
👨‍🎓 My major is mechanical and information engineering.

For more information, please visit my web page below.
https://kanazawanaoaki.github.io/

cook_eyes's People

Contributors

Stargazers

Watchers

Forkers

knorth55 shmpwk

cook_eyes's Issues

Potato 0.84が左上に表示されるだけになってしまった

自作データセットでSSDの学習を試してみたが、左上にPotatoが表示されるだけのものになってしまった...
使ったのは、20200515-214217-one_more_dataset

$ rosrun jsk_perception image_publisher.py _file_name:=FILE_PATH

でtrainに使ったjpg画像をパブリッシュして、認識を試してみると以下のような結果になった。
左上にじゃがいもがくる画像が多かったせいか、ほぼ固定位置にpotato 0.84が表示されるか、何も表示されないかになってしまった。

圧縮トピックを保存したrosbagの点群がカクカクしている

(https://gist.github.com/Kanazawanaoaki/f6b533c8219252f19befe570e449a2ea#file-rosbag_compressed_record-launch )のように圧縮されたrgb画像やdepth画像のrosbagを保存し、
(https://gist.github.com/Kanazawanaoaki/f6b533c8219252f19befe570e449a2ea#file-depth2ptcloud-launch ) のようにして再構成しているが、画像はまだましだけど点群がかなりカクカクしている。

(https://drive.google.com/file/d/1NAkvrk4bqJeafvFwC8-Z4iP47g-Y7T5t/view?usp=sharing )のrosbagを使って行うと、

roslaunch kitchen_recognition depth2ptcloud.launch

rosbag play com_test.bag -l --clock

(https://youtu.be/D6D_ldioWc8 )このような結果になった。

データセットが変で学習出来ない

作成したデータセットを使って学習のスクリプトを実行すると、

Traceback (most recent call last):
  File "create_tf_record.py", line 155, in <module>
    tf.app.run()
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/platform/app.py", line 125, in run
    _sys.exit(main(argv))
  File "create_tf_record.py", line 151, in main
    create_tf_record(root_dir, output_path)
  File "create_tf_record.py", line 140, in create_tf_record
    img_path, class_label_path, instance_label_path, class_names)
  File "create_tf_record.py", line 72, in get_tf_example
    cls_lbl = np.argmax(np.bincount(class_label[inst_mask]))
  File "/usr/local/lib/python2.7/dist-packages/numpy/core/fromnumeric.py", line 1037, in argmax
    return _wrapfunc(a, 'argmax', axis=axis, out=out)
  File "/usr/local/lib/python2.7/dist-packages/numpy/core/fromnumeric.py", line 51, in _wrapfunc
    return getattr(obj, method)(*args, **kwds)
ValueError: attempt to get argmax of an empty sequence

のようなエラーが出てしまう。

./run.sh 20200515-145232-dataset --port 6000 bash
+ cd /tensorflow/models/research/scripts
+ python create_tf_record.py --data_dir=/tensorflow/models/research/20200515-145232-dataset --output_dir=/tensorflow/models/research/20200515-145232-dataset

のようにして確認してみる。
https://gist.github.com/Kanazawanaoaki/53050022fc1a7cfaba87a9c5bb862889
のようにプリント文を入れてlogを見ると、class_label[inst_mask]が空になってしまっているデータが一定数ありそれがエラーを起こしているみたい。

Lossが発散してしまう

密集もしていない、しかもTrainとTestが一緒である簡単なデータセットを作成して学習をしてみたが、上手く行かずLossが発散してしまった。

縦長の画像のアノテーションが上手くできていない

labelmeによってアノテーションを行っているが、縦長の画像に対してのアノテーションが上手く言っていない可能性がある。
very_easy_data/train/382076.jpg

labelme --labels labels.txt very_easy_data/train/382076.jpg

labelme_draw_label_png very_easy_datasets/train/SegmentationClassPNG/382076.png

画像が途中で切れてしまって上手くアノテーションが出来ていない気がする。

SSDの検出結果からBoundingBoxをだす

jsk-ros-pkg/jsk_recognition#2511 を使ってRectArray -> ClusterPointIndicesの変換をしてその出力をhttps://github.com/jsk-ros-pkg/jsk_recognition/blob/master/jsk_pcl_ros/sample/tabletop_object_detector.launch のeuclidean clusteringの部分を置き換える。

aaf54ab#diff-363e68923b115984009fddf2cdcfe078 を作って試して見ましたが、上手くバウンディングボックスが出ませんでした。

実行しているのは、

roslaunch kitchen_recognition tabletop_test_ssd_vegs.launch

roslaunch kitchen_recognition depth2ptcloud.launch gui:=false

https://github.com/Kanazawanaoaki/cook_eyes_coral/tree/master/curry_detector_ros のCoral TPUのSSDのlaunchを走らせる

roslaunch curry_detector_ros edgetpu_vegs_detector.launch INPUT_IMAGE:=/camera/rgb/image_raw

https://drive.google.com/file/d/1PVW_1rERg6YojUP5fFfc3UdM2Mx2U2s3/view?usp=sharing のrosbagを再生する。

rosbag play vegs_test.bag -l

kanazawanaoaki / cook_eyes Goto Github PK

cook_eyes's Introduction

Hi, I'm Naoaki

cook_eyes's People

Contributors

Stargazers

Watchers

Forkers

cook_eyes's Issues

Potato 0.84が左上に表示されるだけになってしまった

圧縮トピックを保存したrosbagの点群がカクカクしている

データセットが変で学習出来ない

Lossが発散してしまう

縦長の画像のアノテーションが上手くできていない

SSDの検出結果からBoundingBoxをだす

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent