
[IMAVIS] Official implementation of "ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation".

License: GNU Affero General Public License v3.0

Topics: breast-cancer-segmentation, cell-detection, cell-segmentation, computer-vision-algorithms, data-science-bowl-2018, deep-learning-framework, deep-neural-networks, detection-segmentation-algorithm, hematoxylin-eosin-staining, histopathology-image-analysis

asf-yolo's Introduction

Official ASF-YOLO

This is the source code for the paper "ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation" published in Image and Vision Computing (IMAVIS), of which I am the first author. The paper is available for download from ScienceDirect or arXiv.

Model

The Attentional Scale sequence Fusion You Only Look Once (ASF-YOLO) model configuration (i.e., network construction) file is asf-yolo.yaml in the directory ./models/segment/.
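
If this fork keeps the standard YOLOv5 entry point in models/yolo.py (an assumption; this README does not state it), the network can be built from that YAML and its layer summary printed with:

python models/yolo.py --cfg models/segment/asf-yolo.yaml  # build the model and print a per-layer summary

The layer listing in the issue log further below (Zoom_cat, ScalSeq, attention_model, etc.) shows the kind of summary this produces.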

Training

The hyperparameter setting file is hyp.scratch-low.yaml in the directory ./data/hyps/; it is passed to segment/train.py via the --hyp flag.

Installation

Install the dependencies in requirements.txt in a Python>=3.8.0 environment, including PyTorch>=1.8.
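
To obtain the code first, something like the following should work (the repository URL is an assumption based on the project page, mkang315/ASF-YOLO):

git clone https://github.com/mkang315/ASF-YOLO.git  # assumed repository URL
cd ASF-YOLO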

pip install -r requirements.txt  # install

Training CLI

python segment/train.py
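
The defaults can be overridden with YOLOv5-style flags; for example, a run matching the configuration shown in the issue log further below (a sketch, assuming standard YOLOv5 flag names):

python segment/train.py --weights yolov5l-seg.pt --cfg models/segment/asf-yolo.yaml --data data/bcc.yaml --hyp data/hyps/hyp.scratch-low.yaml --epochs 100 --batch-size 8 --imgsz 640 --device 0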

Testing CLI

python segment/predict.py
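
Inference flags likewise follow the YOLOv5 convention (a hedged example; the weights path is the usual YOLOv5 output location and may differ on your machine):

python segment/predict.py --weights runs/train-seg/exp/weights/best.pt --source path/to/images --imgsz 640 --conf-thres 0.25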

Evaluation

We trained and evaluated ASF-YOLO on two datasets: the 2018 Data Science Bowl (DSB2018) dataset from Kaggle and the Breast Cancer Cell (BCC) dataset from the Center for Bio-Image Informatics, University of California, Santa Barbara (UCSB CBI). As stated in the paper, yolov5l-seg.pt contains the YOLOv5l weights pretrained on the MS COCO dataset and is used only for initialization; it is not an ASF-YOLO weight trained on the evaluation datasets.
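
To reproduce the evaluation, a YOLOv5-style validation call such as the following should work, assuming this fork keeps segment/val.py (data/bcc.yaml is the dataset config that appears in the issue log further below; the weights path is a placeholder):

python segment/val.py --weights path/to/best.pt --data data/bcc.yaml --imgsz 640  # reports box and mask metrics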

Suggested Citation

Please cite our paper if you use code from this repository:

Plain Text

  • Elsevier Reference Style
    M. Kang, C.-M. Ting, F.F. Ting, R.C.-W. Phan, ASF-YOLO: a novel YOLO model with attentional scale sequence fusion for cell instance segmentation, Image Vis. Comput. 147 (2024) 105057.

  • IEEE Reference Style
    M. Kang, C.-M. Ting, F. F. Ting, and R. C.-W. Phan, "Asf-yolo: A novel yolo model with attentional scale sequence fusion for cell instance segmentation," Image Vis. Comput., vol. 147, 105057, Jul. 2024.

  • Nature Reference Style
    Kang, M., Ting, C.-M., Ting, F. F. & Phan, R. C.-W. ASF-YOLO: a novel YOLO model with attentional scale sequence fusion for cell instance segmentation. Image Vis. Comput. 147 105057 (2024).

  • Springer Reference Style
    Kang, M., Ting, C.-M., Ting, F.F., Phan, R.C.-W.: ASF-YOLO: a novel YOLO model with attentional scale sequence fusion for cell instance segmentation. Image Vis. Comput. 147, 105057 (2024)

BibTeX Format

\begin{thebibliography}{1}
\bibitem{1} M. Kang, C.-M. Ting, F.F. Ting, R.C.-W. Phan, ASF-YOLO: a novel YOLO model with attentional scale sequence fusion for cell instance segmentation, Image Vis. Comput. 147 (2024) 105057.
\end{thebibliography}
@article{Kang24Asfyolo,
  author = "Ming Kang and Chee-Ming Ting and Fung Fung Ting and Rapha{\"e}l C.-W. Phan",
  title = "ASF-YOLO: A novel yolo model with attentional scale sequence fusion for cell instance segmentation",
  journal = "Image Vis. Comput.",
  volume = "147",
  % ieee_fullname.bst
  pages = "105057",
  % IEEEbib.bst
  note = "p. 105057", 
  month = "Jul.",
  year = "2024",
}
@article{Kang24Asfyolo,
  author = "Kang, Ming and Ting, Chee-Ming and Ting, Fung Fung and Phan, Rapha{\"e}l C.-W.",
  title = "ASF-YOLO: a novel YOLO model with attentional scale sequence fusion for cell instance segmentation",
  journal = "Image Vis. Comput.",
  volume = "147",
  pages = "105057",
  publisher = "Elsevier",
  address = "Amsterdam",
  year = "2024",
  doi= "10.1016/j.imavis.2024.105057",
  url = "https://doi.org/10.1016/j.imavis.2024.105057"
}

NOTE: If the LaTeX compiler produces an error, remove some of the optional BibTeX fields, e.g., series, volume, address, and url. Author names may need to be abbreviated manually if they are not abbreviated automatically by the compiler under the control of the applicable .bst file, which defines the bibliography/reference style. The citation key Kang24Asfyolo may be renamed to b1, bib1, ref1, etc., when references appear in the numbered style in which they are cited. The quotation mark pairs "" around field values may be replaced by braces {}.

License

ASF-YOLO is released under the GNU Affero General Public License v3.0 (AGPL-3.0). Please see the LICENSE file for more information.

Copyright Notice

Many utility functions in our project are based on code from the Ultralytics YOLOv5, EIoU, and Soft-NMS repositories.


asf-yolo's Issues

Where is the TFE?

Hello author, where exactly is the TFE module located? Thank you very much!

Adding PointRend to refine the model's segmentation edges

Could I ask how the source code should be modified to add PointRend in order to refine the model's segmentation edges? I am not very familiar with how YOLOv5 or YOLOv8 parses the model configuration to build the Model, so I would like to ask how to add this module and use PointRend to segment the mask edges more finely. Many thanks!

Where is the CPAM?

I noticed the CPAM module in the paper and found that it corresponds to attention_model, but the code uses Add instead. Which one is more useful?

A problem with a figure

Hello author, according to the paper, the TFE module should have three inputs, but the TFE in the lower-left corner of Figure 3 has only two inputs. Have I misunderstood something?

A question about evaluation metrics

Thank you for your outstanding work. I would like to ask about evaluation metrics for the segmentation task: most work in my research direction uses mIoU as the metric, so can this project output mIoU? I would also like to ask whether the small-object segmentation head uses the original segmentation head from v5.

Two questions

When I use yolov5l-seg.pt, it reports "Transferred 602/671 items from yolov5l-seg.pt"; maybe it cannot apply all the items.
Second, when training, it raises an AttributeError at "anchors, shape = self.anchors[i], p[i].shape": 'list' object has no attribute 'shape'.
Hope to get your reply.

Hello author, I have also run into the deterministic-algorithms warning that keeps the model from running, the same problem as in the comments. I am training on four RTX 3090 cards with device set to 0,1,2,3; specifying a single card directly reports a CUDA error. I hope you can offer some advice.

segment/train: weights=/media/dell/lhx/yolo/ASF-YOLO/yolov5l-seg.pt, cfg=/media/dell/lhx/yolo/ASF-YOLO/models/segment/asf-yolo.yaml, data=/media/dell/lhx/yolo/ASF-YOLO/data/bcc.yaml, hyp=/media/dell/lhx/yolo/ASF-YOLO/data/hyps/hyp.scratch-low.yaml, epochs=100, batch_size=8, imgsz=640, rect=False, resume=False, nosave=False, noval=False, noautoanchor=False, noplots=False, evolve=None, bucket=, cache=None, image_weights=False, device=0,1,2,3, multi_scale=False, single_cls=False, optimizer=SGD, sync_bn=False, workers=8, project=../runs_2/train-seg, name=improve, exist_ok=False, quad=False, cos_lr=False, label_smoothing=0.0, patience=100, freeze=[0], save_period=-1, seed=0, local_rank=-1, mask_ratio=4, no_overlap=False
YOLOv5  2024-5-30 Python-3.8.0 torch-2.3.1+cu121 CUDA:0 (NVIDIA GeForce RTX 3090, 24260MiB)
CUDA:1 (NVIDIA GeForce RTX 3090, 24260MiB)
CUDA:2 (NVIDIA GeForce RTX 3090, 24260MiB)
CUDA:3 (NVIDIA GeForce RTX 3090, 24260MiB)

hyperparameters: lr0=0.01, lrf=0.01, momentum=0.937, weight_decay=0.0005, warmup_epochs=3.0, warmup_momentum=0.8, warmup_bias_lr=0.1, box=0.05, cls=0.5, cls_pw=1.0, obj=1.0, obj_pw=1.0, iou_t=0.2, anchor_t=4.0, fl_gamma=0.0, hsv_h=0.015, hsv_s=0.7, hsv_v=0.4, degrees=0.0, translate=0.1, scale=0.5, shear=0.0, perspective=0.0, flipud=0.0, fliplr=0.5, mosaic=1.0, mixup=0.0, copy_paste=0.0
TensorBoard: Start with 'tensorboard --logdir ../runs_2/train-seg', view at http://localhost:6006/
Overriding model.yaml nc=80 with nc=1

             from  n    params  module                                  arguments                     

0 -1 1 7040 models.common.Conv [3, 64, 6, 2, 2]
1 -1 1 73984 models.common.Conv [64, 128, 3, 2]
2 -1 3 156928 models.common.C3 [128, 128, 3]
3 -1 1 295424 models.common.Conv [128, 256, 3, 2]
4 -1 6 1118208 models.common.C3 [256, 256, 6]
5 -1 1 1180672 models.common.Conv [256, 512, 3, 2]
6 -1 9 6433792 models.common.C3 [512, 512, 9]
7 -1 1 4720640 models.common.Conv [512, 1024, 3, 2]
8 -1 3 9971712 models.common.C3 [1024, 1024, 3]
9 -1 1 2624512 models.common.SPPF [1024, 1024, 5]
10 -1 1 525312 models.common.Conv [1024, 512, 1, 1]
11 4 1 132096 models.common.Conv [256, 512, 1, 1]
12 [-1, 6, -2] 1 0 models.common.Zoom_cat [512]
13 -1 3 3019776 models.common.C3 [1536, 512, 3, False]
14 -1 1 131584 models.common.Conv [512, 256, 1, 1]
15 2 1 33280 models.common.Conv [128, 256, 1, 1]
16 [-1, 4, -2] 1 0 models.common.Zoom_cat [256]
17 -1 3 756224 models.common.C3 [768, 256, 3, False]
18 -1 1 590336 models.common.Conv [256, 256, 3, 2]
19 [-1, 14] 1 0 models.common.Concat [1]
20 -1 3 2495488 models.common.C3 [512, 512, 3, False]
21 -1 1 2360320 models.common.Conv [512, 512, 3, 2]
22 [-1, 10] 1 0 models.common.Concat [1]
23 -1 3 9971712 models.common.C3 [1024, 1024, 3, False]
24 [4, 6, 8] 1 460544 models.common.ScalSeq [256]
25 [17, -1] 1 12325 models.common.attention_model [256]
26 [-1, 20, 23] 1 1393558 models.yolo.Segment [1, [[10, 13, 16, 30, 33, 23], [30, 61, 62, 45, 59, 119], [116, 90, 156, 198, 373, 326]], 32, 256, [256, 512, 1024]]
asf-yolo summary: 407 layers, 48465467 parameters, 48465467 gradients, 155.4 GFLOPs

Transferred 602/671 items from /media/dell/lhx/yolo/ASF-YOLO/yolov5l-seg.pt
AMP: checks passed ✅
optimizer: SGD(lr=0.01) with parameter groups 110 weight(decay=0.0), 116 weight(decay=0.0005), 114 bias
WARNING ⚠️ DP not recommended, use torch.distributed.run for best DDP Multi-GPU results.
See Multi-GPU Tutorial at ultralytics/yolov5#475 to get started.
train: Scanning /media/dell/lhx/yolo/ASF-YOLO/datasets/BCC/labels/train.cache... 128 images, 0 backgrounds, 0 corrupt: 100%|██████████| 128/128 00:00
val: Scanning /media/dell/lhx/yolo/ASF-YOLO/datasets/BCC/labels/val.cache... 32 images, 0 backgrounds, 0 corrupt: 100%|██████████| 32/32 00:00

AutoAnchor: 4.36 anchors/target, 0.970 Best Possible Recall (BPR). Anchors are a poor fit to dataset ⚠️, attempting to improve...
AutoAnchor: WARNING ⚠️ Extremely small objects found: 47 of 1235 labels are <3 pixels in size
AutoAnchor: Running kmeans for 9 anchors on 1235 points...
AutoAnchor: Evolving anchors with Genetic Algorithm: fitness = 0.7403: 100%|██████████| 1000/1000 00:00
AutoAnchor: thr=0.25: 0.9571 best possible recall, 6.31 anchors past thr
AutoAnchor: n=9, img_size=640, metric_all=0.391/0.743-mean/best, past_thr=0.495-mean: 25,43, 88,52, 51,155, 92,121, 163,129, 116,183, 236,232, 160,418, 350,452
AutoAnchor: Done ⚠️ (original anchors better than new anchors, proceeding with original anchors)
Plotting labels to ../runs_2/train-seg/improve3/labels.jpg...
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to ../runs_2/train-seg/improve3
Starting training for 100 epochs...

  Epoch    GPU_mem   box_loss   seg_loss   obj_loss   cls_loss  Instances       Size

0%| | 0/16 00:03
Traceback (most recent call last):
File "/media/dell/lhx/yolo/ASF-YOLO/segment/train.py", line 658, in
main(opt)
File "/media/dell/lhx/yolo/ASF-YOLO/segment/train.py", line 554, in main
train(opt.hyp, opt, device, callbacks)
File "/media/dell/lhx/yolo/ASF-YOLO/segment/train.py", line 317, in train
scaler.scale(loss).backward()
File "/home/leihaoxiang/.conda/envs/yolo/lib/python3.8/site-packages/torch/_tensor.py", line 525, in backward
torch.autograd.backward(
File "/home/leihaoxiang/.conda/envs/yolo/lib/python3.8/site-packages/torch/autograd/init.py", line 267, in backward
_engine_run_backward(
File "/home/leihaoxiang/.conda/envs/yolo/lib/python3.8/site-packages/torch/autograd/graph.py", line 744, in _engine_run_backward
return Variable._execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
RuntimeError: max_pool3d_with_indices_backward_cuda does not have a deterministic implementation, but you set 'torch.use_deterministic_algorithms(True)'. You can turn off determinism just for this operation, or you can use the 'warn_only=True' option, if that's acceptable for your application. You can also file an issue at https://github.com/pytorch/pytorch/issues to help us prioritize adding deterministic support for this operation.

Process finished with exit code 1
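
For readers who hit the same RuntimeError: the message itself suggests a workaround. A minimal sketch, assuming this fork enables determinism the way upstream YOLOv5 does (a torch.use_deterministic_algorithms(True) call in init_seeds() in utils/general.py):

import torch

# Hedged workaround per the error message: downgrade the hard error on
# non-deterministic ops (here max_pool3d_with_indices_backward_cuda) to a
# warning instead of aborting. Requires PyTorch >= 1.11.
torch.use_deterministic_algorithms(True, warn_only=True)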

Effect of different attention mechanisms

Hello author, I am a beginner with YOLO models and would like to ask you some questions. In your article you mentioned the CPAM module you used, and also that you compared it against other attention modules such as SENet and CBAM. I would like to run some comparative experiments of my own, replacing the attention module and observing its effect. Could you briefly explain how to proceed? Thank you.
