Welcome to summit your project [here]
- The First Comprehensive SAM Survey: Chunhui Zhang, Li Liu, Yawen Cui, Guanjie Huang, Weilin Lin, Yiqian Yang, Yuehong Hu.
"A Comprehensive Survey on Segment Anything Model for Vision and Beyond." ArXiv (2023). [paper] [homepage][中文解读]
-
SAM: Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick.
"Segment Anything." ICCV (2023). [paper] [homepage] [code] [Zhihu] [Reddit] -
SEEM: Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, Jianfeng Gao, Yong Jae Lee.
"Segment Everything Everywhere All at Once." ArXiv (2023). [paper] [code] -
SegGPT: Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang.
"SegGPT: Segmenting Everything In Context." ArXiv (2023). [paper] [code] -
Grounding DINO: Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Hao Zhang, Jie Yang, Chunyuan Li, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang.
"Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection." ArXiv (2023). [paper] [code] -
OVSeg: Feng Liang, Bichen Wu, Xiaoliang Dai, Kunpeng Li, Yinan Zhao, Hang Zhang, Peizhao Zhang, Peter Vajda, Diana Marculescu.
"Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP." CVPR (2023). [paper] [homepage] [code] -
OneFormer: Jitesh Jain, Jiachen Li, MangTik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi.
"OneFormer: One Transformer to Rule Universal Image Segmentation." CVPR (2023). [paper] [homepage] [code] -
ImageBind: Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra.
"ImageBind: One Embedding Space To Bind Them All." CVPR (2023). [paper] [homepage] [code] -
RAM: Youcai Zhang, Xinyu Huang, Jinyu Ma, Zhaoyang Li, Zhaochuan Luo, Yanchun Xie, Yuzhuo Qin, Tong Luo, Yaqian Li, Shilong Liu, Yandong Guo, Lei Zhang.
"Recognize Anything: A Strong Image Tagging Model." ArXiv (2023). [paper] [homepage] [code] -
PACGen: Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee.
"Generate Anything Anywhere in Any Scene." ArXiv (2023). [paper] [homepage] [code] -
OpenSeeD: Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang.
"A Simple Framework for Open-Vocabulary Segmentation and Detection." ICCV (2023). [paper] [code]
💥HQTrack: Jiawen Zhu, Zhenyu Chen, Zeqi Hao, Shijie Chang, Lu Zhang, Dong Wang, Huchuan Lu, Bin Luo, Jun-Yan He, Jin-Peng Lan, Hanyuan Chen, Chenyang Li.
"Tracking Anything in High Quality." ArXiv (2023).
[paper]
[code]
-
Fashion Matrix: Zheng Chong, Xujie Zhang, Fuwei Zhao, Zhenyu Xie, Xiaodan Liang.
"Fashion Matrix: Editing Photos by Just Talking." ArXiv (2023). [paper] [homepage] [code] -
RoboChop: Atharva Dikshit, Alison Bartsch, Abraham George, Amir Barati Farimani.
"RoboChop: Autonomous Framework for Fruit and Vegetable Chopping Leveraging Foundational Models." ArXiv (2023). [paper] -
Industrial-SA: Keno Moenck, Arne Wendt, Philipp Prünte, Julian Koch, Arne Sahrhage, Johann Gierecker, Ole Schmedemann, Falko Kähler, Dirk Holst, Martin Gomse, Thorsten Schüppstuhl, Daniel Schoepflin.
"Industrial Segment Anything -- a Case Study in Aircraft Manufacturing, Intralogistics, Maintenance, Repair, and Overhaul." ArXiv (2023). [paper] -
CNOS: Van Nguyen Nguyen, Tomas Hodan, Georgy Ponimatkin, Thibault Groueix, Vincent Lepetit.
"CNOS: A Strong Baseline for CAD-based Novel Object Segmentation." ArXiv (2023). [paper] [code] -
SAM-Path: Jingwei Zhang, Ke Ma, Saarthak Kapse, Joel Saltz, Maria Vakalopoulou, Prateek Prasanna, Dimitris Samaras.
"SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology." ArXiv (2023). [paper] -
BuboGPT: Yang Zhao, Zhijie Lin, Daquan Zhou, Zilong Huang, Jiashi Feng, Bingyi Kang.
"BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs." ArXiv (2023). [paper] [code] -
OpenSU: Ruiping Liu, Jiaming Zhang, Kunyu Peng, Junwei Zheng, Ke Cao, Yufan Chen, Kailun Yang, Rainer Stiefelhagen.
"Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments." ArXiv (2023). [paper] [code] -
OG: Zichao Dong, Hang Ji, Weikun Zhang, Xufeng Huang, Junbo Chen.
"OG: Equip vision occupancy with instance segmentation and visual grounding." ArXiv (2023). [paper] -
$SAM^{Med}$ : Chenglong Wang, Dexuan Li, Sucheng Wang, Chengxiu Zhang, Yida Wang, Yun Liu, Guang Yang.
$SAM^{Med}$ : A medical image annotation framework based on large vision model. ArXiv (2023). [paper] -
SAM-U: Guoyao Deng, Ke Zou, Kai Ren, Meng Wang, Xuedong Yuan, Sancong Ying, Huazhu Fu.
"SAM-U: Multi-box prompts triggered uncertainty estimation for reliable SAM in medical image." ArXiv (2023). [paper] -
Semantic-SAM: Feng Li, Hao Zhang, Peize Sun, Xueyan Zou, Shilong Liu, Jianwei Yang, Chunyuan Li, Lei Zhang, Jianfeng Gao.
"Semantic-SAM: Segment and Recognize Anything at Any Granularity." ArXiv (2023). [paper] [code] -
SAM-IQA: Xinpeng Li, Ting Jiang, Haoqiang Fan, Shuaicheng Liu.
"SAM-IQA: Can Segment Anything Boost Image Quality Assessment?." ArXiv (2023). [paper] [code] -
Cross-SAM: Xiaoyu Bai, Fan Bai, Xiaofei Huo, Jia Ge, Tony C. W. Mok, Zi Li, Minfeng Xu, Jingren Zhou, Le Lu, Dakai Jin, Xianghua Ye, Jingjing Lu, Ke Yan.
"Matching in the Wild: Learning Anatomical Embeddings for Multi-Modality Images." ArXiv (2023). [paper] -
LAM-SC: Feibo Jiang, Yubo Peng, Li Dong, Kezhi Wang, Kun Yang, Cunhua Pan, Xiaohu You.
"Large AI Model-Based Semantic Communications." ArXiv (2023). [paper] -
MSDeAOT: Yuanyou Xu, Jiahao Li, Zongxin Yang, Yi Yang, Yueting Zhuang.
"ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking." ArXiv (2023). [paper] -
EM-SAM: Ao Cheng, Guoqiang Zhao, Lirong Wang, Ruobing Zhang.
"AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images." ArXiv (2023). [paper] -
SAM-PT: Frano Rajič, Lei Ke, Yu-Wing Tai, Chi-Keung Tang, Martin Danelljan, Fisher Yu.
"Segment Anything Meets Point Tracking." ArXiv (2023). [paper] [code] -
SAMAug: Haixing Dai, Chong Ma, Zhengliang Liu, Yiwei Li, Peng Shu, Xiaozheng Wei, Lin Zhao, Zihao Wu, Dajiang Zhu, Wei Liu, Quanzheng Li, Tianming Liu, Xiang Li.
"SAMAug: Point Prompt Augmentation for Segment Anything Model." ArXiv (2023). [paper] [code] -
SAM-DA: Liangliang Yao, Haobo Zuo, Guangze Zheng, Changhong Fu, Jia Pan.
"SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation." ArXiv (2023). [paper] [code] -
RefSAM: Yonglin Li, Jing Zhang, Xiao Teng, Long Lan.
"RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation." ArXiv (2023). [paper] [code] -
All-in-SAM: Can Cui, Ruining Deng, Quan Liu, Tianyuan Yao, Shunxing Bao, Lucas W. Remedios, Yucheng Tang, Yuankai Huo.
"All-in-SAM: from Weak Annotation to Pixel-wise Nuclei Segmentation with Prompt-based Finetuning." ArXiv (2023). [paper] -
Zenglin Shi, Ying Sun, Mengmi Zhang.
"Training-free Object Counting with Prompts." ArXiv (2023). [paper] [code] -
Xiaoyu Shi, Shurong Chai, Yinhao Li, Jingliang Cheng, Jie Bai, Guohua Zhao, Yen-Wei Chen.
"Cross-modality Attention Adapter: A Glioma Segmentation Fine-tuning Method for SAM Using Multimodal Brain MR Images." ArXiv (2023). [paper] -
TDA: Ruben Glatt, Shusen Liu.
"Topological Data Analysis Guided Segment Anything Model Prompt Optimization for Zero-Shot Segmentation in Biological Imaging." ArXiv (2023). [paper] -
DADF: Yingxin Lai, Zhiming Luo, Zitong Yu.
"Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization." ArXiv (2023). [paper] [code] -
Lucas Prado Osco, Qiusheng Wu, Eduardo Lopes de Lemos, Wesley Nunes Gonçalves, Ana Paula Marques Ramos, Jonathan Li, José Marcato Junior.
"The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot." ArXiv (2023). [paper] -
RSPrompter: Keyan Chen, Chenyang Liu, Hao Chen, Haotian Zhang, Wenyuan Li, Zhengxia Zou, Zhenwei Shi.
"RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model." ArXiv (2023). [paper] [code] -
Zhewei Chen, Wai Keung Wong, Zuofeng Zhong, Jinpiao Liao, Ying Qu.
"Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection." ArXiv (2023). [paper] -
Zheyan Jin, Shiqi Chen, Yueting Chen, Zhihai Xu, Huajun Feng.
"Let Segment Anything Help Image Dehaze." ArXiv (2023). [paper] -
CLIP-SAM: Evan Kellener, Ihina Nath, An Ngo, Thomas Nguyen, Joshua Schuman, Coen Adler, Arnav Kartikeya.
"Utilizing Segment Anything Model For Assessing Localization of GRAD-CAM in Medical Imaging." ArXiv (2023). [paper] -
MESS: Benedikt Blumenstiel, Johannes Jakubik, Hilde Kühne, Michael Vössing.
"What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation." ArXiv (2023). [paper] [code] -
MMPM: Jiange Yang, Wenhui Tan, Chuhao Jin, Bei Liu, Jianlong Fu, Ruihua Song, Limin Wang.
"Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots." ArXiv (2023). [paper] [YouTube] [Bilibili] -
CellViT: Fabian Hörst, Moritz Rempe, Lukas Heine, Constantin Seibold, Julius Keyl, Giulia Baldini, Selma Ugurel, Jens Siveke, Barbara Grünwald, Jan Egger, Jens Kleesiek.
"CellViT: Vision Transformers for Precise Cell Segmentation and Classification." ArXiv (2023). [paper] [code] -
MedLSAM: Wenhui Lei, Xu Wei, Xiaofan Zhang, Kang Li, Shaoting Zhang.
"MedLSAM: Localize and Segment Anything Model for 3D Medical Images." ArXiv (2023). [paper] [code] -
MobileSAM: Chaoning Zhang, Dongshen Han, Yu Qiao, Jung Uk Kim, Sung-Ho Bae, Seungkyu Lee, Choong Seon Hong.
"Faster Segment Anything: Towards Lightweight SAM for Mobile Applications." ArXiv (2023). [paper] [code] -
SonarSAM: Lin Wang, Xiufen Ye, Liqiang Zhu, Weijie Wu, Jianguo Zhang, Huiming Xing, Chao Hu.
"When SAM Meets Sonar Images." ArXiv (2023). [paper] [code] -
AutoSAM: Xinrong Hu, Xiaowei Xu, Yiyu Shi.
"How to Efficiently Adapt Large Segmentation Model(SAM) to Medical Images." ArXiv (2023). [paper] [code] -
3DSAM-adapter: Shizhan Gong, Yuan Zhong, Wenao Ma, Jinpeng Li, Zhao Wang, Jingyang Zhang, Pheng-Ann Heng, Qi Dou.
"3DSAM-adapter: Holistic Adaptation of SAM from 2D to 3D for Promptable Medical Image Segmentation." ArXiv (2023). [paper] [code] -
Xinru Shan, Chaoning Zhang.
"Robustness of Segment Anything Model (SAM) for Autonomous Driving in Adverse Weather Conditions." ArXiv (2023). [paper] -
SAM-LST: Shurong Chai, Rahul Kumar Jain, Shiyu Teng, Jiaqing Liu, Yinhao Li, Tomoko Tateyama, Yen-wei Chen.
"Ladder Fine-tuning approach for SAM integrating complementary network." ArXiv (2023). [paper] [code] -
Mohsen Ahmadi, Masoumeh Farhadi Nia, Sara Asgarian, Kasra Danesh, Elyas Irankhah, Ahmad Gholizadeh Lonbar, Abbas Sharifi.
"Comparative Analysis of Segment Anything Model and U-Net for Breast Tumor Detection in Ultrasound and Mammography Images." ArXiv (2023). [paper] -
FastSAM: Xu Zhao, Wenchao Ding, Yongqi An, Yinglong Du, Tao Yu, Min Li, Ming Tang, Jinqiao Wang.
"Fast Segment Anything." ArXiv (2023). [paper] [code] -
Seal: Youquan Liu, Lingdong Kong, Jun Cen, Runnan Chen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu.
"Segment Any Point Cloud Sequences by Distilling Vision Foundation Models." ArXiv (2023). [paper] [code] [homepage] -
Lian Zhang, Zhengliang Liu, Lu Zhang, Zihao Wu, Xiaowei Yu, Jason Holmes, Hongying Feng, Haixing Dai, Xiang Li, Quanzheng Li, Dajiang Zhu, Tianming Liu, Wei Liu.
"Segment Anything Model (SAM) for Radiation Oncology." ArXiv (2023). [paper] -
Enlighten-Anything: Qihan Zhao, Xiaofeng Zhang, Hao Tang, Chaochen Gu, Shanying Zhu.
"Enlighten-anything:When Segment Anything Model Meets Low-light Image Enhancement." ArXiv (2023). [paper] [code] -
SAA+: Yunkang Cao, Xiaohao Xu, Chen Sun, Yuqi Cheng, Liang Gao, Weiming Shen.
"Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection." CVPR2023 Workshop. [paper] [code] -
TEPO: Chuyun Shen, Wenhao Li, Ya Zhang, Xiangfeng Wang.
"Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation." ArXiv (2023). [paper] -
TomoSAM: Federico Semeraro, Alexandre Quintart, Sergio Fraile Izquierdo, Joseph C. Ferguson.
"TomoSAM: a 3D Slicer extension using SAM for tomography segmentation." ArXiv (2023). [paper] [code] -
Madeline Chantry Schiappa, Sachidanand VS, Yunhao Ge, Ondrej Miksik, Yogesh S. Rawat, Vibhav Vineet.
"Robustness Analysis on Foundational Segmentation Models." ArXiv (2023). [paper] [code] -
Yu Qiao, Chaoning Zhang, Taegoo Kang, Donghun Kim, Shehbaz Tariq, Chenshuang Zhang, Choong Seon Hong.
"Robustness of SAM: Segment Anything Under Corruptions and Beyond." ArXiv (2023). [paper] -
AutoSAM: Tal Shaharabany, Aviad Dahan, Raja Giryes, Lior Wolf.
"AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder." ArXiv (2023). [paper] -
SAM-shadow: Xiaofeng Zhang, Chaochen Gu, Shanying Zhu.
"SAM-helps-Shadow:When Segment Anything Model meet shadow removal." ArXiv (2023). [paper] [code] -
Chaoning Zhang, Sheng Zheng, Chenghao Li, Yu Qiao, Taegoo Kang, Xinru Shan, Chenshuang Zhang, Caiyan Qin, Francois Rameau, Sung-Ho Bae, Choong Seon Hong.
"A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering." ArXiv (2023). [paper] -
MAM: Jiachen Li, Jitesh Jain, Humphrey Shi.
"Matting Anything." ArXiv (2023). [paper] [code] -
Haochen Xue, Mingyu Jin, Chong Zhang, Yuxuan Huang, Qian Weng, Xiaobo Jin.
"Automatic Image Blending Algorithm Based on SAM and DINO." ArXiv (2023). [paper] -
MatAny: Jingfeng Yao, Xinggang Wang, Lang Ye, Wenyu Liu.
"Matte Anything: Interactive Natural Image Matting with Segment Anything Models." ArXiv (2023). [paper] [code] -
CNS: Runnan Chen, Youquan Liu, Lingdong Kong, Nenglun Chen, Xinge Zhu, Yuexin Ma, Tongliang Liu, Wenping Wang.
"Towards Label-free Scene Understanding by Vision Foundation Models." ArXiv (2023). [paper] [code] -
SAM3D: Yunhan Yang, Xiaoyang Wu, Tong He, Hengshuang Zhao, Xihui Liu.
"SAM3D: Segment Anything in 3D Scenes." ArXiv (2023). [paper] [code] -
Calib-Anything: Zhaotong Luo, Guohang Yan, Yikang Li.
" Calib-Anything: Zero-training LiDAR-Camera Extrinsic Calibration Method Using Segment Anything." ArXiv (2023). [paper] [code] -
Shijie Chang, Zeqi Hao, Ben Kang, Xiaoqi Zhao, Jiawen Zhu, Zhenyu Chen, Lihe Zhang, Lu Zhang, Huchuan Lu.
" 3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW." ArXiv (2023). [paper] -
USD: Yulin He, Wei Chen, Yusong Tan, Siqi Wang.
" USD: Unknown Sensitive Detector Empowered by Decoupled Objectness and Segment Anything Model." ArXiv (2023). [paper] -
SAM3D: Dingyuan Zhang, Dingkang Liang, Hongcheng Yang, Zhikang Zou, Xiaoqing Ye, Zhe Liu, Xiang Bai.
"SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model." ArXiv (2023). [paper] [code] -
Shehbaz Tariq, Brian Estadimas Arfeto, Chaoning Zhang, Hyundong Shin.
"Segment Anything Meets Semantic Communication." ArXiv (2023). [paper] -
HQ-SAM: Lei Ke, Mingqiao Ye, Martin Danelljan, Yifan Liu, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu.
"Segment Anything in High Quality." ArXiv (2023). [paper] [code] -
DeSAM: Yifan Gao, Wei Xia, Dingdu Hu, Xin Gao.
"DeSAM: Decoupling Segment Anything Model for Generalizable Medical Image Segmentation." ArXiv (2023). [paper] [code] -
FineRewards: Guian Fang, Zutao Jiang, Jianhua Han, Guangsong Lu, Hang Xu, Xiaodan Liang.
"Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards." ArXiv (2023). [paper] [code] -
InstructEdit: Qian Wang, Biao Zhang, Michael Birsak, Peter Wonka.
"InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions." ArXiv (2023). [paper] [code] -
AIMS: Lu Qi, Jason Kuen, Weidong Guo, Jiuxiang Gu, Zhe Lin, Bo Du, Yu Xu, Ming-Hsuan Yang.
"AIMS: All-Inclusive Multi-Level Segmentation." ArXiv (2023). [paper] [code] -
ShadowSAM: Yonghui Wang, Wengang Zhou, Yunyao Mao, Houqiang Li.
"Detect Any Shadow: Segment Anything for Video Shadow Detection." ArXiv (2023). [paper] [code] -
ISA-NeRF: Xiaokang Chen, Jiaxiang Tang, Diwen Wan, Jingbo Wang, Gang Zeng.
"Interactive Segment Anything NeRF with Feature Imitation." ArXiv (2023). [paper] [homepage] -
Yihao Huang, Yue Cao, Tianlin Li, Felix Juefei-Xu, Di Lin, Ivor W. Tsang, Yang Liu, Qing Guo.
"On the Robustness of Segment Anything." ArXiv (2023). [paper] -
SAMScore: Yunxiang Li, Meixu Chen, Wenxuan Yang, Kai Wang, Jun Ma, Alan C. Bovik, You Zhang.
"SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation." ArXiv (2023). [paper] [code] -
SAD: Jun Cen, Yizheng Wu, Kewei Wang, Xingyi Li, Jingkang Yang, Yixuan Pei, Lingdong Kong, Ziwei Liu, Qifeng Chen.
"SAD: Segment Any RGBD." ArXiv (2023). [paper] [code] -
SPT: Zeyu Xiao, Jiawang Bai, Zhihe Lu, Zhiwei Xiong.
"A Dive into SAM Prior in Image Restoration." ArXiv (2023). [paper] -
Matcher: Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, Chunhua Shen.
"Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching." ArXiv (2023). [paper] [code] -
RAP: Jiaxi Jiang, Christian Holz.
"Restore Anything Pipeline: Segment Anything Meets Image Restoration." ArXiv (2023). [paper] [code] -
UVOSAM: Zhenghao Zhang, Zhichao Wei, Shengfan Zhang, Zuozhuo Dai, Siyu Zhu.
"UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model." ArXiv (2023). [paper] -
BreastSAM: Mingzhe Hu, Yuheng Li, Xiaofeng Yang.
"BreastSAM: A Study of Segment Anything Model for Breast Tumor Detection in Ultrasound Images." ArXiv (2023). [paper] -
SAMSh: Leiping Jie, Hui Zhang.
"When SAM Meets Shadow Detection." ArXiv (2023). [paper] [code] -
Instruct2Act: Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li.
"Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model." ArXiv (2023). [paper] [code] -
WS-SAM: Chunming He, Kai Li, Yachao Zhang, Guoxia Xu, Longxiang Tang, Yulun Zhang, Zhenhua Guo, Xiu Li.
"Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping." ArXiv (2023). [paper] -
SAA+: Yunkang Cao, Xiaohao Xu, Chen Sun, Yuqi Cheng, Zongwei Du, Liang Gao, Weiming Shen.
"Segment Any Anomaly without Training via Hybrid Prompt Regularization." ArXiv (2023). [paper] [code] -
OR-NeRF: Youtan Yin, Zhoujie Fu, Fan Yang, Guosheng Lin.
"OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields." ArXiv (2023). [paper] -
PromptUNet: Junde Wu.
"PromptUNet: Toward Interactive Medical Image Segmentation." ArXiv (2023). [paper] [code] -
EAC: Ao Sun, Pingchuan Ma, Yuanyuan Yuan, Shuai Wang.
"Explain Any Concept: Segment Anything Meets Concept-Based Explanation." ArXiv (2023). [paper] -
Xiao Yang, Haixing Dai, Zihao Wu, Ramesh Bist, Sachin Subedi, Jin Sun, Guoyu Lu, Changying Li, Tianming Liu, Lilong Chai.
"SAM for Poultry Science." ArXiv (2023). [paper] -
Leaf Only SAM: Dominic Williams, Fraser MacFarlane, Avril Britten.
"Leaf Only SAM: A Segment Anything Pipeline for Zero-Shot Automated Leaf Segmentation." ArXiv (2023). [paper] -
KD-SAM: Sahib Julka, Michael Granitzer.
"Knowledge distillation with Segment Anything (SAM) model for Planetary Geological Mapping." ArXiv (2023). [paper] -
SAM-Track: Yangming Cheng, Liulei Li, Yuanyou Xu, Xiaodi Li, Zongxin Yang, Wenguan Wang, Yi Yang.
"Segment-and-Track Anything." ArXiv (2023). [paper] [code] -
SEEM: Zhihe Lu, Zeyu Xiao, Jiawang Bai, Zhiwei Xiong, Xinchao Wang.
"Can SAM Boost Video Super-Resolution?" ArXiv (2023). [paper] -
Yuqing Wang, Yun Zhao, Linda Petzold.
"An Empirical Study on the Robustness of the Segment Anything Model (SAM)." ArXiv (2023). [paper] -
SAM-WSSS: Tianle Chen, Zheda Mai, Ruiwen Li, Wei-lun Chao.
"Segment Anything Model (SAM) Enhanced Pseudo Labels for Weakly Supervised Semantic Segmentation." ArXiv (2023). [paper] [code] -
SAM4MIS: Yichi Zhang, Rushi Jiao.
"How Segment Anything Model (SAM) Boost Medical Image Segmentation?" ArXiv (2023). [paper] [code] -
BadSAM: Zihan Guan, Mengxuan Hu, Zhongliang Zhou, Jielu Zhang, Sheng Li, Ninghao Liu.
"BadSAM: Exploring Security Vulnerabilities of SAM via Backdoor Attacks." ArXiv (2023). [paper] -
PerSAM: Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li.
"Personalize Segment Anything Model with One Shot." ArXiv (2023). [paper] [code] -
CAT: Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao.
"Caption Anything: Interactive Image Description with Diverse Multimodal Controls." ArXiv (2023). [paper] [code] -
SAMRS: Di Wang, Jing Zhang, Bo Du, Dacheng Tao, Liangpei Zhang.
"Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model." ArXiv (2023). [paper] [code] -
AV-SAM: Shentong Mo, Yapeng Tian.
"AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation." ArXiv (2023). [paper] -
WSSS: Weixuan Sun, Zheyuan Liu, Yanhao Zhang, Yiran Zhong, Nick Barnes.
"An Alternative to WSSS? An Empirical Study of the Segment Anything Model (SAM) on Weakly-Supervised Semantic Segmentation Problems." ArXiv (2023). [paper] -
PLG-SAM: Peng-Tao Jiang, Yuqi Yang.
"Segment Anything is A Good Pseudo-label Generator for Weakly Supervised Semantic Segmentation." ArXiv (2023). [paper] -
Attack-SAM: Chenshuang Zhang, Chaoning Zhang, Taegoo Kang, Donghun Kim, Sung-Ho Bae, In So Kweon.
"Attack-SAM: Towards Attacking Segment Anything Model With Adversarial Examples." ArXiv (2023). [paper] -
Polyp-SAM: Yuheng Li, Mingzhe Hu, Xiaofeng Yang.
"Polyp-SAM: Transfer SAM for Polyp Segmentation." ArXiv (2023). [paper] [code] -
Dongsheng Han, Chaoning Zhang, Yu Qiao, Maryam Qamar, Yuna Jung, SeungKyu Lee, Sung-Ho Bae, Choong Seon Hong.
"Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected." ArXiv (2023). [paper] -
DSEC-MOS: Zhuyun Zhou, Zongwei Wu, Rémi Boutteau, Fan Yang, Dominique Ginhac.
"DSEC-MOS: Segment Any Moving Object with Moving Ego Vehicle." ArXiv (2023). [paper] [code] -
Christian Mattjie, Luis Vinicius de Moura, Rafaela Cappelari Ravazio, Lucas Silveira Kupssinskü, Otávio Parraga, Marcelo Mussi Delucis, Rodrigo Coelho Barros.
"Zero-shot performance of the Segment Anything Model (SAM) in 2D medical imaging: A comprehensive evaluation and practical guidelines." ArXiv (2023). [paper] [code] -
Dongjie Cheng, Ziyuan Qin, Zekun Jiang, Shaoting Zhang, Qicheng Lao, Kang Li.
"SAM on Medical Images: A Comprehensive Study on Three Prompt Modes." ArXiv (2023). [paper] -
An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren.
"SAM Meets Robotic Surgery: An Empirical Study in Robustness Perspective." ArXiv (2023). [paper] -
Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Haozhe Chi, Xindi Hu, Deng-Ping Fan, Fajin Dong, Dong Ni.
"Segment Anything Model for Medical Images?" ArXiv (2023). [paper] -
Edit Everything: Defeng Xie, Ruichen Wang, Jian Ma, Chen Chen, Haonan Lu, Dong Yang, Fobo Shi, Xiaodong Lin.
"Edit Everything: A Text-Guided Generative System for Images Editing." ArXiv (2023). [paper] [code] -
SkinSAM: Mingzhe Hu, Yuheng Li, Xiaofeng Yang.
"SkinSAM: Empowering Skin Cancer Segmentation with Segment Anything Model." ArXiv (2023). [paper] -
GazeSAM: Bin Wang, Armstrong Aboah, Zheyuan Zhang, Ulas Bagci.
" GazeSAM: What You See is What You Segment." ArXiv (2023). [paper] [code] -
SAMed: Kaidong Zhang, Dong Liu.
" Customized Segment Anything Model for Medical Image Segmentation." ArXiv (2023). [paper] [code] -
LearnablePromptSAM: Zhongxi Qiu, Yan Hu, Heng Li, Jiang Liu.
" Learnable Ophthalmology SAM." ArXiv (2023). [paper] [code] -
Simiao Ren, Francesco Luzi, Saad Lahrichi, Kaleb Kassaw, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof.
" Segment anything, from space?." ArXiv (2023). [paper] -
Peilun Shi, Jianing Qiu, Sai Mu Dalike Abaxi, Hao Wei, Frank P. -W. Lo, Wu Yuan.
"Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation." ArXiv (2023). [paper] -
MSA: Junde Wu, Yu Zhang, Rao Fu, Huihui Fang, Yuanpei Liu, Zhaowei Wang, Yanwu Xu, Yueming Jin.
" Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation." ArXiv (2023). [paper] [code] -
Mohsen Ahmadi, Ahmad Gholizadeh Lonbar, Abbas Sharifi, Ali Tarlani Beris, Mohammadsadegh Nouri, Amir Sharifzadeh Javidi.
"Application of Segment Anything Model for Civil Infrastructure Defect Assessment." ArXiv (2023). [paper] -
SA3D: Jiazhong Cen, Zanwei Zhou, Jiemin Fang, Wei Shen, Lingxi Xie, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian.
" Segment Anything in 3D with NeRFs." ArXiv (2023). [paper] [code] -
MedSAM: Jun Ma, Bo Wang.
" Segment Anything in Medical Images." ArXiv (2023). [paper] [code] -
TAM: Jinyu Yang, Mingqi Gao, Zhe Li, Shang Gao, Fangjing Wang, Feng Zheng.
"Track Anything: Segment Anything Meets Videos." ArXiv (2023). [paper] [code] -
SNA: Yongcheng Jing, Xinchao Wang, Dacheng Tao.
"Segment Anything in Non-Euclidean Domains: Challenges and Opportunities." ArXiv (2023). [paper] -
SAMAug: Yizhe Zhang, Tao Zhou, Peixian Liang, Danny Z. Chen.
"Input Augmentation with SAM: Boosting Medical Image Segmentation with Segmentation Foundation Model." ArXiv (2023). [paper] -
Count-Anything: Zhiheng Ma, Xiaopeng Hong, Qinnan Shangguan.
"Can SAM Count Anything? An Empirical Study on SAM Counting." ArXiv (2023). [paper] [code] -
Text2Seg: Jielu Zhang, Zhongliang Zhou, Gengchen Mai, Lan Mu, Mengxuan Hu, Sheng Li.
" Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models." ArXiv (2023). [paper] [code] -
Maciej A. Mazurowski, Haoyu Dong, Hanxue Gu, Jichen Yang, Nicholas Konz, Yixin Zhang.
"Segment Anything Model for Medical Image Analysis: an Experimental Study." ArXiv (2023). [paper] -
Anything-3D: Qiuhong Shen, Xingyi Yang, Xinchao Wang.
"Anything-3D: Towards Single-view Anything Reconstruction in the Wild." ArXiv (2023). [paper] [code] -
Any-to-Any Transfer: Songhua Liu, Jingwen Ye, Xinchao Wang.
"Any-to-Any Style Transfer: Making Picasso and Da Vinci Collaborate." ArXiv (2023). [paper] [code] -
Sheng He, Rina Bao, Jingpeng Li, Jeffrey Stout, Atle Bjornerud, P. Ellen Grant, Yangming Ou.
"Computer-Vision Benchmark Segment-Anything Model (SAM) in Medical Images: Accuracy in 12 Datasets." ArXiv (2023). [paper] -
SAM-Adapter: Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Zejian Li, Lingyun Sun, Papa Mao, Ying Zang.
"SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More." ArXiv (2023). [paper] -
Chuanfei Hu, Tianyi Xia, Shenghong Ju, Xinde Li.
" When SAM Meets Medical Images: An Investigation of Segment Anything Model (SAM) on Multi-phase Liver Tumor Segmentation." ArXiv (2023). [paper] -
SATIR: Junzhang Chen, Xiangzhi Bai.
" Learning to "Segment Anything" in Thermal Infrared Images through Knowledge Distillation with a Large Scale Dataset SATIR." ArXiv (2023). [paper] [code] -
Florian Putz, Johanna Grigo, Thomas Weissmann, Philipp Schubert, Daniel Hoefler, Ahmed Gomaa, Hassen Ben Tkhayat, Amr Hagag, Sebastian Lettmaier, Benjamin Frey, Udo S. Gaipl, Luitpold V. Distel, Sabine Semrau, Christoph Bert, Rainer Fietkau, Yixing Huang.
"The Segment Anything foundation model achieves favorable brain tumor autosegmentation accuracy on MRI to support radiotherapy treatment planning." ArXiv (2023). [paper] -
Iraklis Giannakis, Anshuman Bhardwaj, Lydia Sam, Georgios Leontidis.
" Deep learning universal crater detection using Segment Anything Model (SAM)." ArXiv (2023). [paper] -
SAMPolyp: Tao Zhou, Yizhe Zhang, Yi Zhou, Ye Wu, Chen Gong.
" Can SAM Segment Polyps?" ArXiv (2023). [paper] [code] -
Inpaint-Anything: Tao Yu, Runseng Feng, Ruoyu Feng, Jinming Liu, Xin Jin, Wenjun Zeng, Zhibo Chen.
"Inpaint Anything: Segment Anything Meets Image Inpainting." ArXiv (2023). [paper] [code] -
Ge-Peng Ji, Deng-Ping Fan, Peng Xu, Ming-Ming Cheng, Bowen Zhou, Luc Van Gool.
" SAM Struggles in Concealed Scenes -- Empirical Study on "Segment Anything"." ArXiv (2023). [paper] -
Wei Ji, Jingjing Li, Qi Bi, Wenbo Li, Li Cheng.
"Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-world Applications." ArXiv (2023). [paper] -
CLIP Surgery: Yi Li, Hualiang Wang, Yiqun Duan, Xiaomeng Li.
"CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks." ArXiv (2023). [paper] [code] -
SAMM: Yihao Liu, Jiaming Zhang, Zhangcong She, Amir Kheradmand, Mehran Armand.
"SAMM (Segment Any Medical Model): A 3D Slicer Integration to SAM." ArXiv (2023). [paper] [code] -
SAM.MD: Saikat Roy, Tassilo Wald, Gregor Koehler, Maximilian R. Rokuss, Nico Disch, Julius Holzschuh, David Zimmerer, Klaus H. Maier-Hein.
"SAM.MD: Zero-shot medical image segmentation capabilities of the Segment Anything Model." ArXiv (2023). [paper] -
SAM vs BET: Sovesh Mohapatra, Advait Gosai, Gottfried Schlaug.
"SAM vs BET: A Comparative Study for Brain Extraction and Segmentation of Magnetic Resonance Images using Deep Learning." ArXiv (2023). [paper] [code] -
SAMCOD: Lv Tang, Haoke Xiao, Bo Li.
"Can SAM Segment Anything? When SAM Meets Camouflaged Object Detection." ArXiv (2023). [paper] [code]
No. | Project | Title | Project page | Code base | Affiliation | Description |
---|---|---|---|---|---|---|
001 | SAM | Segment Anything | Project page | Code | Meta | A foundation model for general segmentation. |
002 | SAM-Track | Segment and Track Anything | Colab | Code | Zhejiang University | A project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. |
003 | Grounded-SAM | Grounded-Segment-Anything | Colab | Code | IDEA-Research | A project by combining Grounding DINO and SAM which aims to detect and segment Anything with text inputs. |
004 | MMDet-SAM | - | - | Code | OpenMMLab | A new way of instance segmentation by combining SAM with Closed-Set Object Detection, Open-Vocabulary Object Detection, Grounding Object Detection. |
005 | MMRotate-SAM | Zero-shot Oriented Object Detection with SAM | - | Code | OpenMMLab | A project join SAM and weakly supervised horizontal box detection to achieve rotated box detection. |
006 | MMOCR-SAM | - | - | Code | OpenMMLab | A solution of Text Detection/Recognition and SAM that segments every text character, with striking text removal and text inpainting demos driven by diffusion models and Gradio. |
007 | MMEditing-SAM | - | - | Code | OpenMMLab | A project join SAM and image generation to create awesome images and edit any part of them. |
008 | Label-Studio-SAM | OpenMMLab PlayGround: Semi-Automated Annotation with Label-Studio and SAM | - | Code | OpenMMLab | A project combining Label-Studio and SAM to achieve semi-automated annotation. |
009 | PaddleSeg | Segment Anything with PaddleSeg | - | Code | PaddlePaddle | A pretrained model parameters of PaddlePaddle format. |
010 | SegGPT | Segmenting Everything In Context | Hugging Face | Code | BAAI-Vision | SAM In Context based on Painter. |
011 | SEEM | Segment Everything Everywhere All at Once | Hugging Face | Code | Microsoft | A project can Segment Everything Everywhere with Multi-modal prompts all at once. |
012 | CLIP Surgery | CLIP Surgery for Better Explainability with Enhancement in Open Vocabulary Tasks | Project page | Code | HKUST | A work about SAM based on CLIP's explainability to achieve text to mask without manual points. |
013 | SAMCOD | Can SAM Segment Anything? When SAM Meets Camouflaged Object Detection | - | Code | - | SAM +Camouflaged object detection (COD) task. |
014 | Inpaint Anything | Segment Anything Meets Image Inpainting | Hugging Face | Code | USTC and EIT | SAM combines Inpainting, which is able to remove the object smoothly. |
015 | PerSAM | Personalize Segment Anything Model with One Shot | Hugging Face | Code | - | SAM with specific concepts. |
016 | MedSAM | Segment Anything in Medical Images | - | Code | - | A step-by-step tutorial with a small dataset to help you quickly utilize SAM. |
017 | Segment-Any-Anomaly | GroundedSAM Anomaly Detection | Colab | Code | HUST | Grounding DINO + SAM to segment any anomaly. |
018 | SSA | Semantic Segment Anything | - | Code | Fudan University | A dense category annotation engine. |
019 | Magic Copy | - | - | Code | - | Magic Copy is a Chrome extension that uses SAM to extract a foreground object from an image and copy it to the clipboard. |
020 | Segment Anything with Clip | Segment Anything with Clip | Hugging Face | Code | - | SAM combined with CLIP. |
021 | MetaSeg | Segment Anything Video | Hugging Face | Code | - | Packaged version of the SAM. |
022 | SAM in Napari | Segment Anything Model (SAM) in Napari | Project page | Code | Applied Computer Vision Lab and German Cancer Research Center | Extended SAM's click-based foreground separation to full click-based semantic segmentation and instance segmentation. |
023 | SAM Medical Imaging | SAM Medical Imaging | - | Code | - | SAM for Medical Imaging. |
024 | 3D-Box | 3D-Box via Segment Anything | - | Code | - | SAM is extended to 3D perception by combining it with VoxelNeXt. |
025 | Anything-3D | - | - | Code | - | Anything 3DNovel View, Anything-NeRF, Any 3DFace. |
026 | L2SET | Learning to Segment EveryThing | - | Code | UC Berkeley, FAIR | A new partially supervised training paradigm for instance segmentation. |
027 | Edit Anything | Edit Anything by Segment-Anything | - | Code | - | Edit anything in images powered by SAM, ControlNet, StableDiffusion, \etc. |
028 | Image Edit Anything | IEA: Image Editing Anything | - | Code | - | Using stable diffusion and SAM for image editing. |
029 | SAM for Stable Diffusion Webui | Segment Anything for Stable Diffusion WebUI | - | Code | - | This extension aim for connecting AUTOMATIC1111 Stable Diffusion WebUI and Mikubill ControlNet Extension with SAM and GroundingDINO to enhance Stable Diffusion/ControlNet inpainting. |
030 | Earth Observation Tools | Segment Anything EO tools | Colab | Code | - | An earth observation tools for SAM. |
031 | Moving Object Detection | Towards Segmenting Anything That Moves | - | Code | - | A project about SAM + Moving Object Detection. |
032 | OCR-SAM | Optical Character Recognition with Segment Anything | Project page | Code | - | Combining MMOCR with SAM and Stable Diffusion. |
033 | SALT | Segment Anything Labelling Tool | - | Code | - | A project uses the SAM Model and adds a barebones interface to label images and saves the masks in the COCO format. |
034 | Prompt Segment Anything | Prompt Segment Anything | - | Code | - | An implementation of zero-shot instance segmentation using SAM. |
035 | SAM-RBox | - | - | Code | - | A project uses SAM for generating rotated bounding boxes with MMRotate, which is a comparison method of H2RBox-v2. |
036 | VISAM | MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors | - | Code | - | Combining SAM with MOT, it create the era of "MOTS". |
037 | SegEO | Segment Anything EO tools | - | Code | - | The tools are developed to ease the processing of spatial data (GeoTIFF and TMS) with SAM using sliding window algorithm for big files. |
038 | Napari Segment Anything | Napari Segment Anything | Project page | Code | - | SAM native Qt UI. |
039 | Segment-Anything-U-Specify | Segment-Anything-U-Specify | - | Code | - | Using CLIP and SAM to segment any instance you specify with text prompt of any instance names. |
040 | SegDrawer | Simple static web-based mask drawer | Colab | Code | - | Simple static web-based mask drawer, supporting semantic segmentation with SAM. |
041 | Track Anything | Segment Anything Meets Videos | Hugging Face | Code | SUSTech | Track-Anything is a flexible and interactive tool for video object tracking and segmentation. |
042 | Count Anything | - | - | Code | - | A method uses SAM and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation. |
043 | RAM | Relate Anything Model | Hugging Face | Code | MMLab, NTU and VisCom Lab, KCL/TongJi | Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image. |
044 | Segment Any RGBD | Segment Any RGBD | Project page | Code | - | Segment AnyRGBD is a toolbox to segment rendered depth images based on SAM. |
045 | Show Anything | Show Anything | Hugging Face | Code | Showlab, NUS | Some Applications that are compatible with both SAM and Generation. |
046 | Transfer Any Style | Any-to-Any Style Transfer: Making Picasso and Da Vinci Collaborate | - | Code | LV-lab, NUS | An interactive demo based on Segment-Anything for style transfer which enables different content regions apply different styles. |
047 | Caption Anything | - | Colab | Code | VIP lab, SUSTech | Caption-Anything is a versatile image processing tool that combines the capabilities of SAM, Visual Captioning, and ChatGPT. |
048 | Image2Paragraph | Transform Image Into Unique Paragraph | Project page | Code | - | Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet. |
049 | LIME SAM | Local Interpretable Model-agnostic Explanations Segment Anything | Colab | Code | - | LIME-SAM aims to create an Explainable Artificial Intelligence (XAI) framework for image classification using LIME (Local Interpretable Model-agnostic Explanations) as the base algorithm, with the super-pixel method replaced by SAM. |
050 | Paint Anything | - | - | Code | - | An interactive demo based on SAM for stroke-based painting which enables human-like painting. |
051 | SAMed | Customized Segment Anything Model for Medical Image Segmentation | Colab | Code | USTC | SAMed is built upon the large-scale image segmentation model, SAM, to explore the new research paradigm of customizing large-scale models for medical image segmentation. |
052 | Personalize SAM | Personalize Segment Anything with 1 Shot in 10 Seconds | Hugging Face | Code | MMLab, CUHK | A training-free Personalization approach for SAM, termed as PerSAM. Given only a single image with a reference mask, PerSAM can segment specific visual concepts. |
053 | Open-vocabulary-Segment-Anything | Open-vocabulary-Segment-Anything | - | Code | - | Combining OwlViT with Segment Anything - Open-vocabulary Detection and Segmentation (Text-conditioned, and Image-conditioned). |
054 | Labal-Anything-Pipeline | Label-Anything-Pipeline | - | Code | ZJU | Annotation anything in visual tasks just all in one-pipeline with GPT-4 and SAM. |
055 | Grounded-Segment-Any-Parts | Grounded Segment Anything: From Objects to Parts | Project page | Code | HKU | Expand Segment Anything Model (SAM) to support text prompt input. The text prompt could be object-level(eg, dog) and part-level(eg, dog head). |
056 | AnyLabeling | AnyLabeling | Youtube page | Code | - | Effortless AI-assisted data labeling with AI support from Segment Anything and YOLO. |
057 | SSA | Semantic-Segment-Anything | Project page | Code | - | Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B). |
058 | RefSAM | Label Data with Segment Anything in Roboflow | Project page | Code | - | Referring Image Segmentation Benchmarking with Segment Anything Model (SAM). |
059 | Roboflow Annotate | Launch: Label Data with Segment Anything in Roboflow | Project page | APP | Roboflow | SAM-assisted labeling for training computer vision models. |
060 | ImageBind SAM | - | - | Code | IDEA-Research | This is an experimental demo aims to combine ImageBind and SAM to generate mask with different modalities. |
061 | X-AnyLabeling | X-AnyLabeling | Code | CVHub | A new interactive automatic labeling tool based on AnyLabeling. | |
062 | Segment Anything + NNCF | - | Code | - | OpenVINO™ NNCF for segment anything encoder quantization acceleration. | |
063 | YOLOv8 + SAM | - | - | - | Use SAM in YOLOv8. | |
064 | SearchAnything | SearchAnything | Zhihu blog, Twitter | Code | CAS and MSRA | A semantic local search engine powered by various AI models. |
065 | SAM Meets Stable Diffusion | - | Code | PaddlePaddle | Segment and generate Anything. |
- VainF/Awesome-Anything
- Hedlen/Awesome Segment Anything
- Vision-Intelligence-and-Robots-Group/Awesome-Segment-Anything
- JerryX1110/Awesome-segment-anything-extensions
- dk-liang/Awesome-Segment-Anything
If this survey is useful in your research, please use the following BibTeX entry.
@article{chunhui2023samsurvey,
title={A Comprehensive Survey on Segment Anything Model for Vision and Beyond},
author={Zhang, Chunhui and Liu, Li and Cui, Yawen and Huang, Guanjie and Lin, Weilin and Yang, Yiqian and Hu, Yuehong},
journal={arXiv:2304.02643},
year={2023}
}