Xu Cao's Projects
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
PyTorch Android examples of usage in applications
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
Welcome to the Bot Framework samples repository. Here you will find task-focused samples in C#, JavaScript and TypeScript to help you get started with the Bot Framework SDK!
A project based on the IJCNN paper "Automatic Chromosome Classification using Deep Attention Based Sequence Learning of Chromosome Bands" that also tries out some new methods
A U-Net-based project for the chromosome segmentation problem
Implementations of several U-Net variants, including contour-aware Unet, DCAN, Dual Unet, Attention Unet, and Unet++
A U-Net-based network that borrows ideas from Deep Layer Aggregation (DLA), Unet++, ET-Net, and others
Use Cylinder3D in Waymo
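Introductory deep learning tutorials and excellent articles (Deep Learning Tutorial)
Deep Learning 500 Questions: explains frequently raised topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book is divided into 18 chapters and runs to more than 500,000 characters. Because the author's expertise is limited, readers are kindly asked to point out any shortcomings in the book. To be continued... For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan 2018.06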
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Sparse Optimization Algorithms: IHT and OMP
An AI project about Gomoku
Apply OpenPose and Infant Key-point Dataset to Evaluate Infant Posture
github.io for Iroh Cao
Karras et al. (2022) diffusion models applied to medical data
Unofficial implementation of the LaneNet model for real-time lane detection (PyTorch version)
PyTorch Implementation of Lipschitz Transformer
MAE-ViT-pytorch; the structure is based on https://github.com/rwightman/pytorch-image-models
Checklist is a custom Teams message extension app that enables users to collaborate with their team by creating a shared checklist in a chat or channel. The Checklist app is supported across all platforms: Teams desktop, browser, iOS, and Android clients. It is ready for deployment as part of your existing Microsoft 365 subscription.
OpenMMLab's next-generation platform for general 3D object detection.
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
A Deep Learning Python Toolkit for Healthcare Applications.
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
The number of bacterial types is a critical monitoring indicator for indoor air quality standards. Cultivating and counting bacterial colonies is a challenging task that requires expertise and is time-consuming. In this work, we investigate several U-Net improvement approaches. We are motivated by the assumption that contour information and a semantic embedding branch can enhance U-Net's segmentation capacity for blurred and overlapping objects. Therefore, we propose the Semantic Embedding and Contour Assist U-Net (SEC-U-Net) for direct bacteria segmentation and a shallow CNN for bacteria classification. The algorithm treats bacteria detection as a two-stage segmentation and classification task. Experimental results demonstrate that the proposed method outperforms state-of-the-art improved U-Net approaches on our bacteria dataset. The proposed SEC-U-Net+CNN approach achieved precision rates of over 91% and 85% for E. coli and S. aureus, respectively.
Stable Diffusion web UI
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.