Git Product home page Git Product logo

dannielge's Projects

acg2vec icon acg2vec

ACG2vec (Anime Comics Games to vector) are committed to creating a playground that combines ACG and Deep learning.(文本语义检索、以图搜图、语义搜图、图片超分辨率、推荐系统)

audioclip icon audioclip

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

cap4video icon cap4video

【CVPR'2023 Highlight】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

centerclip icon centerclip

[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast pyav video decoding.

clip4clip icon clip4clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

clipbert icon clipbert

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

cnnh icon cnnh

try to implement the method in "Supervised Hashing for Image Retrieval via Image Representation Learning"

controlvideo icon controlvideo

Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

crowdclip icon crowdclip

CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model [CVPR 2023]

cvnet icon cvnet

Official PyTorch Implementation of Correlation Verifcation for Image Retrieval, CVPR 2022 (Oral Presentation)

daeh icon daeh

Deep Adaptively-Enhanced Hashing with Discriminative Similarity Guidance for Unsupervised Cross-modal Retrieval

desah icon desah

ICMR2023 Deep Enhanced-Similarity Attention Hashing Learning.

dgcpn icon dgcpn

Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing

easynlp icon easynlp

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

everything_at_once icon everything_at_once

This is the official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022

galr icon galr

Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"

gformer icon gformer

[SIGIR'2023] "GFormer: Graph Transformer for Recommendation"

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.