Name: xuanhanyu
Type: User
Visualization of audio files implemented with the fast Fourier transform (FFT)
2.5D visual sound
3D ResNets for Action Recognition (CVPR 2018)
Code for the paper "Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning" ECCV 2020
PyTorch implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Attention-based Dropout Layer for Weakly Supervised Object Localization (CVPR 2019 Oral)
A comprehensive list of Deep Learning, Artificial Intelligence, and Machine Learning tutorials, rapidly expanding into areas of AI, Deep Learning, Machine Vision, and NLP, and into industry-specific areas such as Automotive, Retail, Pharma, Medicine, and Healthcare. Maintained by Tarry Singh through at least 2020, until he finishes his Ph.D. (which might end up being inter-stellar cosmic networks! Who knows! 😀)
TensorFlow implementation of "Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification" (ACL 2016)
Implementations and reviews of audio and computer vision papers in Python, using Keras and TensorFlow.
PyTorch implementation of an audio-visual fusion video captioning model
AudioDVP: Photorealistic Audio-driven Video Portraits
Pre-trained video classifier for transfer learning.
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
Audio Visual Instance Discrimination with Cross-Modal Agreement
[CVPR 2019] PyTorch code for Audio Visual Scene-Aware Dialog
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
A curated list of different papers and datasets in various areas of audio-visual processing
A curated list of awesome computer vision resources
A list of papers on lane detection.
Reading list for research topics in multimodal machine learning
A curated list of awesome self-supervised methods
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Caffe_Code_Analysis
Class Activation Mapping
Cross-model active contrastive coding
Co-Separating Sounds of Visual Objects (ICCV 2019)
Conditional Similarity Networks (CSNs-Tensorflow)