maxylee
Name: Maxy
Type: User
Company: Tsinghua University
Bio: To be or not to be
Location: Beijing
Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"
For Automata final
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Python code for CIDEr: Consensus-based Image Caption Evaluation
Simple image captioning model
The new Decaf compiler, rewritten in "modern" Java
Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis
Reliably download millions of images efficiently
Code repository for the ACL 2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and scripts for the proposed probing tasks. We hope the code helps those who want to do research on multimodal machine translation.
GenForce: an efficient PyTorch library for deep generative modeling (StyleGANv1v2, PGGAN, etc)
GLM (General Language Model)
GloVe model for distributed word representation
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
I decided to sync this repo with self-critical.pytorch. (The old master is archived in the old master branch.)
A naive Android App for learning Kotlin and Android jetpack
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
MAttNet: Modular Attention Network for Referring Expression Comprehension
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)