Git Product home page Git Product logo

Hay Kim's Projects

advancedliteratemachinery icon advancedliteratemachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.

aniportrait icon aniportrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

anyv2v icon anyv2v

A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

brushnet icon brushnet

The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

champ icon champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

consisti2v icon consisti2v

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

controlnet_plus_plus icon controlnet_plus_plus

Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

ctrl-adapter icon ctrl-adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

diffusers icon diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

dough icon dough

Dough is a open source tool for steering AI animations with precision.

fresco icon fresco

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

img2img-turbo icon img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

instantstyle icon instantstyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

magictime icon magictime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

minicpm icon minicpm

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

monkey icon monkey

【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

musev icon musev

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

ocr-sam icon ocr-sam

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting

open-sora icon open-sora

Open-Sora: Democratizing Efficient Video Production for All

open-sora-plan icon open-sora-plan

This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

pia icon pia

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.