xymfei
Type: User
Voice cloning. Instant voice cloning by MyShell
OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering [CVPR2023].
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
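PanoHead: generates a textured 3D model from a single image. Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
PerSAM (Personalize Segment Anything) is now available on @huggingface! This (super cool) method gives SAM Dreambooth-like personalization, so it can quickly segment new concepts in images from a single (image, mask) example.
Removes objects from video. [ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
PyRCA: a Python library for root cause analysis, an open-source project from Salesforce.
RealFill: intelligent image expansion, similar to the feature in Photoshop.
ReMoDiffuse: controllable text-to-motion. Retrieval-Augmented Motion Diffusion Model
Voice changer. Voice data <= 10 mins can also be used to train a good VC model!
Mattes people out of a video and turns the background into a green screen. Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!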
one-click deepfake (face swap)
(CVPR 2023) SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
[Automation script] Automatically splits film and TV commentary videos into episodes.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Single Motion Diffusion Model
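SLAHMR: tracks moving people in a video and can estimate their global trajectories and poses.
SMPLer-X: extracts human motion from video for animating virtual characters.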
SoftVC VITS Singing Voice Conversion
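Fast Stable Diffusion. An ultra lightweight inference performance optimization library for HuggingFace Diffusers on NVIDIA GPUs.
supervision: tracks and identifies all objects in a video. v0.16.0 added a halo-effect annotator that hugs each object's outline to segment and label it. Advanced video analytics: trackers, zones, annotators, and more. roboflow/supervision: We write your reusable computer vision tools. 💜
TalkSHOW (#CVPR2023): generates realistic, coherent, and diverse holistic 3D motion, that is, body motion together with facial expressions and hand gestures. This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].
Text2Performer: generates vivid, high-resolution human videos with clearly articulated motion from text prompts. Paper: Text2Performer: Text-Driven Human Video Generation
Almost a miracle tool 👉 a one-click text-to-video tool.
Text to continuous human motion.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything and XMem.
TriPlaneNet: from a single portrait, generates views of the subject turned left, right, up, and down, using a 3D GAN for novel view rendering. An encoder for EG3D inversion; the second version has been accepted to WACV 2024 and will be released soon.
Tencent's natural talking-head model. V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Transcribes and translates video speech, with voice cloning, lip syncing, and burned-in subtitles; supports converting videos between Chinese and English.
VideoReTalking: syncs the mouth movements of a person in a video with the input audio. [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild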