cv-arxiv-daily's Introduction

Updated on 2024.06.12

Usage instructions: here

Table of Contents
  1. Diffusion
  2. Motion
  3. Image Generation
  4. Video Generation

Diffusion

Publish Date Title Authors PDF Code
2024-06-10 IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao et.al. 2406.06527 null
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525 link
2024-06-10 NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing Ting-Hsuan Chen et.al. 2406.06523 null
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508 link
2024-06-10 Rephasing spectral diffusion in time-bin spin-spin entanglement protocols Mehmet T. Uysal et.al. 2406.06497 null
2024-06-10 Probing the Heights and Depths of Y Dwarf Atmospheres: A Retrieval Analysis of the JWST Spectral Energy Distribution of WISE J035934.06$-$540154.6 Harshil Kothari et.al. 2406.06493 null
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465 null
2024-06-10 Cometh: A continuous-time discrete-state graph diffusion model Antoine Siraudin et.al. 2406.06449 null
2024-06-10 QSSEP describes the fluctuations of quantum coherences in the Anderson model Ludwig Hruza et.al. 2406.06444 null
2024-06-10 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Jiwoo Hong et.al. 2406.06424 null
2024-06-07 DVOS: Self-Supervised Dense-Pattern Video Object Segmentation Keyhan Najafian et.al. 2406.05131 null
2024-06-07 Ohms law lost and regained: observation and impact of zeros and poles Krishna Joshi et.al. 2406.05112 null
2024-06-07 Large Generative Graph Models Yu Wang et.al. 2406.05109 null
2024-06-07 CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion Xingrui Wang et.al. 2406.05082 null
2024-06-07 GenHeld: Generating and Editing Handheld Objects Chaerin Min et.al. 2406.05059 link
2024-06-07 Digital Twins of the EM Environment: Benchmark for Ray Launching Models Michele Zhu et.al. 2406.05042 link
2024-06-07 Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs Shentong Mo et.al. 2406.05038 null
2024-06-07 Linear stability analysis for a system of singular amplitude equations arising in biomorphology Aric Wheeler et.al. 2406.05037 null
2024-06-07 Generative diffusion models for synthetic trajectories of heavy and light particles in turbulence Tianyi Li et.al. 2406.05008 null
2024-06-07 CityCraft: A Real Crafter for 3D City Generation Jie Deng et.al. 2406.04983 null
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340 link
2024-06-07 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Fangfu Liu et.al. 2406.04338 null
2024-06-06 Coherent Zero-Shot Visual Instruction Generation Quynh Phung et.al. 2406.04337 null
2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Yang Sui et.al. 2406.04333 link
2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data Jiaxin Shi et.al. 2406.04329 null
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324 null
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323 null
2024-06-07 DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data Qihao Liu et.al. 2406.04322 link
2024-06-06 Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Zhanhao Liang et.al. 2406.04314 null
2024-06-06 ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization Luca Eyring et.al. 2406.04312 link
2024-06-05 Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input Joachim Ott et.al. 2406.03439 null
2024-06-05 Non-stationary Spatio-Temporal Modeling Using the Stochastic Advection-Diffusion Equation Martin Outzen Berild et.al. 2406.03400 link
2024-06-05 Reparameterization invariance in approximate Bayesian inference Hrittik Roy et.al. 2406.03334 null
2024-06-05 UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning Yu Zhang et.al. 2406.03324 null
2024-06-05 Text-to-Image Rectified Flow as Plug-and-Play Priors Xiaofeng Yang et.al. 2406.03293 link
2024-06-05 Relative Entropy for the Numerical Diffusive Limit of the Linear Jin-Xin System Marianne Bessemoulin-Chatard et.al. 2406.03268 null
2024-06-05 Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN Mikołaj Kita et.al. 2406.03233 null
2024-06-05 Holographic drag force with translational symmetry breaking Sara Tahery et.al. 2406.03220 null
2024-06-05 Searching Priors Makes Text-to-Video Synthesis Better Haoran Cheng et.al. 2406.03215 null
2024-06-05 Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Hao Wen et.al. 2406.03184 link
2024-06-04 Dreamguider: Improved Training free Diffusion-based Conditional Generation Nithin Gopalakrishnan Nair et.al. 2406.02549 null
2024-06-05 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Tianchen Zhao et.al. 2406.02540 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-06-04 Guiding a Diffusion Model with a Bad Version of Itself Tero Karras et.al. 2406.02507 null
2024-06-04 Tensor Network Space-Time Spectral Collocation Method for Solving the Nonlinear Convection Diffusion Equation Dibyendu Adak et.al. 2406.02505 null
2024-06-04 Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions Peiyao Lai et.al. 2406.02502 null
2024-06-04 Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation Jiajun Wang et.al. 2406.02485 link
2024-06-04 Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion Colin Hansen et.al. 2406.02477 null
2024-06-04 Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems Jason Hu et.al. 2406.02462 null
2024-05-31 Mixed Diffusion for 3D Indoor Scene Synthesis Siyi Hu et.al. 2405.21066 link
2024-05-31 Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models Jingjing Wang et.al. 2405.21059 null
2024-05-31 Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models Xinxi Zhang et.al. 2405.21050 null
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048 null
2024-05-31 Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging Niloufar Zakariaei et.al. 2405.21021 null
2024-05-31 Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman et.al. 2405.20971 link
2024-06-03 Large Language Models are Zero-Shot Next Location Predictors Ciro Beneduce et.al. 2405.20962 link
2024-05-31 Search of extended emission from HESS J1702-420 with eROSITA Denys Malyshev et.al. 2405.20927 null
2024-05-31 Flow matching achieves minimax optimal convergence Kenji Fukumizu et.al. 2405.20879 null
2024-05-31 MegActor: Harness the Power of Raw Video for Vivid Portrait Animation Shurong Yang et.al. 2405.20851 link
2024-05-30 Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image Kailu Wu et.al. 2405.20343 link
2024-05-30 OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving Lening Wang et.al. 2405.20337 link
2024-05-30 VividDream: Generating 3D Scene with Ambient Dynamics Yao-Chih Lee et.al. 2405.20334 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325 link
2024-05-30 Don't drop your samples! Coherence-aware training benefits Conditional diffusion Nicolas Dufour et.al. 2405.20324 null
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320 link
2024-05-30 DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2405.20289 null
2024-05-30 SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow Chaoyang Wang et.al. 2405.20282 link
2024-05-30 CV-VAE: A Compatible Video VAE for Latent Generative Video Models Sijie Zhao et.al. 2405.20279 link
2024-05-31 KerasCV and KerasNLP: Vision and Language Power-Ups Matthew Watson et.al. 2405.20247 null
2024-05-29 X-VILA: Cross-Modality Alignment for Large Language Model Hanrong Ye et.al. 2405.19335 null
2024-05-29 Hilbert Space Diffusion in Systems with Approximate Symmetries Rahel L. Baumgartner et.al. 2405.19260 null
2024-05-29 Weak Generative Sampler to Efficiently Sample Invariant Distribution of Stochastic Differential Equation Zhiqiang Cai et.al. 2405.19256 null
2024-05-29 ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning Ruchika Chavhan et.al. 2405.19237 link
2024-05-29 Pseudo-Gevrey Smoothing for the Passive Scalar Equations near Couette Jacob Bedrossian et.al. 2405.19233 null
2024-05-29 DiPPeST: Diffusion-based Path Planner for Synthesizing Trajectories Applied on Quadruped Robots Maria Stamatopoulou et.al. 2405.19232 null
2024-05-29 Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification Michail Mamalakis et.al. 2405.19204 null
2024-05-30 $E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation Weitian Zhang et.al. 2405.19203 null
2024-05-29 Going beyond compositional generalization, DDPMs can produce zero-shot interpolation Justin Deschenaux et.al. 2405.19201 link
2024-05-29 Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning Hanye Zhao et.al. 2405.19189 link
2024-05-28 On the Origin of Llamas: Model Tree Heritage Recovery Eliahu Horwitz et.al. 2405.18432 link
2024-05-28 DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention Lianghui Zhu et.al. 2405.18428 link
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407 null
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406 link
2024-05-28 Short-time Fokker-Planck propagator beyond the Gaussian approximation Julian Kappler et.al. 2405.18381 null
2024-05-28 A Hessian-Aware Stochastic Differential Equation for Modelling SGD Xiang Li et.al. 2405.18373 null
2024-05-28 Simulating infinite-dimensional nonlinear diffusion bridges Gefan Yang et.al. 2405.18353 link
2024-05-28 VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers Jun Zheng et.al. 2405.18326 null
2024-05-28 Multi-modal Generation via Cross-Modal In-Context Learning Amandeep Kumar et.al. 2405.18304 link
2024-05-28 CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths Reihaneh Teimouri et.al. 2405.18267 null
2024-05-27 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control Zhengfei Kuang et.al. 2405.17414 null
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405 null
2024-05-27 A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training Kai Wang et.al. 2405.17403 link
2024-05-27 RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control Litu Rout et.al. 2405.17401 null
2024-05-27 EASI-Tex: Edge-Aware Mesh Texturing from Single Image Sai Raj Kishore Perla et.al. 2405.17393 null
2024-05-27 Global existence, fast signal diffusion limit, and $L^\infty$-in-time convergence rates in a competitive chemotaxis system Cordula Reisch et.al. 2405.17392 null
2024-05-27 Supernova Remnants in Gamma Rays Andrea Giuliani et.al. 2405.17384 null
2024-05-27 Muon spin relaxation in mixed perovskite (LaAlO$_3$)$_{x}$(SrAl$_{0.5}$Ta$_{0.5}$O$_3$)$_{1-x}$ with $x\simeq 0.3$ Takashi U. Ito et.al. 2405.17371 null
2024-05-27 Finite Fractal Dimension of uniform attractors for non-autonomous dynamical systems with infinite dimensional symbol space Rafael de Oliveira Moura et.al. 2405.17367 null
2024-05-27 Emergent time crystal from a fractional Langevin equation with white and colored noise David Santiago Quevedo et.al. 2405.17331 null
2024-05-24 Self-consistent evaluation of proximity and inverse proximity effects with pair-breaking in diffusive SN junctions Arpit Raj et.al. 2405.15770 null
2024-05-24 FastDrag: Manipulate Anything in One Step Xuanjia Zhao et.al. 2405.15769 null
2024-05-24 InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation Yuchi Wang et.al. 2405.15758 link
2024-05-24 Looking Backward: Streaming Video-to-Video Translation with Feature Banks Feng Liang et.al. 2405.15757 link
2024-05-24 Score-based generative models are provably robust: an uncertainty quantification perspective Nikiforos Mimikos-Stamatopoulos et.al. 2405.15754 null
2024-05-24 Murray-von Neumann dimension for strictly semifinite weights Aldo Garcia Guinto et.al. 2405.15725 null
2024-05-24 Hierarchical Uncertainty Exploration via Feedforward Posterior Trees Elias Nehme et.al. 2405.15719 null
2024-05-24 Simulation-based inference of radio millisecond pulsars in globular clusters Joanna Berteaud et.al. 2405.15691 null
2024-05-24 Jet Quenching of the Heavy Quarks in the Quark-Gluon Plasma and the Nonadditive Statistics Trambak Bhattacharyya et.al. 2405.15679 null
2024-05-24 Taming Score-Based Diffusion Priors for Infinite-Dimensional Nonlinear Inverse Problems Lorenzo Baldassari et.al. 2405.15676 null
2024-05-23 Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis Basile Van Hoorick et.al. 2405.14868 null
2024-05-23 Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin et.al. 2405.14867 link
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864 null
2024-05-23 Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models Gen Li et.al. 2405.14861 null
2024-05-23 Semantica: An Adaptable Image-Conditioned Diffusion Model Manoj Kumar et.al. 2405.14857 null
2024-05-23 TerDiT: Ternary Diffusion Models with Transformers Xudong Lu et.al. 2405.14854 link
2024-05-23 Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer Shuang Wu et.al. 2405.14832 null
2024-05-23 Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models Katherine Xu et.al. 2405.14828 null
2024-05-23 New limits on neutrino decay from high-energy astrophysical neutrinos Victor B. Valera et.al. 2405.14826 null
2024-05-23 PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher Dongjun Kim et.al. 2405.14822 null
2024-05-21 Personalized Residuals for Concept-Driven Text-to-Image Generation Cusuh Ham et.al. 2405.12978 null
2024-05-21 Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Yue Han et.al. 2405.12970 null
2024-05-21 Differential Walk on Spheres Bailey Miller et.al. 2405.12964 null
2024-05-21 Learning the Infinitesimal Generator of Stochastic Diffusion Processes Vladimir R. Kostic et.al. 2405.12940 null
2024-05-21 Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra Álvaro Tovar-Pardo et.al. 2405.12918 null
2024-05-21 An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation Zhiyu Tan et.al. 2405.12914 link
2024-05-21 Deep HST/UVIS imaging of the candidate dark galaxy CDG-1 Pieter van Dokkum et.al. 2405.12907 null
2024-05-21 Diffusion of brightened dark excitons in a high-angle incommensurate Moiré homobilayer Arnab Barman Ray et.al. 2405.12901 null
2024-05-21 Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images Xiaofei Yu et.al. 2405.12875 link
2024-05-21 High-Field Microscale NMR Spectroscopy with NV Centers in Dipolarly-Coupled Samples Carlos Munuera-Javaloy et.al. 2405.12857 null
2024-05-20 Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen et.al. 2405.12221 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211 link
2024-05-20 Cosmic Ray Diffusion in the Turbulent Interstellar Medium: Effects of Mirror Diffusion and Pitch Angle Scattering Lucas Barreto-Mota et.al. 2405.12146 null
2024-05-20 Two-dimensional signal-dependent parabolic-elliptic Keller-Segel system and its means field derivation Lukas Bol et.al. 2405.12134 null
2024-05-20 An effective advection induced by oscillating microstructure in a diffusion equation David Wiedemann et.al. 2405.12108 null
2024-05-20 Sobolev regularity theory for stochastic reaction-diffusion-advection equations with spatially homogeneous colored noises and variable-order nonlocal operators Jae-Hwan Choi et.al. 2405.11969 null
2024-05-20 Optimal balanced-norm error estimate of the LDG method for reaction-diffusion problems II: the two-dimensional case with layer-upwind flux Yao Cheng et.al. 2405.11939 null
2024-05-20 Nonequilbrium physics of generative diffusion models Zhendong Yu et.al. 2405.11932 null
2024-05-20 "Set It Up!": Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2405.11928 null
2024-05-20 Diff-BGM: A Diffusion Model for Video Background Music Generation Sizhe Li et.al. 2405.11913 link
2024-05-17 Probabilistic transfer learning methodology to expedite high fidelity simulation of reactive flows Bruno S. Soriano et.al. 2405.10944 null
2024-05-17 Reconstruction of Manipulated Garment with Guided Deformation Prior Ren Li et.al. 2405.10934 null
2024-05-17 Limitations of the rate-distribution formalism in describing luminescence quenching in the presence of diffusion Jakub Jędrak et.al. 2405.10903 null
2024-05-17 Improving face generation quality and prompt following with synthetic captions Michail Tarasiou et.al. 2405.10864 null
2024-05-17 Diffusion Geometry Iolo Jones et.al. 2405.10858 null
2024-05-17 Some remarks on a mathematical model for water flow in porous media with competition between transport and diffusion Judita Runcziková et.al. 2405.10751 null
2024-05-17 Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems Hanyu Chen et.al. 2405.10748 link
2024-05-17 Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning Antoine Legouhy et.al. 2405.10723 null
2024-05-17 Numerical Recovery of the Diffusion Coefficient in Diffusion Equations from Terminal Measurement Bangti Jin et.al. 2405.10708 null
2024-05-17 Ratchet-mediated resetting: Current, efficiency, and exact solution Connor Roberts et.al. 2405.10698 null
2024-05-16 Text-to-Vector Generation with Neural Path Representation Peiying Zhang et.al. 2405.10317 null
2024-05-16 Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model Zheng Gu et.al. 2405.10316 null
2024-05-16 CAT3D: Create Anything in 3D with Multi-View Diffusion Models Ruiqi Gao et.al. 2405.10314 null
2024-05-16 Societal Adaptation to Advanced AI Jamie Bernardi et.al. 2405.10295 null
2024-05-16 Power-law relaxation of a confined diffusing particle subject to resetting with memory Denis Boyer et.al. 2405.10283 null
2024-05-16 Interplay between Domain Walls in Type-II Superconductors and Gradients of Temperature/Spin Density Takuma Kanakubo et.al. 2405.10200 null
2024-05-16 Fixed points of maps and nontrivial weak solutions to a class of nonlinear strongly coupled elliptic systems Dung Le et.al. 2405.10171 null
2024-05-16 Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks João Bordalo et.al. 2405.10122 null
2024-05-16 Advancing Set-Conditional Set Generation: Graph Diffusion for Fast Simulation of Reconstructed Particles Dmitrii Kobylianskii et.al. 2405.10106 null
2024-05-16 Spurious reconstruction from brain activity Ken Shirakawa et.al. 2405.10078 link
2024-05-15 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer Chengyu Wu et.al. 2405.09539 link
2024-05-15 A velocity-based moving mesh Discontinuous Galerkin method for the advection-diffusion equation Ezra Rozier et.al. 2405.09408 null
2024-05-15 Probing particle acceleration in Abell 2256: from to 16 MHz to gamma rays E. Osinga et.al. 2405.09384 null
2024-05-15 Diffusion-based Contrastive Learning for Sequential Recommendation Ziqiang Cui et.al. 2405.09369 null
2024-05-15 DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations Nima Fathi et.al. 2405.09288 link
2024-05-15 Searches for Galactic Neutrinos with the IceCube Neutrino observatory A. Sandrock et.al. 2405.09267 null
2024-05-15 Dance Any Beat: Blending Beats with Visuals in Dance Video Generation Xuanchen Wang et.al. 2405.09266 null
2024-05-15 Exact analysis of the two-dimensional asymmetric simple exclusion process with attachment and detachment of particles Yuki Ishiguro et.al. 2405.09261 null
2024-05-15 Propagation of chaos for moderately interacting particle systems related to singular kinetic Mckean-Vlasov SDEs Zimo Hao et.al. 2405.09195 null
2024-05-15 QMedShield: A Novel Quantum Chaos-based Image Encryption Scheme for Secure Medical Image Storage in the Cloud Arun Amaithi Rajan et.al. 2405.09191 null
2024-05-14 The Flux Hypothesis for Odd Transport Phenomena Cory Hargus et.al. 2405.08798 null
2024-05-14 A Generalized Curvilinear Coordinate system-based Patch Dynamics Scheme in Equation-free Multiscale Modelling Tanay Kumar Karmakar et.al. 2405.08764 null
2024-05-14 Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Zhimin Li et.al. 2405.08748 link
2024-05-14 Dimensionality reduction in bulk-boundary reaction-diffusion systems Tom Burkart et.al. 2405.08728 null
2024-05-14 Design and Analysis of Resilient Vehicular Platoon Systems over Wireless Networks Tingyu Shui et.al. 2405.08706 null
2024-05-14 Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models Bingdong Li et.al. 2405.08674 null
2024-05-14 Quantum Circuit Model for Lattice Boltzmann Fluid Flow Simulations Dinesh Kumar E et.al. 2405.08669 null
2024-05-14 Anomalous Landau damping and algebraic thermalization in two-dimensional superfluids far from equilibrium Clément Duval et.al. 2405.08606 null
2024-05-14 PTPI-DL-ROMs: pre-trained physics-informed deep learning-based reduced order models for nonlinear parametrized PDEs Simone Brivio et.al. 2405.08558 null
2024-05-14 $γ$ rays from in-flight positron annihilation as a probe of new physics Pedro De la Torre Luque et.al. 2405.08482 null
2024-05-13 Cloaking for random walks using a discrete potential theory Trent DeGiovanni et.al. 2405.07961 link
2024-05-13 Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data Mahdi Morafah et.al. 2405.07925 null
2024-05-13 CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models Nick Stracke et.al. 2405.07913 null
2024-05-13 Latest results from Super-Kamiokande Andrew D. Santos et.al. 2405.07900 null
2024-05-13 Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging Chi-en Amy Tai et.al. 2405.07861 null
2024-05-13 Radiogenomic biomarkers for immunotherapy in glioblastoma: A systematic review of magnetic resonance imaging studies Prajwal Ghimire et.al. 2405.07858 null
2024-05-13 Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction Chi-en Amy Tai et.al. 2405.07854 null
2024-05-13 SAR Image Synthesis with Diffusion Models Denisa Qosja et.al. 2405.07776 null
2024-05-13 LGDE: Local Graph-based Dictionary Expansion Dominik J. Schindler et.al. 2405.07764 link
2024-05-13 FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation Jianyi Chen et.al. 2405.07682 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547 link
2024-05-10 Controllable Image Generation With Composed Parallel Token Prediction Jamie Stirling et.al. 2405.06535 null
2024-05-10 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461 null
2024-05-10 A universal phenomenology of charge-spin interconversion and dynamics in diffusive systems with spin-orbit coupling Tim Kokkeler et.al. 2405.06334 null
2024-05-10 PUMA: margin-based data pruning Javier Maroto et.al. 2405.06298 null
2024-05-10 Green's Function and Pointwise Space-time Behaviors of the Three-Dimensional Relativistic Boltzmann Equation Yanchao Li et.al. 2405.06280 null
2024-05-10 Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging Zhuchen Shao et.al. 2405.06175 null
2024-05-10 Integrability-preserving regularizations of Laplacian Growth Razvan Teodorescu et.al. 2405.06167 null
2024-05-10 Dispersivity calculation in digital twins of multiscale porous materials using the micro-continuum approach Julien Maes et.al. 2405.06155 null
2024-05-09 Modelling the random spreading of fake news through a two-dimensional time-inhomogeneous birth-death process Antonio Di Crescenzo et.al. 2405.06123 null
2024-05-09 Distilling Diffusion Models into Conditional GANs Minguk Kang et.al. 2405.05967 null
2024-05-09 Towards comprehensive coverage of chemical space: Quantum mechanical properties of 836k constitutional and conformational closed shell neutral isomers consisting of HCNOFSiPSClBr Danish Khan et.al. 2405.05961 null
2024-05-09 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask Zineb Senane et.al. 2405.05959 link
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953 null
2024-05-09 Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers Peng Gao et.al. 2405.05945 link
2024-05-09 Composable Part-Based Manipulation Weiyu Liu et.al. 2405.05876 null
2024-05-09 Parameter identification for an uncertain reaction-diffusion equation via setpoint regulation Gildas Besançon et.al. 2405.05866 null
2024-05-09 Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control Gunshi Gupta et.al. 2405.05852 link
2024-05-09 Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models Zhe Ma et.al. 2405.05846 null
2024-05-09 MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction Pinhuang Tan et.al. 2405.05814 null
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255 link
2024-05-08 Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models Hongjie Wang et.al. 2405.05252 null
2024-05-08 Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation Jonas Kohler et.al. 2405.05224 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216 link
2024-05-08 An adaptive finite element multigrid solver using GPU acceleration Manuel Liebchen et.al. 2405.05047 null
2024-05-08 Reviewing Intelligent Cinematography: AI research for camera-based video production Adrian Azzarelli et.al. 2405.05039 null
2024-05-08 Monitoring of neoadjuvant chemotherapy through time domain diffuse optics: Breast tissue composition changes and collagen discriminative potential Nikhitha Mule et.al. 2405.05035 null
2024-05-08 An anti-noise seismic inversion method based on diffusion model Yingtian Liu et.al. 2405.05026 null
2024-05-08 Stochastic spatial Lotka-Volterra predator-prey models Uwe C. Täuber et.al. 2405.05006 null
2024-05-08 A unified theory of the self-similar supersonic Marshak wave problem Menahem Krief et.al. 2405.04981 null
2024-05-07 Tactile-Augmented Radiance Fields Yiming Dou et.al. 2405.04534 link
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496 null
2024-05-07 CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model Haixia Xiao et.al. 2405.04483 null
2024-05-07 Derivation of kinetic and diffusion equations from a hard-sphere Rayleigh gas using collision trees and semigroups Karsten Matthies et.al. 2405.04449 null
2024-05-07 Brownian Motion on The Spider Like Quantum Graphs Madhumita Paul et.al. 2405.04439 null
2024-05-07 Learning local Dirichlet-to-Neumann maps of nonlinear elliptic PDEs with rough coefficients Miranda Boutilier et.al. 2405.04433 null
2024-05-07 Josephson threshold detector in the phase diffusion regime Dmitry A. Ladeynov et.al. 2405.04426 null
2024-05-07 Mathematical Modeling of $^{18}$F-Fluoromisonidazole ($^{18}$F-FMISO) Radiopharmaceutical Transport in Vascularized Solid Tumors Mohammad Amin Abazari et.al. 2405.04418 null
2024-05-07 Community Detection for Heterogeneous Multiple Social Networks Ziqing Zhu et.al. 2405.04371 null
2024-05-07 Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos Junyi Ma et.al. 2405.04370 link
2024-05-06 An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas Mira Slavcheva et.al. 2405.03682 null
2024-05-06 Field-of-View Extension for Diffusion MRI via Deep Generative Models Chenyu Gao et.al. 2405.03652 null
2024-05-06 Cosine Annealing Optimized Denoising Diffusion Error Correction Codes Congyang Ou et.al. 2405.03638 null
2024-05-06 Strang Splitting for Parametric Inference in Second-order Stochastic Differential Equations Predrag Pilipovic et.al. 2405.03606 null
2024-05-06 Dissipative gradient nonlinearities prevent $δ$-formations in local and nonlocal attraction-repulsion chemotaxis models Tongxing Li et.al. 2405.03586 null
2024-05-06 Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models Ludwig Winkler et.al. 2405.03549 null
2024-05-06 CCDM: Continuous Conditional Diffusion Models for Image Generation Xin Ding et.al. 2405.03546 link
2024-05-06 Asymptotic-preserving hybridizable discontinuous Galerkin method for the Westervelt quasilinear wave equation Sergio Gómez et.al. 2405.03535 null
2024-05-06 Quasi-Monte Carlo for Bayesian design of experiment problems governed by parametric PDEs Vesa Kaarnioja et.al. 2405.03529 null
2024-05-06 On anomalous dissipation induced by transport noise Antonio Agresti et.al. 2405.03525 null
2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu et.al. 2405.02280 link
2024-05-03 Relic gravitons and non-stationary processes Massimo Giovannini et.al. 2405.02193 null
2024-05-03 Tangentially Active Polymers in Cylindrical Channels José Martín-Roca et.al. 2405.02192 null
2024-05-03 Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving Haicheng Liao et.al. 2405.02145 null
2024-05-03 Global regularity and infinite Prandtl number limit of temperature patches for the 2D Boussinesq system Omar Lazar et.al. 2405.02137 null
2024-05-03 Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling Radek Erban et.al. 2405.02117 null
2024-05-03 On variable annuities with surrender charges Tiziano De Angelis et.al. 2405.02115 null
2024-05-03 Anomalous transport in the quantum East-West kinetically constrained model Pietro Brighi et.al. 2405.02102 null
2024-05-03 Radiative and mechanical energies in galaxies I. Contributions of molecular shocks and PDRs in 3C 326 N J. A. Villa-Vélez et.al. 2405.02058 null
2024-05-03 The CO-dark molecular gas in the cold HI arc Gan Luo et.al. 2405.02055 null
2024-05-02 Customizing Text-to-Image Models with a Single Image Pair Maxwell Jones et.al. 2405.01536 null
2024-05-02 The heat equation with time-correlated random potential in d=2: Edwards-Wilkinson fluctuations Sotirios Kotitsas et.al. 2405.01519 null
2024-05-02 Effective Lifshitz black holes, hydrodynamics, and transport coefficients in fluid/gravity correspondence D. C. Moreira et.al. 2405.01505 null
2024-05-02 LocInv: Localization-aware Inversion for Text-Guided Image Editing Chuanming Tang et.al. 2405.01496 link
2024-05-02 Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models Matias Mendieta et.al. 2405.01494 null
2024-05-02 StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Yupeng Zhou et.al. 2405.01434 link
2024-05-02 In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies Yunbum Kook et.al. 2405.01425 null
2024-05-02 Statistical algorithms for low-frequency diffusion data: A PDE approach Matteo Giordano et.al. 2405.01372 link
2024-05-02 On Nanowire Morphological Instability and Pinch-Off by Surface Electromigration Mikhail Khenner et.al. 2405.01331 null
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248 null
2024-05-01 TexSliders: Diffusion-Based Texture Editing in CLIP Space Julia Guerrero-Viu et.al. 2405.00672 null
2024-05-01 RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models Zheng Zeng et.al. 2405.00666 null
2024-05-01 Large deviations of current for the symmetric simple exclusion process on a semi-infinite line and on an infinite line with slow bonds Kapil Sharma et.al. 2405.00654 null
2024-05-01 Stochastic fluids with transport noise: Approximating diffusion from data using SVD and ensemble forecast back-propagation James Woodfield et.al. 2405.00640 null
2024-05-01 Vacancy-mediated transport and segregation tendencies of solutes in FCC nickel under diffusional creep: A density functional theory study Shehab Shousha et.al. 2405.00639 null
2024-05-01 Engine-fed Kilonovae (Mergernovae) -- II. Radiation Shunke Ai et.al. 2405.00638 null
2024-05-01 Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure Assefa Seyoum Wahd et.al. 2405.00631 null
2024-05-01 Hysteresis and Self-Oscillations in an Artificial Memristive Quantum Neuron Finlay Potter et.al. 2405.00624 null
2024-05-01 Lane Segmentation Refinement with Diffusion Models Antonio Ruiz et.al. 2405.00620 null
2024-05-01 Anomalous diffusion and factor ordering in (1+1)-dimensional Lorentzian quantum gravity Elijah Sanderson et.al. 2405.00594 null
2024-04-30 MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model Wenxun Dai et.al. 2404.19759 link
2024-04-30 Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting Paul Engstler et.al. 2404.19758 null
2024-04-30 Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation Ian Dunn et.al. 2404.19739 link
2024-04-30 Investigating the correlations between IceCube high-energy neutrinos and Fermi-LAT $γ$-ray observations. II Ming-Xuan Lu et.al. 2404.19730 null
2024-04-30 X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models Emmanuelle Bourigault et.al. 2404.19604 null
2024-04-30 Cool-core, X-ray cavities and cold front revealed in RXCJ0352.9+1941 cluster by Chandra and GMRT observations Satish S. Sonkamble et.al. 2404.19549 null
2024-04-30 Shocks in the Warm Neutral Medium I -- Theoretical model Benjamin Godard et.al. 2404.19533 null
2024-04-30 MicroDreamer: Zero-shot 3D Generation in $\sim$20 Seconds by Score-based Iterative Reconstruction Luxi Chen et.al. 2404.19525 link
2024-04-30 Well-posedness of McKean-Vlasov SDEs with density-dependent drift Anh-Dung Le et.al. 2404.19499 null
2024-04-30 TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models Teng Zhou et.al. 2404.19475 null
2024-04-29 Stylus: Automatic Adapter Selection for Diffusion Models Michael Luo et.al. 2404.18928 null
2024-04-29 TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation Junhao Cheng et.al. 2404.18919 link
2024-04-29 Learning general Gaussian mixtures with efficient score matching Sitan Chen et.al. 2404.18893 null
2024-04-29 A Survey on Diffusion Models for Time Series and Spatio-Temporal Data Yiyuan Yang et.al. 2404.18886 link
2024-04-29 Learning Mixtures of Gaussians Using Diffusion Models Khashayar Gatmiry et.al. 2404.18869 null
2024-04-29 Construction of local reduced spaces for Friedrichs' systems via randomized training Christian Engwer et.al. 2404.18839 null
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820 null
2024-04-29 Spectral measures and iterative bounds for effective diffusivity of steady and space-time periodic flows N. B. Murphy et.al. 2404.18754 null
2024-04-29 Diffuse scattering from dynamically compressed single-crystal zirconium following the pressure-induced $α\toω$ phase transition P. G. Heighway et.al. 2404.18740 null
2024-04-29 Diffusion coefficient matrix for multiple conserved charges: a Kubo approach Sourav Dey et.al. 2404.18718 null
2024-04-26 Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos Zhengze Xu et.al. 2404.17571 null
2024-04-26 MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Shangzhan Zhang et.al. 2404.17569 null
2024-04-26 [OI] fine structure line profiles in Mon R2 and M17 SW: the puzzling nature of cold foreground material identified by [12CII] self-absorption C. Guevara et.al. 2404.17538 null
2024-04-26 Reduction of the effective population size in a branching particle system in the moderate mutation-selection regime Florin Boenkost et.al. 2404.17527 null
2024-04-26 Chemotaxis-inspired PDE model for airborne infectious disease transmission: analysis and simulations Pierluigi Colli et.al. 2404.17506 null
2024-04-26 TextGaze: Gaze-Controllable Face Generation with Natural Language Hengfei Wang et.al. 2404.17486 null
2024-04-26 Consistent Second Moment Methods with Scalable Linear Solvers for Radiation Transport Samuel Olivier et.al. 2404.17473 null
2024-04-26 Quasi particle model vs lattice QCD thermodynamics: extension to $N_f=2+1+1$ flavors and momentum dependent quark masses Maria Lucia Sambataro et.al. 2404.17459 null
2024-04-26 Vaporization dynamics of a super-heated water-in-oil droplet: modeling and numerical solution Muhammad Saeed Saleem et.al. 2404.17457 null
2024-04-26 Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation Seungwook Kim et.al. 2404.17419 null
2024-04-25 The Third Monocular Depth Estimation Challenge Jaime Spencer et.al. 2404.16831 null
2024-04-25 Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials Ye Fang et.al. 2404.16829 null
2024-04-25 ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving Jiehui Huang et.al. 2404.16771 link
2024-04-25 Analysis of Ethanol Blending Effects on Auto-Ignition and Heat Release in n-Heptane/Ethanol Non-Premixed Flames Liang Ji et.al. 2404.16762 null
2024-04-25 Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior Han Wang et.al. 2404.16678 null
2024-04-25 The First Estimation of the Ambipolar Diffusivity Coefficient from Multi-Scale Observations of the Class 0/I Protostar, HOPS-370 Travis J. Thieme et.al. 2404.16668 null
2024-04-25 Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method A. Emir Gumrukcuoglu et.al. 2404.16658 null
2024-04-25 Denoising: from classical methods to deep CNNs Jean-Eric Campagne et.al. 2404.16617 link
2024-04-25 Stochastic Dissipative Euler's equations for a free body J. A. de la Torre et.al. 2404.16613 null
2024-04-25 MuseumMaker: Continual Style Customization without Catastrophic Forgetting Chenxi Liu et.al. 2404.16612 null
2024-04-24 Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models Xu Shen et.al. 2404.15625 null
2024-04-24 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620 link
2024-04-23 Measuring topological constraint relaxation in ring-linear polymer blends Daniel L. Vigil et.al. 2404.15560 null
2024-04-23 Thermal boundary conductance of sharp metal-diamond interfaces predicted by machine learning molecular dynamics Khalid Zobaid Adnan et.al. 2404.15465 null
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449 null
2024-04-23 GLoD: Composing Global Contexts and Local Details in Image Generation Moyuru Yamada et.al. 2404.15447 null
2024-04-23 Thermal boundary conductance and thermal conductivity strongly depend on nearby environment Khalid Zobaid Adnan et.al. 2404.15439 null
2024-04-23 ID-Animator: Zero-Shot Identity-Preserving Human Video Generation Xuanhua He et.al. 2404.15275 link
2024-04-23 From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation Zehuan Huang et.al. 2404.15267 null
2024-04-23 Score matching for sub-Riemannian bridge sampling Erlend Grong et.al. 2404.15258 null
2024-04-23 Nucleation mechanism of multiple-order parameter ferroelectric domain wall motion in hafnia Songsong Zhou et.al. 2404.15251 null
2024-04-23 Local well-posedness for a novel nonlocal model for cell-cell adhesion via receptor binding Mabel Lizzy Rajendran et.al. 2404.15222 null
2024-04-23 Heat flow, log-concavity, and Lipschitz transport maps Giovanni Brigati et.al. 2404.15205 null
2024-04-23 Signature of Particle Diffusion on the X-ray Spectra of the blazar Mkn 421 C. Baheeja et.al. 2404.15171 null
2024-04-23 A general multi-wave quasi-resonance theory for lattice energy diffusion Wei Lin et.al. 2404.15147 null
2024-04-23 CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method Mingbao Lin et.al. 2404.15141 link
2024-04-23 Taming Diffusion Probabilistic Models for Character Control Rui Chen et.al. 2404.15121 null
2024-04-22 Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses Inhee Lee et.al. 2404.14410 null
2024-04-22 GeoDiffuser: Geometry-Based Image Editing with Diffusion Models Rahul Sajnani et.al. 2404.14403 null
2024-04-22 Observational characterisation of large-scale transport and horizontal turbulent diffusivity in the quiet Sun F. Rincon et.al. 2404.14383 null
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381 link
2024-04-22 Temporal Entanglement Profiles in Dual-Unitary Clifford Circuits with Measurements Jiangtian Yao et.al. 2404.14374 null
2024-04-22 Operando Analysis of Adsorption-Limited Hydrogen Oxidation Reaction at Palladium Surfaces Yukun Liu et.al. 2404.14348 null
2024-04-22 Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion Alexander Shmakov et.al. 2404.14332 null
2024-04-22 X-Ray: A Sequential 3D Representation for Generation Tao Hu et.al. 2404.14329 link
2024-04-22 Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training Yiming Liu et.al. 2404.14309 null
2024-04-22 Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity Yu Hou et.al. 2404.14240 link
2024-04-19 Analysis of Classifier-Free Guidance Weight Schedulers Xi Wang et.al. 2404.13040 null
2024-04-19 A multigrain-multilayer astrochemical model with variable desorption energy for surface species Juris Kalvans et.al. 2404.13011 null
2024-04-19 RadRotator: 3D Rotation of Radiographs with Diffusion Models Pouria Rouzrokh et.al. 2404.13000 null
2024-04-19 Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics Xiaofei Wang et.al. 2404.12973 null
2024-04-19 On the McKean-Vlasov SDE with branching Julien Claisse et.al. 2404.12964 null
2024-04-19 Robust hybrid finite element methods for reaction-dominated diffusion problems Thomas Führer et.al. 2404.12956 null
2024-04-19 Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling Grigory Bartosh et.al. 2404.12940 null
2024-04-19 Diffusive contact between randomly driven colloidal suspensions Galor Geva et.al. 2404.12929 null
2024-04-19 Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models Konstantinos Vilouras et.al. 2404.12920 null
2024-04-19 Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images Santosh et.al. 2404.12908 link
2024-04-18 G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis Yufei Ye et.al. 2404.12383 null
2024-04-18 Lazy Diffusion Transformer for Interactive Image Editing Yotam Nitzan et.al. 2404.12382 null
2024-04-18 Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models Trevor J. Chan et.al. 2404.12361 null
2024-04-18 AniClipart: Clipart Animation with Text-to-Video Priors Ronghuan Wu et.al. 2404.12347 null
2024-04-18 Customizing Text-to-Image Diffusion with Camera Viewpoint Control Nupur Kumari et.al. 2404.12333 null
2024-04-18 Guided Discrete Diffusion for Electronic Health Record Generation Zixiang Chen et.al. 2404.12314 null
2024-04-18 Investigation of Spin-Pumping and -Transport in the Ni80Fe20/Pt/Co Asymmetric Trilayer Shilpa Samdani et.al. 2404.12307 null
2024-04-18 RISE: 3D Perception Makes Real-World Robot Imitation Simple and Effective Chenxi Wang et.al. 2404.12281 null
2024-04-18 A New Computational Method for Energetic Particle Acceleration and Transport with its Feedback Jeongbhin Seo et.al. 2404.12276 null
2024-04-18 Tree-Based Nonlinear Reduced Modeling Diane Guignard et.al. 2404.12262 null
2024-04-17 Factorized Diffusion: Perceptual Illusions by Noise Decomposition Daniel Geng et.al. 2404.11615 null
2024-04-17 InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior Zhiheng Liu et.al. 2404.11613 null
2024-04-17 IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination Xi Chen et.al. 2404.11593 null
2024-04-17 Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Zezhong Fan et.al. 2404.11589 null
2024-04-17 Emulators for scarce and noisy data: application to auxiliary field diffusion Monte Carlo for the deuteron Rahul Somasundaram et.al. 2404.11566 null
2024-04-17 MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation Kuan-Chieh et.al. 2404.11565 null
2024-04-17 Predicting Long-horizon Futures by Conditioning on Geometry and Time Tarasha Khurana et.al. 2404.11554 null
2024-04-17 A Bayesian level-set inversion method for simultaneous reconstruction of absorption and diffusion coefficients in diffuse optical tomography Anuj Abhishek et.al. 2404.11552 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537 null
2024-04-17 Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt Zhanjie Zhang et.al. 2404.11474 link
2024-04-16 Searching for cold gas traced by MgII quasar absorbers in massive X-ray-selected galaxy clusters A. Y. Fresco et.al. 2404.10773 null
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765 null
2024-04-16 LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Yuchi Wang et.al. 2404.10763 link
2024-04-16 A High-Order Conservative Cut Finite Element Method for Problems in Time-Dependent Domains Sebastian Myrbäck et.al. 2404.10756 link
2024-04-16 GazeHTA: End-to-end Gaze Target Detection with Head-Target Association Zhi-Yi Lin et.al. 2404.10718 null
2024-04-16 Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution Yutao Yuan et.al. 2404.10688 link
2024-04-16 Generating Human Interaction Motions in Scenes with Text Control Hongwei Yi et.al. 2404.10685 null
2024-04-16 StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization Yingshu Chen et.al. 2404.10681 null
2024-04-16 Arsenic diffusion in MOVPE-Grown GaAs/Ge epitaxial structures V. Orejuela et.al. 2404.10669 null
2024-04-16 Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay Jinmei Liu et.al. 2404.10662 link
2024-04-15 Accurate quantum Monte Carlo forces for machine-learned force fields: Ethanol as a benchmark Emiel Slootman et.al. 2404.09755 null
2024-04-15 Electric potential during tokamak disruptions and steady-state current drive Allen H Boozer et.al. 2404.09744 null
2024-04-15 Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement Wenyi Lian et.al. 2404.09735 link
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732 link
2024-04-15 Structure and dynamics of active string fluids and gels formed by dipolar active Brownian particles Maria Kelidou et.al. 2404.09693 null
2024-04-15 Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis Alessa Hering et.al. 2404.09666 null
2024-04-15 Impact of chirality on active Brownian particle: Exact moments in two and three dimensions Anweshika Pattanayak et.al. 2404.09650 null
2024-04-15 All-in-one simulation-based inference Manuel Gloeckler et.al. 2404.09636 link
2024-04-15 Branching diffusion processes and spectral properties of Feynman-Kac semigroup Pierre Collet et.al. 2404.09568 null
2024-04-15 Entropy on the Path Space and Application to Singular Diffusions and Mean-field Models Patrick Cattiaux et.al. 2404.09552 null
2024-04-15 Turbulent ice-ocean boundary layers in the well-mixed regime: insights from direct numerical simulations Louis-Alexandre Couston et.al. 2404.09545 null
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580 null
2024-04-12 Functional reducibility of higher-order networks Maxime Lucas et.al. 2404.08547 link
2024-04-12 Echoes of darkness: Supernova-neutrino-boosted dark matter from all galaxies Yen-Hsun Lin et.al. 2404.08528 link
2024-04-12 Generalized Hydrodynamics for the Volterra lattice: Ballistic and nonballistic behavior of correlation functions Guido Mazzuca et.al. 2404.08499 null
2024-04-12 PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction Siming Shan et.al. 2404.08412 null
2024-04-12 Estimate of force noise from electrostatic patch potentials in LISA Pathfinder Stefano Vitale et.al. 2404.08340 null
2024-04-12 Struggle with Adversarial Defense? Try Diffusion Yujie Li et.al. 2404.08273 null
2024-04-12 An XRISM observation proposal: Gas velocity in the merging cluster Abell 2256 Takayuki Tamura et.al. 2404.08267 null
2024-04-12 Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models Zeyu Yang et.al. 2404.08254 null
2024-04-12 An Asymptotically-Correct Implicit-Explicit Time Integration Scheme for Finite Volume Radiation-Hydrodynamics Chong-Chong He et.al. 2404.08247 link
2024-04-11 OpenBias: Open-set Bias Detection in Text-to-Image Generative Models Moreno D'Incà et.al. 2404.07990 link
2024-04-11 ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback Ming Li et.al. 2404.07987 link
2024-04-11 View Selection for 3D Captioning via Diffusion Ranking Tiange Luo et.al. 2404.07984 null
2024-04-11 Taming Stable Diffusion for Text to 360° Panorama Image Generation Cheng Zhang et.al. 2404.07949 link
2024-04-11 Active Carpets in floating viscous films Felipe A. Barros et.al. 2404.07856 null
2024-04-11 Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations Yunhong Deng et.al. 2404.07844 null
2024-04-11 The Cattaneo-Christov approximation of Fourier heat-conductive compressible fluids Timothée Crin-Barat et.al. 2404.07809 null
2024-04-11 ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Lifan Jiang et.al. 2404.07773 link
2024-04-11 An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization Minshuo Chen et.al. 2404.07771 null
2024-04-11 Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations Yufeng Yue et.al. 2404.07770 null
2024-04-10 GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models Zewei Zhang et.al. 2404.07206 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199 null
2024-04-10 InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models Jiale Xu et.al. 2404.07191 link
2024-04-10 Move Anything with Layered Scene Diffusion Jiawei Ren et.al. 2404.07178 null
2024-04-10 Understanding Dynamics in Coarse-Grained Models: IV. Connection of Fine-Grained and Coarse-Grained Dynamics with the Stokes-Einstein and Stokes-Einstein-Debye Relations Jaehyeok Jin et.al. 2404.07156 null
2024-04-10 A conservative Eulerian finite element method for transport and diffusion in moving domains Maxim Olshanskii et.al. 2404.07130 link
2024-04-10 Open reaction-diffusion systems: bridging probabilistic theory across scales Mauricio J. del Razo et.al. 2404.07119 null
2024-04-10 Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion Alexander Lobashev et.al. 2404.07029 link
2024-04-10 On the conjugate interface conditions and Galilean invariance Yang Hu et.al. 2404.07025 null
2024-04-10 Non-Degenerate One-Time Pad and the integrity of perfectly secret messages Alex Shafarenko et.al. 2404.07022 null
2024-04-09 Convergence analysis of novel discontinuous Galerkin methods for a convection dominated problem Satyajith Bommana Boyana et.al. 2404.06490 null
2024-04-09 Uncovering Tidal Treasures: Automated Classification of Faint Tidal Features in DECaLS Data Alexander J. Gordon et.al. 2404.06487 null
2024-04-09 GeoDirDock: Guiding Docking Along Geodesic Paths Raúl Miñán et.al. 2404.06481 null
2024-04-09 Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion Fan Yang et.al. 2404.06429 null
2024-04-09 ZeST: Zero-Shot Material Transfer from a Single Image Ta-Ying Cheng et.al. 2404.06425 null
2024-04-09 Policy-Guided Diffusion Matthew Thomas Jackson et.al. 2404.06356 link
2024-04-09 Quantum State Generation with Structure-Preserving Diffusion Model Yuchen Zhu et.al. 2404.06336 null
2024-04-09 Compensating slice emittance growth in high brightness photoinjectors using sacrificial charge W. H. Li et.al. 2404.06312 null
2024-04-09 NoiseNCA: Noisy Seed Improves Spatio-Temporal Continuity of Neural Cellular Automata Ehsan Pajouheshgar et.al. 2404.06279 null
2024-04-09 A Large-Scale Simulation Method for Neuromorphic Circuits Amir Shahhosseini et.al. 2404.06255 null
2024-04-08 The neutrino background from non-jetted active galactic nuclei P. Padovani et.al. 2404.05690 null
2024-04-08 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Kunpeng Song et.al. 2404.05674 link
2024-04-08 NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement Giordano Cicchetti et.al. 2404.05669 link
2024-04-08 YaART: Yet Another ART Rendering Technology Sergey Kastryulin et.al. 2404.05666 null
2024-04-08 BinaryDM: Towards Accurate Binarization of Diffusion Model Xingyu Zheng et.al. 2404.05662 link
2024-04-08 Convergence rates for the finite volume scheme of the stochastic heat equation Niklas Sapountzoglou et.al. 2404.05655 null
2024-04-09 The persistence of high altitude non-equilibrium diffuse ionized gas in simulations of star forming galaxies Lewis McCallum et.al. 2404.05651 null
2024-04-08 Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model Jichang Yang et.al. 2404.05648 link
2024-04-08 eDIG-CHANGES II: Project Design and Initial Results on NGC 3556 Jiang-Tao Li et.al. 2404.05628 null
2024-04-08 Learning a Category-level Object Pose Estimator without Pose Annotations Fengrui Tian et.al. 2404.05626 null
2024-04-05 Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models Sangwon Jang et.al. 2404.04243 null
2024-04-05 ToolEENet: Tool Affordance 6D Pose Estimation Yunlong Wang et.al. 2404.04193 null
2024-04-05 Nonlocally coupled moisture model for convective self-aggregation Tomoro Yanase et.al. 2404.04146 null
2024-04-05 Rare events, time crystals and symmetry-breaking dynamical phase transitions Rubén Hurtado-Gutiérrez et.al. 2404.04135 null
2024-04-05 A posteriori error analysis of a space-time hybridizable discontinuous Galerkin method for the advection-diffusion problem Yuan Wang et.al. 2404.04130 null
2024-04-05 Dynamic Prompt Optimizing for Text-to-Image Generation Wenyi Mo et.al. 2404.04095 link
2024-04-05 A first passage model of intravitreal drug delivery and residence time, in relation to ocular geometry, individual variability, and injection location Patricia Lamirande et.al. 2404.04086 null
2024-04-05 Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation Mingyuan Zhou et.al. 2404.04057 link
2024-04-05 InstructHumans: Editing Animated 3D Human Textures with Instructions Jiayin Zhu et.al. 2404.04037 null
2024-04-05 Impacts of non-thermal emission on the images of black hole shadow and extended jets in two-temperature GRMHD simulations Mingyuan Zhang et.al. 2404.04033 null
2024-04-04 MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation Hanzhe Hu et.al. 2404.03656 null
2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Dongzhi Jiang et.al. 2404.03653 link
2024-04-04 The More You See in 2D, the More You Perceive in 3D Xinyang Han et.al. 2404.03652 null
2024-04-04 DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior Yiming Zhang et.al. 2404.03642 null
2024-04-04 LCM-Lookahead for Encoder-based Text-to-Image Personalization Rinon Gal et.al. 2404.03620 null
2024-04-04 DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images Zhou Jie et.al. 2404.03595 link
2024-04-04 PointInfinity: Resolution-Invariant Point Diffusion Models Zixuan Huang et.al. 2404.03566 null
2024-04-04 Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models Siyuan Mei et.al. 2404.03541 null
2024-04-04 Impact of the Magnetic Horizon on the Interpretation of the Pierre Auger Observatory Spectrum and Composition Data The Pierre Auger Collaboration et.al. 2404.03533 null
2024-04-04 Significantly Enhanced Vacancy Diffusion in Mn-containing Alloys Huaqing Guan et.al. 2404.03339 null
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903 link
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899 null
2024-04-03 On the Scalability of Diffusion-based Text-to-Image Generation Hao Li et.al. 2404.02883 null
2024-04-03 Uniqueness of the blow-down limit for triple junction problem Zhiyuan Geng et.al. 2404.02859 null
2024-04-03 Efficient Quantum Circuits for Non-Unitary and Unitary Diagonal Operators with Space-Time-Accuracy trade-offs Julien Zylberman et.al. 2404.02819 null
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767 null
2024-04-03 Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Wentian Zhang et.al. 2404.02747 link
2024-04-03 InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation Haofan Wang et.al. 2404.02733 link
2024-04-03 Harnessing the Power of Large Vision Language Models for Synthetic Image Detection Mamadou Keita et.al. 2404.02726 link
2024-04-02 Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148 link
2024-04-02 A Stabilized Parametric Finite Element Method for Surface Diffusion with an Arbitrary Surface Energy Yulin Zhang et.al. 2404.02083 null
2024-04-02 WcDT: World-centric Diffusion Transformer for Traffic Scene Generation Chen Yang et.al. 2404.02082 link
2024-04-02 Brownian Particles and Matter Waves Nicos Makris et.al. 2404.02016 null
2024-04-02 Superionic Fluoride Gate Dielectrics with Low Diffusion Barrier for Advanced Electronics Kui Meng et.al. 2404.02011 null
2024-04-02 AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design Xinze Li et.al. 2404.02003 null
2024-04-02 Rigorous derivation of an effective model for coupled Stokes advection, reaction and diffusion with freely evolving microstructure Markus Gahn et.al. 2404.01983 null
2024-04-02 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959 link
2024-04-02 Nonlinear stability for active suspensions Helge Dietert et.al. 2404.01906 null
2024-04-02 On the surface helium abundance of B-type hot subdwarf stars from the WD+MS channel of Type Ia supernovae Rui-Jie Ji et.al. 2404.01905 null
2024-03-29 Relation Rectification in Diffusion Model Yinwei Wu et.al. 2403.20249 null
2024-03-29 Graph Neural Aggregation-diffusion with Metastability Kaiyuan Cui et.al. 2403.20221 null
2024-03-29 Scaled Brownian motion with random anomalous diffusion exponent Hubert Woszczek et.al. 2403.20206 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193 null
2024-03-29 Energy solutions of the Cauchy-Dirichlet problem for fractional nonlinear diffusion equations Goro Akagi et.al. 2403.20176 null
2024-03-29 Na Vacancy Driven Phase Transformation and Fast Ion Conduction in W-doped Na$_3$SbS$_4$ from Machine Learning Force Fields Johan Klarbring et.al. 2403.20138 null
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105 null
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079 null
2024-03-29 Efficacy of the Sterile Insect Technique in the presence of inaccessible areas: A study using two-patch models Pierre-Alexandre Bliman et.al. 2403.20069 null
2024-03-29 Optimal s-boxes against alternative operations Marco Calderini et.al. 2403.20059 null
2024-03-28 GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling Bowen Zhang et.al. 2403.19655 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652 null
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645 null
2024-03-28 In the driver's mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles Samir H. A. Mohammad et.al. 2403.19637 null
2024-03-28 Generalisation of the Spectral Difference scheme for the diffused-interface five equation model Niccolò Tonicello et.al. 2403.19623 null
2024-03-28 More on Black Holes Perceiving the Dark Dimension Luis A. Anchordoqui et.al. 2403.19604 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593 null
2024-03-28 Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics Norman Di Palo et.al. 2403.19578 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818 null
2024-03-27 Garment3DGen: 3D Garment Stylization and Texture Generation Nikolaos Sarafianos et.al. 2403.18816 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807 link
2024-03-27 Dimension-independent functional inequalities by tensorization and projection arguments Fabrice Baudoin et.al. 2403.18799 null
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791 link
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775 link
2024-03-27 Convergence rates under a range invariance condition with application to electrical impedance tomography Barbara Kaltenbacher et.al. 2403.18704 null
2024-03-27 A Diffusion-Based Generative Equalizer for Music Restoration Eloi Moliner et.al. 2403.18636 link
2024-03-28 FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing Trong-Tung Nguyen et.al. 2403.18605 null
2024-03-27 HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Hao Xu et.al. 2403.18575 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936 null
2024-03-26 SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models Kashyap Chitta et.al. 2403.17933 null
2024-03-26 The instability mechanism of compact multiplanet systems Caleb Lammers et.al. 2403.17928 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924 link
2024-03-26 Emergent Anomalous Hydrodynamics at Infinite Temperature in a Long-Range XXZ Model Ang Yang et.al. 2403.17912 null
2024-03-26 The Solution to an Impulse Control Problem Motivated by Optimal Harvesting Zhesheng Liu et.al. 2403.17875 null
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870 null
2024-03-26 Universal entropy transport far from equilibrium across the BCS-BEC crossover Jeffrey Mohan et.al. 2403.17838 null
2024-03-26 The memory of Rayleigh-Taylor turbulence S. Thévenin et.al. 2403.17832 null
2024-03-26 DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions Sammy Christen et.al. 2403.17827 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803 link
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790 null
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776 null
2024-03-25 Stochastic Inertial Dynamics Via Time Scaling and Averaging Rodrigo Maulen-Soto et.al. 2403.16775 null
2024-03-25 Multilevel Modeling as a Methodology for the Simulation of Human Mobility Luca Serena et.al. 2403.16745 null
2024-03-25 A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models Nils Ingelhag et.al. 2403.16730 null
2024-03-25 Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728 link
2024-03-25 The effect of inter-track coupling on H$_2$O$_2$ productions Ramin Abolfath et.al. 2403.16722 null
2024-03-25 Phase Transformation in Lithium Niobate-Lithium Tantalate Solid Solutions (LiNb$_{1-x}$Ta$_x$O$_3$) Fatima El Azzouzi et.al. 2403.16717 null
2024-03-25 The Directionality of Gravitational and Thermal Diffusive Transport in Geologic Fluid Storage Anna Herring et.al. 2403.16659 null
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385 null
2024-03-22 Energy-dependent Boosted Dark Matter from Diffuse Supernova Neutrino Background Anirban Das et.al. 2403.15367 null
2024-03-22 Ultrasound Imaging based on the Variance of a Diffusion Restoration Model Yuxin Zhang et.al. 2403.15316 null
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309 null
2024-03-22 Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies Nicolò Botteghi et.al. 2403.15267 null
2024-03-22 Spectral Motion Alignment for Video Motion Transfer using Diffusion Models Geon Yeong Park et.al. 2403.15249 null
2024-03-22 Shadow Generation for Composite Image Using Diffusion model Qingyang Liu et.al. 2403.15234 link
2024-03-22 Broad Instantaneous Bandwidth Microwave Spectrum Analyzer with a Microfabricated Atomic Vapor Cell Yongqi Shi et.al. 2403.15155 null
2024-03-22 Oxygenation of CO and NO on Amorphous Solid Water Meenu Upadhyay et.al. 2403.15141 null
2024-03-21 Simplified Diffusion Schrödinger Bridge Zhicong Tang et.al. 2403.14623 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621 link
2024-03-21 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617 null
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602 null
2024-03-21 Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors Nikolaos Tsagkas et.al. 2403.14526 null
2024-03-21 Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting Alicia Durrer et.al. 2403.14499 link
2024-03-21 Periodicity from X-ray sources within the inner Galactic disk Samaresh Mondal et.al. 2403.14480 null
2024-03-21 Analysing Diffusion Segmentation for Medical Images Mathias Öttl et.al. 2403.14440 null
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429 null
2024-03-20 On Pretraining Data Diversity for Self-Supervised Learning Hasan Abed Al Kader Hammoud et.al. 2403.13808 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800 null
2024-03-20 DepthFM: Fast Monocular Depth Estimation with Flow Matching Ming Gui et.al. 2403.13788 null
2024-03-20 Anomalous diffusion in polydisperse granular gases: Monte Carlo simulations Anna S. Bodrova et.al. 2403.13772 null
2024-03-20 Disentangling the anisotropic radio sky: Fisher forecasts for 21cm arrays Zheng Zhang et.al. 2403.13768 null
2024-03-20 Statistical estimation of full-sky radio maps from 21cm array visibility data using Gaussian Constrained Realisations Katrine A. Glasscock et.al. 2403.13766 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745 link
2024-03-20 Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes Yifan Chen et.al. 2403.13724 null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963 link
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962 link
2024-03-19 TexTile: A Differentiable Metric for Texture Tileability Carlos Rodriguez-Pardo et.al. 2403.12961 link
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957 null
2024-03-19 Zero-Reference Low-Light Enhancement via Physical Quadruple Priors Wenjing Wang et.al. 2403.12933 null
2024-03-19 You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs Yihong Luo et.al. 2403.12931 link
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915 link
2024-03-19 H$\alpha$/H$\beta$ a Galactic Low Energy Cosmic Rays tracer I. Busa et.al. 2403.12872 null
2024-03-19 D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation Jun Yamada et.al. 2403.12861 null
2024-03-19 Generative Enhancement for 3D Medical Images Lingting Zhu et.al. 2403.12852 link
2024-03-18 Scaling limit of heavy tailed nearly unstable INAR($\infty$) processes and rough fractional diffusions Yingli Wang et.al. 2403.11773 null
2024-03-18 Irradiation induced mineral changes of NWA10580 meteorite determined by infrared analysis I. Gyollai et.al. 2403.11725 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706 link
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697 null
2024-03-18 Narrow absorption lines from intervening material in supernovae I. Measurements and temporal evolution Santiago González-Gaitán et.al. 2403.11677 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667 null
2024-03-18 Diffusion-Based Environment-Aware Trajectory Prediction Theodor Westny et.al. 2403.11643 null
2024-03-18 Arc2Face: A Foundation Model of Human Faces Foivos Paraperas Papantoniou et.al. 2403.11641 link
2024-03-18 Quasinormal Modes of Near-Extremal Electric and Magnetic Black Branes Swapnil Nitin Shah et.al. 2403.11640 null
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627 link
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518 link
2024-03-15 Active transport of a passive colloid in a bath of run-and-tumble particles Tanumoy Dhar et.al. 2403.10508 null
2024-03-15 MusicHiFi: Fast High-Fidelity Stereo Vocoding Ge Zhu et.al. 2403.10493 null
2024-03-15 New functional inequalities with applications to the arctan-fast diffusion equation Rafael Granero-Belinchón et.al. 2403.10458 null
2024-03-15 Variance sum rule: proofs and solvable models Ivan Di Terlizzi et.al. 2403.10442 null
2024-03-15 SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy Alison Bartsch et.al. 2403.10401 null
2024-03-15 Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding Pengkun Liu et.al. 2403.10395 link
2024-03-15 Denoising Task Difficulty-based Curriculum for Training Diffusion Models Jin-Young Kim et.al. 2403.10348 null
2024-03-15 Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: Numerical Analysis Jai Tushar et.al. 2403.10282 null
2024-03-15 Towards Generalizable Deepfake Video Detection with Thumbnail Layout and Graph Reasoning Yuting Xu et.al. 2403.10261 link
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625 null
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623 link
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616 null
2024-03-14 Generative reconstruction of 3D volume elements for Ti-6Al-4V basketweave microstructure by optimization of CNN-based microstructural descriptors Vincent Blümer et.al. 2403.09609 null
2024-03-14 The effect of spatially-varying collision frequency on the development of the Rayleigh-Taylor instability John Rodman et.al. 2403.09591 null
2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu et.al. 2403.09471 null
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468 link
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764 null
2024-03-13 Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08758 null
2024-03-13 Efficient Combinatorial Optimization via Heat Diffusion Hengyuan Ma et.al. 2403.08757 link
2024-03-13 Sticky-threshold diffusions, local time approximation and parameter estimation Alexis Anagnostakis et.al. 2403.08754 null
2024-03-13 Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08749 null
2024-03-14 GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing Jing Wu et.al. 2403.08733 null
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728 link
2024-03-13 Historical Astronomical Diagrams Decomposition in Geometric Primitives Syrine Kalleli et.al. 2403.08721 null
2024-03-13 Limits on the OH Molecule in the Smith High Velocity Cloud Anthony H. Minter et.al. 2403.08704 null
2024-03-13 Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment Paraskevas Pegios et.al. 2403.08700 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860 link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842 null
2024-03-12 MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model Guibo Luo et.al. 2403.07838 null
2024-03-12 Fragmentation of Dense Rotation-Dominated Structures Fed by Collapsing Gravomagneto-Sheetlets and Origin of Misaligned 100 au-Scale Binaries and Multiple Systems Yisheng Tu et.al. 2403.07777 null
2024-03-13 SemCity: Semantic Scene Generation with Triplane Diffusion Jumin Lee et.al. 2403.07773 link
2024-03-12 A first principles study of the Stark shift effect on the zero-phonon line of the NV center in diamond Louis Alaerts et.al. 2403.07771 null
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764 null
2024-03-13 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721 link
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711 link
2024-03-12 Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal Yijun Yang et.al. 2403.07684 link
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976 link
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973 null
2024-03-11 POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations Bosco Garcia-Archilla et.al. 2403.06967 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952 null
2024-03-12 DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations Tianhao Qi et.al. 2403.06951 link
2024-03-11 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction Qing Xiao et.al. 2403.06940 null
2024-03-11 Anderson-Higgs amplitude mode in Josephson junctions Pierre Vallet et.al. 2403.06878 null
2024-03-11 Estimation of parameters and local times in a discretely observed threshold diffusion model Sara Mazzonetto et.al. 2403.06858 null
2024-03-11 Orbital relaxation length from first-principles scattering calculations Max Rang et.al. 2403.06827 null
2024-03-11 A quasilinear Keller-Segel model with saturated discontinuous advection Maria Gualdani et.al. 2403.06820 null
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438 link
2024-03-08 Radiation transport methods in star formation simulations Richard Wünsch et.al. 2403.05410 null
2024-03-08 Simulating conditioned diffusions on manifolds Marc Corstanje et.al. 2403.05409 link
2024-03-08 An implicit algorithm for simulating the dynamics of small dust grains with smoothed particle hydrodynamics Daniel Elsender et.al. 2403.05345 null
2024-03-08 DiffSF: Diffusion Models for Scene Flow Estimation Yushan Zhang et.al. 2403.05327 link
2024-03-08 Disorder-induced instability of a Weyl nodal loop semimetal towards a diffusive topological metal with protected multifractal surface states João S. Silva et.al. 2403.05298 null
2024-03-08 Neutrino fluxes from different classes of galactic sources Silvia Gagliardini et.al. 2403.05288 null
2024-03-08 Patricia's Bad Distributions Louigi Addario-Berry et.al. 2403.05269 null
2024-03-08 Non-additivity in many-body interactions between membrane-deforming spheres increases disorder Ali Azadbakht et.al. 2403.05253 null
2024-03-08 Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI Shoujin Huang et.al. 2403.05245 null
2024-03-07 Effects of mechanical stress, chemical potential, and coverage on hydrogen solubility during hydrogen enhanced decohesion of ferritic steel grain boundaries: A first-principles study Abril Azocar Guzman et.al. 2403.04741 null
2024-03-07 Quantum-enhanced joint estimation of phase and phase diffusion Jayanth Jayakumar et.al. 2403.04722 null
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701 link
2024-03-07 Delving into the Trajectory Long-tail Distribution for Muti-object Tracking Sijia Chen et.al. 2403.04700 link
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692 null
2024-03-07 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634 null
2024-03-07 A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images Cristiana Tiago et.al. 2403.04612 null
2024-03-07 Dynamic critical behavior of the chiral phase transition from the real-time functional renormalization group Johannes V. Roth et.al. 2403.04573 null
2024-03-07 Rescaled Mode-Coupling Scheme for the Quantitative Description of Experimentally Observed Colloid Dynamics Joel Diaz Maier et.al. 2403.04556 null
2024-03-07 Poisson equation with measure data, reconstruction formula and Doob classes of processes Andrzej Rozkosz et.al. 2403.04543 null
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938 link
2024-03-06 Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation Xiao Ma et.al. 2403.03890 null
2024-03-06 Towards a Schauder theory for fractional viscous Hamilton--Jacobi equations Espen R. Jakobsen et.al. 2403.03884 null
2024-03-06 Latent Dataset Distillation with Diffusion Models Brian B. Moser et.al. 2403.03881 null
2024-03-06 Convergence rate of the Smoluchowski-Kramers approximation for diffusions with jumps Chungang Shi et.al. 2403.03877 null
2024-03-06 Accelerating Convergence of Score-Based Diffusion Models, Provably Gen Li et.al. 2403.03852 null
2024-03-06 Two 100 TeV neutrinos coincident with the Seyfert galaxy NGC 7469 Giacomo Sommani et.al. 2403.03752 null
2024-03-06 Diffusion on language model embeddings for protein sequence generation Viacheslav Meshchaninov et.al. 2403.03726 null
2024-03-06 Spectral Algorithms on Manifolds through Diffusion Weichun Xia et.al. 2403.03669 null
2024-03-05 Moment estimates, exponential integrability, concentration inequalities and exit times estimates on evolving manifolds Robert Baumgarth et.al. 2403.03209 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206 null
2024-03-05 MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi et.al. 2403.03194 null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181 link
2024-03-05 The Amplitude Equation for the Space-Fractional Swift-Hohenberg Equation Christian Kuehn et.al. 2403.03158 null
2024-03-05 On dynamics of gasless combustion in slowly varying periodic media: periodic fronts, their stability and propagation-extinction-diffusion-reignition pattern Amanda Matson et.al. 2403.03144 null
2024-03-05 Enhanced beam-beam modeling to include longitudinal variation during weak-strong simulation Derong Xu et.al. 2403.03137 null
2024-03-05 NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Zeqian Ju et.al. 2403.03100 null
2024-03-05 Proof-of-concept for a nonadditive stochastic model of supercooled liquids Antonio Cesar do Prado Rosa Junior et.al. 2403.03041 null
2024-03-05 Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn's Rings Naoya Torii et.al. 2403.03012 null
2024-03-02 Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models Neta Shaul et.al. 2403.01329 null
2024-03-02 Longtime behavior of semilinear multi-term fractional in time diffusion Nataliya Vasylyeva et.al. 2403.01302 null
2024-03-02 Anomalous mass dependency in Hydra endoderm cell cluster diffusion Aline Lütz et.al. 2403.01294 null
2024-03-02 On the Arnold diffusion mechanism in Medium Earth Orbit Elisa Maria Alessi et.al. 2403.01283 null
2024-03-02 Rigidity results for group von Neumann algebras with diffuse center Ionuţ Chifan et.al. 2403.01280 null
2024-03-02 Analyzing the transport coefficients and observables of a rotating QGP medium in kinetic theory framework with a novel approach to the collision integral Shubhalaxmi Rath et.al. 2403.01240 null
2024-03-02 DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction Junwen Xiong et.al. 2403.01226 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212 null
2024-03-02 Atacama Large Aperture Submillimeter Telescope (AtLAST) science: Gas and dust in nearby galaxies Daizhong Liu et.al. 2403.01202 null
2024-03-02 Modelling ion acceleration and transport in corotating interaction regions: the mass-to-charge ratio dependence of the particle spectrum Zheyi Ding et.al. 2403.01201 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481 link
2024-02-29 Towards Generalizable Tumor Synthesis Qi Chen et.al. 2402.19470 link
2024-02-29 Anomalous contribution to galactic rotation curves due to stochastic spacetime Jonathan Oppenheim et.al. 2402.19459 null
2024-02-29 Listening to the Noise: Blind Denoising with Gibbs Diffusion David Heurtel-Depeiges et.al. 2402.19455 link
2024-02-29 Structure Preserving Diffusion Models Haoye Lu et.al. 2402.19369 null
2024-02-29 A new analytical model of the cosmic-ray energy flux for Galactic diffuse radio emission Andrea Bracco et.al. 2402.19367 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330 link
2024-02-29 DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly Gianluca Scarpellini et.al. 2402.19302 link
2024-02-29 Modeling the Progenitor Stars of Observed IIP Supernovae Kai-An You et.al. 2402.19260 link
2024-02-29 Generative models struggle with kirigami metamaterials Gerrit Felsch et.al. 2402.19196 null
2024-02-28 Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations Elie Abdo et.al. 2402.18572 null
2024-02-28 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang et.al. 2402.18567 null
2024-02-28 Photon statistics of resonantly driven spectrally diffusive quantum emitters Aymeric Delteil et.al. 2402.18542 null
2024-02-28 Optimality conditions for sparse optimal control of viscous Cahn-Hilliard systems with logarithmic potential Pierluigi Colli et.al. 2402.18506 null
2024-02-28 Dynamical Regimes of Diffusion Models Giulio Biroli et.al. 2402.18491 null
2024-02-28 Introducing cuDisc: a 2D code for protoplanetary disc structure and evolution calculations Alfie Robinson et.al. 2402.18471 link
2024-02-28 Effect of a perpendicular magnetic field on bilayer graphene under dual gating Mouhamadou Hassane Saley et.al. 2402.18399 null
2024-02-28 Deep Confident Steps to New Pockets: Strategies for Docking Generalization Gabriele Corso et.al. 2402.18396 link
2024-02-28 Topological charge and spin Hall effects due to skyrmions in canted antiferromagnets A. N. Zarezad et.al. 2402.18369 null
2024-02-28 Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model Sangjoon Park et.al. 2402.18362 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768 null
2024-02-27 Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners Yazhou Xing et.al. 2402.17723 null
2024-02-27 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563 null
2024-02-27 Fast Lithium Ion Diffusion in Brownmillerite $\mathrm{Li}_{x}\mathrm{Sr}_{2}\mathrm{Co}_{2}\mathrm{O}_{5}$ Xin Chen et.al. 2402.17557 null
2024-02-27 Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label Xinliang Zhang et.al. 2402.17555 link
2024-02-27 Forming 1D Periodic J-aggregates by Mechanical Bending of BNNTs: Evidence of Activated Molecular Diffusion J. -B. Marceau et.al. 2402.17537 null
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525 link
2024-02-27 Label-Noise Robust Diffusion Models Byeonghu Na et.al. 2402.17517 link
2024-02-27 The Unwanted Dissemination of Science: The Usage of Academic Articles as Ammunition in Contested Discursive Arenas on Twitter Richard Zhang et.al. 2402.17495 null
2024-02-27 EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485 null
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506 link
2024-02-26 Outline-Guided Object Inpainting with Diffusion Models Markus Pobitzer et.al. 2402.16421 null
2024-02-26 Renormalisation Group Methods for Effective Epidemiological Models Stefan Hohenegger et.al. 2402.16409 null
2024-02-26 Entropy production for diffusion processes across a semipermeable interface Paul C Bressloff et.al. 2402.16403 null
2024-02-26 Quantitative Propagation of Chaos for Mean Field Interacting Particle System Xing Huang et.al. 2402.16400 null
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392 link
2024-02-26 Generative AI in Vision: A Survey on Models, Metrics and Applications Gaurav Raut et.al. 2402.16369 null
2024-02-26 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359 null
2024-02-26 Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion Xuantong Liu et.al. 2402.16305 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302 link
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509 link
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504 link
2024-02-23 Length and Velocity Scales in Protoplanetary Disk Turbulence Debanjan Sengupta et.al. 2402.15475 null
2024-02-23 Solute transport due to periodic loading in a soft porous material Matilde Fiori et.al. 2402.15451 null
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429 link
2024-02-23 Dendrites with corners Enugala Sumanth Nani et.al. 2402.15394 null
2024-02-23 Understanding Oversmoothing in Diffusion-Based GNNs From the Perspective of Operator Semigroup Theory Weichen Zhao et.al. 2402.15326 null
2024-02-23 Ubiquitous short-range order in multi-principal element alloys Ying Han et.al. 2402.15305 null
2024-02-23 Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models Shunyu Liu et.al. 2402.15289 link
2024-02-23 Generative Modelling with Tensor Train approximations of Hamilton--Jacobi--Bellman equations David Sommer et.al. 2402.15285 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817 null
2024-02-22 GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion Xueyi Liu et.al. 2402.14810 link
2024-02-22 Consolidating Attention Features for Multi-view Image Editing Or Patashnik et.al. 2402.14792 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780 null
2024-02-22 Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening Zhenrong Shen et.al. 2402.14707 null
2024-02-22 PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model Yukiya Hono et.al. 2402.14692 null
2024-02-22 Error Estimates for First- and Second-Order Lagrange-Galerkin Moving Mesh Schemes for the One-Dimensional Convection-Diffusion Equation Kharisma Surya Putri et.al. 2402.14691 null
2024-02-22 Structure and thermodynamics of defects in Na-feldspar from a neural network potential Alexander Gorfer et.al. 2402.14640 null
2024-02-22 Debiasing Text-to-Image Diffusion Models Ruifei He et.al. 2402.14577 null
2024-02-22 DynGMA: a robust approach for learning stochastic differential equations from data Aiqing Zhu et.al. 2402.14475 link
2024-02-21 D-Flow: Differentiating through Flows for Controlled Generation Heli Ben-Hamu et.al. 2402.14017 null
2024-02-21 SDXL-Lightning: Progressive Adversarial Diffusion Distillation Shanchuan Lin et.al. 2402.13929 null
2024-02-21 Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate Yuchen Liang et.al. 2402.13901 null
2024-02-21 Conformal and nonminimal couplings in fractional cosmology Kevin Marroquín et.al. 2402.13850 null
2024-02-21 The influence of thermal pressure gradients and ionization (im)balance on the ambipolar diffusion and charge-neutral drifts M. M. Gómez-Míguez et.al. 2402.13813 null
2024-02-21 NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion Haoyu Li et.al. 2402.13809 null
2024-02-21 The Geography of Information Diffusion in Online Discourse on Europe and Migration Elisa Leonardelli et.al. 2402.13800 null
2024-02-21 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777 link
2024-02-21 Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion Lianghu Guo et.al. 2402.13776 null
2024-02-21 Music Style Transfer with Time-Varying Inversion of Diffusion Models Sifei Li et.al. 2402.13763 null
2024-02-20 Nonequilibrium fluctuations of chemical reaction networks at criticality: The Schlögl model as paradigmatic case Benedikt Remlein et.al. 2402.13168 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144 link
2024-02-20 Ultrafast lattice disordering can be accelerated by electronic collisional forces Gilberto A. de la Pena Munoz et.al. 2402.13133 null
2024-02-20 How accurate are simulations and experiments for the lattice energies of molecular crystals? Flaviano Della Pia et.al. 2402.13059 null
2024-02-20 Excited state-specific CASSCF theory for the torsion of ethylene Sandra Saade et.al. 2402.13046 null
2024-02-20 Text-Guided Molecule Generation with Diffusion Language Model Haisong Gong et.al. 2402.13040 link
2024-02-20 The Anomalous Long-Ranged Influence of an Inclusion in Momentum-Conserving Active Fluids Thibaut Arnoulx de Pirey et.al. 2402.12996 null
2024-02-20 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974 link
2024-02-20 CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Sohail Ahmed Khan et.al. 2402.12927 link
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376 link
2024-02-19 A Lower Bound for Estimating Fréchet Means Shayan Hundrieser et.al. 2402.12290 null
2024-02-19 Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach Mohammad Heydari et.al. 2402.12272 null
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242 link
2024-02-19 Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations Jonas Beck et.al. 2402.12231 link
2024-02-19 Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training Leo Hyun Park et.al. 2402.12187 null
2024-02-19 Anomalous Diffusion, Prethermalization, and Particle Binding in an Interacting Flat Band System Mirko Daumann et.al. 2402.12180 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099 null
2024-02-19 Malliavin Calculus for rough stochastic differential equations Fabio Bugini et.al. 2402.12056 null
2024-02-19 Constraining the stellar populations of ultra-diffuse galaxies in the MATLAS survey using spectral energy distribution fitting Maria Luisa Buzzo et.al. 2402.12033 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885 null
2024-02-16 Electronic Conductivity Measurements in Solid Electrolytes Using an Ion Blocking Microelectrode: Noise Rejection Based on a Median Filter Veyis Gunes et.al. 2402.10883 null
2024-02-16 Control Color: Multimodal Diffusion-based Interactive Image Colorization Zhexin Liang et.al. 2402.10855 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821 link
2024-02-16 VATr++: Choose Your Words Wisely for Handwritten Text Generation Bram Vanherle et.al. 2402.10798 null
2024-02-16 Nearly-optimal effective stability estimates around Diophantine tori of Hölder Hamiltonians Santiago Barbieri et.al. 2402.10764 null
2024-02-16 Revisiting a Core-Jet Laboratory at High Redshift: Analysis of the Radio Jet in the Quasar PKS 2215+020 at z=3.572 Sándor Frey et.al. 2402.10722 null
2024-02-16 Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation Hongbin Na et.al. 2402.10699 null
2024-02-16 Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm Yuanzhen Xie et.al. 2402.10671 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210 null
2024-02-15 Recovering the Pre-Fine-Tuning Weights of Generative Models Eliahu Horwitz et.al. 2402.10208 link
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207 link
2024-02-15 Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model Mariia Drozdova et.al. 2402.10204 link
2024-02-15 Tracer dynamics in polymer networks: generalized Langevin description Sebastian Milster et.al. 2402.10148 null
2024-02-15 Energy Flux Decomposition in Magnetohydrodynamic Turbulence D. Capocci et.al. 2402.10125 null
2024-02-15 A Blob Method for Mean Field Control With Terminal Constraints Katy Craig et.al. 2402.10124 link
2024-02-15 Collision efficiency of droplets across diffusive, electrostatic and inertial regimes Florian Poydenot et.al. 2402.10117 null
2024-02-15 Quantized Embedding Vectors for Controllable Diffusion Language Models Cheng Kang et.al. 2402.10107 null
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095 null
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368 link
2024-02-14 Investigation of Ga interstitial and vacancy diffusion in $β$-Ga$_2$O$_3$ via split defects: a direct approach via master diffusion equations Channyung Lee et.al. 2402.09354 null
2024-02-14 On the system size dependence of the diffusion coefficients in MD simulations: A simple correction formula for pure dense fluids Sergey Khrapak et.al. 2402.09348 null
2024-02-14 Lattice B-field correlators for heavy quarks Luis Altenkort et.al. 2402.09337 null
2024-02-14 Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio Pablo Alonso-Jiménez et.al. 2402.09318 null
2024-02-14 Disentangling the origin of chemical differences using GHOST C. Saffe et.al. 2402.09278 null
2024-02-14 A Modular Deep Learning-based Approach for Diffuse Optical Tomography Reconstruction Alessandro Benfenati et.al. 2402.09277 null
2024-02-14 Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection Pengfei Zhou et.al. 2402.09242 link
2024-02-14 Modeling of groundwater flow in porous medium layered over inclined impermeable bed Petr Girg et.al. 2402.09215 null
2024-02-14 A universal scaling limit for diffusive amnesic step-reinforced random walks Marco Bertenghi et.al. 2402.09202 null
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682 null
2024-02-13 Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? Guilherme S. Y. Giardini et.al. 2402.08681 null
2024-02-13 Target Score Matching Valentin De Bortoli et.al. 2402.08667 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654 null
2024-02-13 Clustering of primordial black holes from quantum diffusion during inflation Chiara Animali et.al. 2402.08642 null
2024-02-13 Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing Yunji Jung et.al. 2402.08601 null
2024-02-13 Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator Amartya Mukherjee et.al. 2402.08563 null
2024-02-13 Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases Ziyi Zhang et.al. 2402.08552 link
2024-02-13 Branching Interval Partition Diffusions Matthew Buckland et.al. 2402.08548 null
2024-02-13 Hyperballistic transport in dense ionized matter under external AC electric fields Daniele Gamba et.al. 2402.08519 null
2024-02-12 Label-Efficient Model Selection for Text Generation Shir Ashury-Tahan et.al. 2402.07891 null
2024-02-12 High-order harmonic generation in 2D Transition Metal Disulphides Jose Manuel Iglesias et.al. 2402.07850 null
2024-02-12 Self-heating effects and switching dynamics in graphene multiterminal Josephson junctions Máté Kedves et.al. 2402.07831 null
2024-02-12 Towards a mathematical theory for consistency training in diffusion models Gen Li et.al. 2402.07802 null
2024-02-12 Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Jiacheng Ye et.al. 2402.07754 link
2024-02-12 The GALAH survey: Elemental abundances in open clusters using joint effective temperature and surface gravity photometric priors Kevin L. Beeson et.al. 2402.07748 null
2024-02-12 Topological Edge States in Reconfigurable Multi-stable Mechanical Metamaterials Zhen Wang et.al. 2402.07707 null
2024-02-12 Metastability and time scales for parabolic equations with drift 2: the general time scale Claudio Landim et.al. 2402.07695 null
2024-02-12 Cosmology at the Field Level with Probabilistic Machine Learning Adam Rouhiainen et.al. 2402.07694 null
2024-02-12 Higher-order Connection Laplacians for Directed Simplicial Complexes Xue Gong et.al. 2402.07631 null
2024-02-09 The impact of different unravelings in a monitored system of free fermions Giulia Piccitto et.al. 2402.06597 null
2024-02-09 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Brian Yang et.al. 2402.06559 null
2024-02-09 The role of mobility in epidemics near criticality Beatrice Nettuno et.al. 2402.06505 null
2024-02-09 Sequential Flow Matching for Generative Modeling Jongmin Yoon et.al. 2402.06461 null
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436 null
2024-02-09 Enhanced bubble growth near an advancing solidification front Jochem G. Meijer et.al. 2402.06409 null
2024-02-09 Spectral properties of the Dirichlet-to-Neumann operator for spheroids Denis S. Grebenkov et.al. 2402.06372 null
2024-02-09 Sparse identification of nonlocal interaction kernels in nonlinear gradient flow equations via partial inversion Jose A. Carrillo et.al. 2402.06355 null
2024-02-09 Particle Denoising Diffusion Sampler Angus Phillips et.al. 2402.06320 link
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937 null
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933 link
2024-02-08 Dirichlet Flow Matching with Applications to DNA Sequence Design Hannes Stark et.al. 2402.05841 link
2024-02-08 AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning Wamiq Reyaz Para et.al. 2402.05803 null
2024-02-08 Determining the significance and relative importance of parameters of a simulated quenching algorithm using statistical tools Pedro A. Castillo et.al. 2402.05791 null
2024-02-08 Hydrogen abstraction from metal surfaces: When electron-hole pair excitations strongly affect hot-atom recombination Oihana Galparsoro et.al. 2402.05743 null
2024-02-08 First operation of a multi-channel Q-Pix prototype: measuring transverse electron diffusion in a gas time projection chamber Nora Hoch et.al. 2402.05734 null
2024-02-08 DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer Zhiyuan Ma et.al. 2402.05712 link
2024-02-08 Discovery and characterisation of a new Galactic Planetary Nebula W. E. Celnik et.al. 2402.05658 null
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608 link
2024-02-07 Nature of the diffuse emission sources in the H I supershell in the galaxy IC 1613 Anastasiya D. Yarovova et.al. 2402.05107 null
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098 link
2024-02-07 Convergence of spatial branching processes to $α$-stable CSBPs: Genealogy of semi-pushed fronts Félix Foutel-Rodier et.al. 2402.05096 null
2024-02-07 Interacting particle approximation of cross-diffusion systems Jose Antonio Carrillo et.al. 2402.05094 null
2024-02-07 NITO: Neural Implicit Fields for Resolution-free Topology Optimization Amin Heyrani Nobari et.al. 2402.05073 null
2024-02-07 LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Jiaxiang Tang et.al. 2402.05054 null
2024-02-07 Non-reversible lifts of reversible diffusion processes and relaxation times Andreas Eberle et.al. 2402.05041 null
2024-02-07 Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design Andrew Campbell et.al. 2402.04997 link
2024-02-07 On the Cahn-Hilliard equation with kinetic rate dependent dynamic boundary conditions and non-smooth potentials: Well-posedness and asymptotic limits Maoyin Lv et.al. 2402.04965 null
2024-02-07 Hidden non-equilibrium pathways towards crystalline perfection A. Mangu et.al. 2402.04962 null
2024-02-06 Geometric theory of (extended) time-reversal symmetries in stochastic processes -- Part I: finite dimension Jérémy O'Byrne et.al. 2402.04217 null
2024-02-06 Maximal regularity and optimal control for a non-local Cahn-Hilliard tumour growth model Matteo Fornoni et.al. 2402.04204 null
2024-02-06 SHIELD: An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models Yichen Shi et.al. 2402.04178 link
2024-02-06 Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Ruoqi Zhang et.al. 2402.04080 link
2024-02-06 Generative Modeling of Graphs via Joint Diffusion of Node and Edge Attributes Nimrod Berman et.al. 2402.04046 null
2024-02-06 PAC-Bayesian Adversarially Robust Generalization Bounds for Graph Neural Network Tan Sun et.al. 2402.04038 null
2024-02-06 Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation Zolnamar Dorjsembe et.al. 2402.04031 link
2024-02-06 Space Group Constrained Crystal Generation Rui Jiao et.al. 2402.03992 null
2024-02-06 Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting Yiming Xu et.al. 2402.03981 null
2024-02-06 Weibel- and non-resonant Whistler wave growth in an expanding plasma in a 1D simulation geometry M E Dieckmann et.al. 2402.03925 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305 null
2024-02-05 Zero-shot Object-Level OOD Detection with Context-Aware Inpainting Quang-Huy Nguyen et.al. 2402.03292 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290 link
2024-02-05 Estimating position-dependent and anisotropic diffusivity tensors from molecular dynamics trajectories: Existing methods and future outlook Tiago Domingues et.al. 2402.03285 null
2024-02-05 Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? Anna Yoo Jeong Ha et.al. 2402.03214 null
2024-02-05 Light and Optimal Schrödinger Bridge Matching Nikita Gushchin et.al. 2402.03207 link
2024-02-05 Guidance with Spherical Gaussian Constraint for Conditional Diffusion Lingxiao Yang et.al. 2402.03201 link
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162 null
2024-02-05 Nonlinear feedback of the electrostatic instability on the blazar-induced pair beam and GeV cascade Mahmoud Alawashra et.al. 2402.03127 null
2024-02-05 DARTS: Diffusion Approximated Residual Time Sampling for Low Variance Time-of-flight Rendering in Homogeneous Scattering Medium Qianyue He et.al. 2402.03106 null
2024-02-02 Revealing crucial effects of reservoir environment and hydrocarbon fractions on fluid behaviour in kaolinite pores Rixin Zhao et.al. 2402.01633 null
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590 null
2024-02-02 Transformation semigroups and their applications Katarzyna Pichór et.al. 2402.01572 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566 null
2024-02-02 Resolution dependence of most probable pathways with state-dependent diffusivity Alice L. Thorneywork et.al. 2402.01559 null
2024-02-02 The galactic bubbles of starburst galaxies: The influence of galactic large-scale magnetic fields Z. Meliani et.al. 2402.01541 null
2024-02-02 Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations Panos Kakoulidis et.al. 2402.01520 null
2024-02-02 Cross-view Masked Diffusion Transformers for Person Image Synthesis Trung X. Pham et.al. 2402.01516 link
2024-02-02 Binomial-tree approximation for time-inconsistent stopping Erhan Bayraktar et.al. 2402.01482 null
2024-02-02 SVI solutions to stochastic nonlinear diffusion equations on general measure spaces Benjamin Gess et.al. 2402.01479 null
2024-02-01 AToM: Amortized Text-to-Mesh using 2D Diffusion Guocheng Qian et.al. 2402.00867 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864 link
2024-02-01 An Analysis of the Variance of Diffusion-based Speech Enhancement Bunlong Lay et.al. 2402.00811 null
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769 link
2024-02-01 The Sonora Substellar Atmosphere Models. IV. Elf Owl: Atmospheric Mixing and Chemical Disequilibrium with Varying Metallicity and C/O Ratios Sagnick Mukherjee et.al. 2402.00756 null
2024-02-01 Neutral carbon in diffuse interstellar medium: abundance matching with H2 for DLAs at high redshifts Sergei Balashev et.al. 2402.00714 null
2024-02-01 Cylindrically symmetric diffusion model for relativistic heavy-ion collisions Johannes Hoelck et.al. 2402.00628 null
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627 link
2024-02-01 Diffusion-based Light Field Synthesis Ruisheng Gao et.al. 2402.00575 null
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085 null
2024-01-31 An electrodynamic wave model for the action potential Vitaly L. Galinsky et.al. 2401.18051 null
2024-01-31 Reversible, Irreversible and Mixed Regimes for Periodically Driven Disks in Random Obstacle Arrays D. Minogue et.al. 2401.18042 null
2024-01-31 Ljusternik-Schnirelmann eigenvalues for the fractional $m$-Laplacian without the $Δ_2$ condition Julian Fernandez Bonder et.al. 2401.18041 null
2024-01-31 Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations Qi-Zuo Wu et.al. 2401.17982 null
2024-01-31 Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances Xuefeng Gao et.al. 2401.17958 null
2024-01-31 Investigation of Microstructure and Corrosion Resistance of Ti-Al-V Titanium Alloys Obtained by Spark Plasma Sintering Aleksey Nokhrin et.al. 2401.17941 null
2024-01-31 Lipolysis on Lipid Droplets: Mathematical Modelling and Numerical Discretisation Reymart Salcedo Lagunero et.al. 2401.17935 link
2024-01-31 AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error Jonas Ricker et.al. 2401.17879 link
2024-01-31 Multiplicity results for mass constrained Allen-Cahn equations on Riemannian manifolds with boundary Dario Corona et.al. 2401.17847 null
2024-01-30 Study of X-ray emission from the S147 nebula with SRG/eROSITA: X-ray imaging, spectral characterization and a multiwavelength picture Miltiadis Michailidis et.al. 2401.17312 null
2024-01-30 G321.3-3.9: a new supernova remnant observed with multi-band radio data and in the SRG/eROSITA All-Sky Surveys S. Mantovanini et.al. 2401.17294 null
2024-01-30 Discovery of the Goat Horn complex: a $\sim 1000$ deg$^2$ diffuse X-ray source connected to radio loop XII Nicola Locatelli et.al. 2401.17291 null
2024-01-30 A new understanding of the Gemini-Monoceros X-ray enhancement from discoveries with eROSITA Jonathan R. Knies et.al. 2401.17289 null
2024-01-30 Probing the physical properties of the IGM using SRG/eROSITA spectra from blazars E. Gatuzz et.al. 2401.17283 null
2024-01-30 You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation Mehdi Noroozi et.al. 2401.17258 null
2024-01-30 Stochastic motions of the two-dimensional many-body delta-Bose gas Yu-Ting Chen et.al. 2401.17243 null
2024-01-30 ContactGen: Contact-Guided Interactive 3D Human Generation for Partners Dongjun Gu et.al. 2401.17212 null
2024-01-30 Quantum dynamics in one and two dimensions via recursion method Filipp Uskov et.al. 2401.17211 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181 null

Motion

Publish Date Title Authors PDF Code
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508 link
2024-06-10 Human Gaze and Head Rotation during Navigation, Exploration and Object Manipulation in Shared Environments with Robots Tim Schreiter et.al. 2406.06300 null
2024-06-07 SMART: Scene-motion-aware human action recognition framework for mental disorder group Zengyuan Lai et.al. 2406.04649 null
2024-06-03 PDP: Physics-Based Character Animation via Diffusion Policy Takara E. Truong et.al. 2406.00960 null
2024-06-02 Unsupervised Neural Motion Retargeting for Humanoid Teleoperation Satoshi Yagi et.al. 2406.00727 null
2024-06-02 T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences Taeryung Lee et.al. 2406.00636 null
2024-05-30 MotionLLM: Understanding Human Behaviors from Human Motions and Videos Ling-Hao Chen et.al. 2405.20340 null
2024-05-30 RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text Jiaben Chen et.al. 2405.20336 null
2024-05-30 SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations Yujiao Jiang et.al. 2405.19609 null
2024-05-30 Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction Xuehao Gao et.al. 2405.18700 null
2024-05-30 Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking Sequences Vida Adeli et.al. 2405.17817 link
2024-05-28 MotionLLM: Multimodal Motion-Language Learning with Large Language Models Qi Wu et.al. 2405.17013 null
2024-05-27 A Cross-Dataset Study for Text-based 3D Human Motion Retrieval Léore Bensabath et.al. 2405.16909 null
2024-05-25 SuDA: Support-based Domain Adaptation for Sim2Real Motion Capture with Flexible Sensors Jiawei Fang et.al. 2405.16152 null
2024-05-24 FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis Ke Fan et.al. 2405.15763 null
2024-05-24 Learning Generalizable Human Motion Generator with Reinforcement Learning Yunyao Mao et.al. 2405.15541 null
2024-05-24 Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer Zichen Geng et.al. 2405.15439 null
2024-05-24 A Systematic Review on Custom Data Gloves Valerio Belcamino et.al. 2405.15417 null
2024-05-24 On the Identification of Temporally Causal Representation with Instantaneous Dependence Zijian Li et.al. 2405.15325 null
2024-05-24 Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor Haoxuan Qu et.al. 2405.15267 null
2024-05-23 Event-based dataset for the detection and classification of manufacturing assembly tasks Laura Duarte et.al. 2405.14626 link
2024-05-21 MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video Hongsheng Wang et.al. 2405.12806 null
2024-05-21 Towards Using Fast Embedded Model Predictive Control for Human-Aware Predictive Robot Navigation Till Hielscher et.al. 2405.12616 null
2024-05-21 Physics-based Scene Layout Generation from Human Motion Jianan Li et.al. 2405.12460 null
2024-05-23 Flexible Motion In-betweening with Diffusion Models Setareh Cohan et.al. 2405.11126 null
2024-05-17 Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis Zeyi Zhang et.al. 2405.09814 null
2024-05-16 Integrating Uncertainty-Aware Human Motion Prediction into Graph-Based Manipulator Motion Planning Wansong Liu et.al. 2405.09779 null
2024-05-24 ContourCraft: Learning to Resolve Intersections in Neural Multi-Garment Simulations Artur Grigorev et.al. 2405.09522 null
2024-05-13 Generating Human Motion in 3D Scenes from Text Descriptions Zhi Cen et.al. 2405.07784 null
2024-05-13 Establishing a Unified Evaluation Framework for Human Motion Generation: A Comparative Analysis of Metrics Ali Ismail-Fawaz et.al. 2405.07680 link
2024-05-13 Motion Keyframe Interpolation for Any Human Skeleton via Temporally Consistent Point Cloud Sampling and Reconstruction Clinton Mo et.al. 2405.07444 null
2024-05-10 Shape Conditioned Human Motion Generation with Diffusion Model Kebing Xue et.al. 2405.06778 null
2024-05-09 A Mixture of Experts Approach to 3D Human Motion Prediction Edmund Shieh et.al. 2405.06088 link
2024-05-09 StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework Yiheng Huang et.al. 2405.05691 null
2024-05-08 Audio Matters Too! Enhancing Markerless Motion Capture with Audio Signals for String Performance Capture Yitong Jin et.al. 2405.04963 null
2024-05-08 WixUp: A General Data Augmentation Framework for Wireless Perception in Tracking of Humans Yin Li et.al. 2405.04804 null
2024-05-08 Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches Qing Yu et.al. 2405.04771 null
2024-05-07 Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos Junyi Ma et.al. 2405.04370 link
2024-05-06 MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization Massimiliano Pappa et.al. 2405.03803 null
2024-05-06 LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model Haowen Sun et.al. 2405.03485 link
2024-05-05 Multimodal Sense-Informed Prediction of 3D Human Motions Zhenyu Lou et.al. 2405.02911 null
2024-05-05 Efficient Text-driven Motion Generation via Latent Consistency Training Mengxian Hu et.al. 2405.02791 null
2024-05-03 Physics-informed generative neural networks for RF propagation prediction with application to indoor body perception Federica Fieramosca et.al. 2405.02131 null
2024-04-30 MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model Wenxun Dai et.al. 2404.19759 link
2024-04-30 PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios Jingbo Wang et.al. 2404.19722 null
2024-04-30 Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis Shivam Mehta et.al. 2404.19622 null
2024-04-30 Physical Non-inertial Poser (PNP): Modeling Non-inertial Effects in Sparse-inertial Human Motion Capture Xinyu Yi et.al. 2404.19619 null
2024-04-30 Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging Rayan Armani et.al. 2404.19541 link
2024-04-29 4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations Wenbo Wang et.al. 2404.18630 link
2024-04-27 Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs Yiming Bao et.al. 2404.17837 null
2024-04-26 Clustering of Motion Trajectories by a Distance Measure Based on Semantic Features Christoph Zelch et.al. 2404.17269 link
2024-04-25 SHINE: Social Homology Identification for Navigation in Crowded Environments Diego Martinez-Baselga et.al. 2404.16705 null
2024-04-23 WANDR: Intention-guided Human Motion Generation Markos Diomataris et.al. 2404.15383 null
2024-04-20 Efficient Verification of a RADAR SoC Using Formal and Simulation-Based Methods Aman Kumar et.al. 2404.15371 null
2024-04-19 A Weight-aware-based Multi-source Unsupervised Domain Adaptation Method for Human Motion Intention Recognition Xiao-Yin Liu et.al. 2404.15366 link
2024-04-23 TAAT: Think and Act from Arbitrary Texts in Text2Motion Runqi Wang et.al. 2404.14745 null
2024-04-21 MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions Sheng Yan et.al. 2404.13657 link
2024-04-19 Purposer: Putting Human Motion Generation in Context Nicolas Ugrinovic et.al. 2404.12942 null
2024-04-19 MCM: Multi-condition Motion Synthesis Framework Zeyu Ling et.al. 2404.12886 null
2024-04-17 Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion Xinghan Wang et.al. 2404.11375 null
2024-04-17 Following the Human Thread in Social Navigation Luca Scofano et.al. 2404.11327 link
2024-04-16 HumMUSS: Human Motion Understanding using State Space Models Arnab Kumar Mondal et.al. 2404.10880 null
2024-04-15 in2IN: Leveraging individual Information to Generate Human INteractions Pablo Ruiz Ponce et.al. 2404.09988 null
2024-04-15 Learning Human Motion from Monocular Videos via Cross-Modal Manifold Alignment Shuaiying Hou et.al. 2404.09499 null
2024-04-12 Synthesis of Through-Wall Micro-Doppler Signatures of Human Motions Using Generative Adversarial Networks Kainat Yasmeen Shobha Sundar Ram et.al. 2404.08739 null
2024-04-12 EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams Christen Millerdurai et.al. 2404.08640 link
2024-04-11 Model Predictive Trajectory Planning for Human-Robot Handovers Thies Oelerich et.al. 2404.07505 null
2024-04-08 Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning Mahsa Ehsanpour et.al. 2404.05578 null
2024-04-08 Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning Jaewoo Jeong et.al. 2404.05218 link
2024-04-07 A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals Jiangnan Tang et.al. 2404.04890 null
2024-04-05 PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos Yufei Zhang et.al. 2404.04430 null
2024-04-04 Towards more realistic human motion prediction with attention to motion coordination Pengxiang Ding et.al. 2404.03584 null
2024-04-03 MotionChain: Conversational Motion Controllers via Multimodal Prompts Biao Jiang et.al. 2404.01700 link
2024-04-02 Leveraging Digital Perceptual Technologies for Remote Perception and Analysis of Human Biomechanical Processes: A Contactless Approach for Workload and Joint Force Assessment Jesudara Omidokun et.al. 2404.01576 null
2024-04-01 Large Motion Model for Unified Multi-Modal Motion Generation Mingyuan Zhang et.al. 2404.01284 null
2024-04-02 SurMo: Surface-based 4D Motion Modeling for Dynamic Human Rendering Tao Hu et.al. 2404.01225 null
2024-03-29 A Unified Framework for Human-centric Point Cloud Video Understanding Yiteng Xu et.al. 2403.20031 null
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652 null
2024-03-28 RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method Ming Yan et.al. 2403.19501 null
2024-03-28 Beyond Talking -- Generating Holistic 3D Human Dyadic Motion for Communication Mingze Sun et.al. 2403.19467 null
2024-04-01 BAMM: Bidirectional Autoregressive Motion Model Ekkasit Pinyoanuntapong et.al. 2403.19435 link
2024-03-30 Egocentric Scene-aware Human Trajectory Prediction Weizhuo Wang et.al. 2403.19026 null
2024-03-26 Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance Zan Wang et.al. 2403.18036 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936 null
2024-03-30 MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors He Zhang et.al. 2403.17610 null
2024-03-28 Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method Jie Tian et.al. 2403.16169 null
2024-03-26 PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling Xiaoyun Zheng et.al. 2403.16080 link
2024-03-23 Human Motion Prediction under Unexpected Perturbation Jiangbei Yue et.al. 2403.15891 null
2024-03-23 Contact-aware Human Motion Generation from Textual Descriptions Sihan Ma et.al. 2403.15709 null
2024-03-22 GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner Haoxuan Qu et.al. 2403.14947 null
2024-03-21 HCTO: Optimality-Aware LiDAR Inertial Odometry with Hybrid Continuous Time Optimization for Compact Wearable Mapping System Jianping Li et.al. 2403.14173 link
2024-03-21 Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration Zhihao Wang et.al. 2403.14104 null
2024-03-20 CoMo: Controllable Motion Generation through Language Guided Pose Code Editing Yiming Huang et.al. 2403.13900 null
2024-03-20 LaCE-LHMP: Airflow Modelling-Inspired Long-Term Human Motion Prediction By Enhancing Laminar Characteristics in Human Flow Yufei Zhu et.al. 2403.13640 link
2024-03-21 LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment Peishan Cong et.al. 2403.13307 link
2024-03-20 Map-Aware Human Pose Prediction for Robot Follow-Ahead Qingyuan Jiang et.al. 2403.13294 null
2024-03-19 WHAC: World-grounded Humans and Cameras Wanqi Yin et.al. 2403.12959 link
2024-03-18 Graph-Jigsaw Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection Ali Karami et.al. 2403.12172 null
2024-03-18 UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling Yujiao Jiang et.al. 2403.11589 null
2024-03-17 FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction Xiaohan Zhang et.al. 2403.11237 null
2024-03-17 THOR: Text to Human-Object Interaction Diffusion via Relation Intervention Qianyang Wu et.al. 2403.11208 null
2024-03-14 GazeMotion: Gaze-guided Human Motion Forecasting Zhiming Hu et.al. 2403.09885 null
2024-03-14 THÖR-MAGNI: A Large-scale Indoor Motion Capture Recording of Human Movement and Robot Interaction Tim Schreiter et.al. 2403.09285 link
2024-03-13 Scaling Up Dynamic Human-Scene Interaction Modeling Nan Jiang et.al. 2403.08629 null
2024-03-12 DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation Chen Wang et.al. 2403.07788 null
2024-03-19 Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM Zeyu Zhang et.al. 2403.07487 link
2024-03-10 Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation Paweł A. Pierzchlewicz et.al. 2403.06164 link
2024-03-09 MATRIX: Multi-Agent Trajectory Generation with Diverse Contexts Zhuo Xu et.al. 2403.06041 null
2024-03-09 Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information Qiaochu Huang et.al. 2403.05834 link
2024-03-08 Integrating Predictive Motion Uncertainties with Distributionally Robust Risk-Aware Control for Safe Robot Navigation in Crowds Kanghyun Ryu et.al. 2403.05081 link
2024-03-11 Fooling Neural Networks for Motion Forecasting via Adversarial Attacks Edgar Medina et.al. 2403.04954 null
2024-03-06 HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations Peng Dai et.al. 2403.03561 null
2024-03-01 Tri-Modal Motion Retrieval by Learning a Joint Embedding Space Kangning Yin et.al. 2403.00691 null
2024-02-21 Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting Edgar Medina et.al. 2402.19237 link
2024-02-29 MOSAIC: A Modular System for Assistive and Interactive Cooking Huaxiaoyue Wang et.al. 2402.18796 null
2024-02-27 SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents Wei Xiang et.al. 2402.17339 link
2024-02-27 LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment Yiming Ren et.al. 2402.17171 null
2024-03-06 Expressive Whole-Body Control for Humanoid Robots Xuxin Cheng et.al. 2402.16796 null
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509 link
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-20 A Recurrent Neural Network Enhanced Unscented Kalman Filter for Human Motion Prediction Wansong Liu et.al. 2402.13045 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099 null
2024-02-04 Custom IMU-Based Wearable System for Robust 2.4 GHz Wireless Human Body Parts Orientation Tracking and 3D Movement Visualization on an Avatar Javier González-Alonso et.al. 2402.09459 null
2024-01-30 Progress in artificial intelligence applications based on the combination of self-driven sensors and deep learning Weixiang Wan et.al. 2402.09442 null
2024-02-13 Approximately Piecewise E(3) Equivariant Point Networks Matan Atzmon et.al. 2402.08529 null
2024-02-11 Self-Correcting Self-Consuming Loops for Generative Model Training Nate Gillman et.al. 2402.07087 link
2024-02-06 Bidirectional Autoregressive Diffusion Model for Dance Generation Canyu Zhang et.al. 2402.04356 link
2024-02-06 Novel IMU-based Adaptive Estimator of the Center of Rotation of Joints for Movement Analysis Sara García-de-Villa et.al. 2402.04240 null
2024-02-05 Replication of Impedance Identification Experiments on a Reinforcement-Learning-Controlled Digital Twin of Human Elbows Hao Yu et.al. 2402.02904 null
2024-02-01 Transferring human emotions to robot motions using Neural Policy Style Transfer Raul Fernandez-Fernandez et.al. 2402.00663 null
2024-01-25 Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks Tianhe Ren et.al. 2401.14159 link
2024-01-24 Generative Human Motion Stylization in Latent Space Chuan Guo et.al. 2401.13505 null
2024-01-24 GTAutoAct: An Automatic Datasets Generation Framework Based on Game Engine Redevelopment for Action Recognition Xingyu Song et.al. 2401.13414 null
2024-01-23 Workspace Optimization Techniques to Improve Prediction of Human Motion During Human-Robot Collaboration Yi-Shiuan Tung et.al. 2401.12965 null
2024-01-23 Inertial Sensors for Human Motion Analysis: A Comprehensive Review Sara García-de-Villa et.al. 2401.12919 null
2024-01-23 A database of physical therapy exercises with variability of execution collected by wearable sensors Sara García-de-Villa et.al. 2401.12868 null
2024-01-22 Full-Body Motion Reconstruction with Sparse Sensing from Graph Perspective Feiyu Yao et.al. 2401.11783 link
2024-01-24 MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation Nhat M. Hoang et.al. 2401.11115 link
2024-01-19 Equivariant Graph Neural Operator for Modeling 3D Dynamics Minkai Xu et.al. 2401.11037 link
2024-01-16 RoHM: Robust Human Motion Reconstruction via Diffusion Siwei Zhang et.al. 2401.08570 null

Image Generation

Publish Date Title Authors PDF Code
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525 link
2024-06-10 The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems Philippe Gonzalez et.al. 2406.06160 null
2024-06-10 ProcessPainter: Learn Painting Process from Sequence Data Yiren Song et.al. 2406.06062 null
2024-06-09 OmniControlNet: Dual-stage Integration for Conditional Image Generation Yilin Wang et.al. 2406.05871 null
2024-06-09 Unified Text-to-Image Generation and Retrieval Leigang Qu et.al. 2406.05814 null
2024-06-11 MLCM: Multistep Consistency Distillation of Latent Diffusion Model Qingsong Xie et.al. 2406.05768 null
2024-06-09 PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction Shangyu Chen et.al. 2406.05641 null
2024-06-09 Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models Philip Wootaek Shin et.al. 2406.05602 null
2024-06-08 Medical Vision Generalist: Unifying Medical Imaging Tasks in Context Sucheng Ren et.al. 2406.05565 link
2024-06-08 Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis Zanlin Ni et.al. 2406.05478 null
2024-06-07 GenHeld: Generating and Editing Handheld Objects Chaerin Min et.al. 2406.05059 link
2024-06-07 GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications Shakhnaz Akhmedova et.al. 2406.05023 link
2024-06-07 AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation Lianyu Pang et.al. 2406.05000 null
2024-06-07 TEDi Policy: Temporally Entangled Diffusion for Robotic Control Sigmund H. Høeg et.al. 2406.04806 null
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746 link
2024-06-07 GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models Diptanu De et.al. 2406.04654 null
2024-06-07 CLoG: Benchmarking Continual Learning of Image Generation Models Haotian Zhang et.al. 2406.04584 link
2024-06-06 Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance Reyhane Askari Hemmat et.al. 2406.04551 null
2024-06-06 GenAI Arena: An Open Evaluation Platform for Generative Models Dongfu Jiang et.al. 2406.04485 null
2024-06-06 Evaluating Large Vision-Language Models' Understanding of Real-World Complexities Through Synthetic Benchmarks Haokun Zhou et.al. 2406.04470 null
2024-06-06 Coherent Zero-Shot Visual Instruction Generation Quynh Phung et.al. 2406.04337 null
2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Yang Sui et.al. 2406.04333 link
2024-06-06 Diffusion-based image inpainting with internal learning Nicolas Cherel et.al. 2406.04206 null
2024-06-06 Machine Learning-Driven Microwave Imaging for Soil Moisture Estimation near Leaky Pipe Mohammad Ramezaninia et.al. 2406.04193 null
2024-06-06 Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis Marianna Ohanyan et.al. 2406.04032 link
2024-06-06 Quantum Implicit Neural Representations Jiaming Zhao et.al. 2406.03873 link
2024-06-06 Semantic Similarity Score for Measuring Visual Similarity at Semantic Level Senran Fan et.al. 2406.03865 null
2024-06-06 Malware Classification Based on Image Segmentation Wanhu Nie et.al. 2406.03831 null
2024-06-07 ReDistill: Residual Encoded Distillation for Peak Memory Reduction Fang Chen et.al. 2406.03744 null
2024-06-06 JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits Minzhou Pan et.al. 2406.03720 link
2024-06-05 Tackling GenAI Copyright Issues: Originality Estimation and Genericization Hiroaki Chiba-Okabe et.al. 2406.03341 null
2024-06-05 Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Hao Wen et.al. 2406.03184 link
2024-06-05 Language-guided Detection and Mitigation of Unknown Dataset Bias Zaiying Zhao et.al. 2406.02889 null
2024-06-06 Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter Peng Xing et.al. 2406.02881 null
2024-06-04 Latent Style-based Quantum GAN for high-quality Image Generation Su Yeon Chang et.al. 2406.02668 null
2024-06-04 DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Zhongpai Gao et.al. 2406.02518 null
2024-06-04 Guiding a Diffusion Model with a Bad Version of Itself Tero Karras et.al. 2406.02507 null
2024-06-04 Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation Jiajun Wang et.al. 2406.02485 link
2024-06-04 Generative Active Learning for Long-tailed Instance Segmentation Muzhi Zhu et.al. 2406.02435 link
2024-06-05 Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Clement Chadebec et.al. 2406.02347 link
2024-06-04 I4VGen: Image as Stepping Stone for Text-to-Video Generation Xiefan Guo et.al. 2406.02230 null
2024-06-04 Analyzing the Feature Extractor Networks for Face Image Synthesis Erdi Sarıtaş et.al. 2406.02153 link
2024-06-04 The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise Yuanhao Ban et.al. 2406.01970 null
2024-06-04 Enhance Image-to-Image Generation with LLaVA Prompt and Negative Prompt Zhicheng Ding et.al. 2406.01956 null
2024-06-04 Plug-and-Play Diffusion Distillation Yi-Ting Hsiao et.al. 2406.01954 null
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048 null
2024-05-31 You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet Zhen Qin et.al. 2405.21022 null
2024-05-31 Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman et.al. 2405.20971 link
2024-05-31 Information Theoretic Text-to-Image Alignment Chao Wang et.al. 2405.20759 null
2024-05-31 Diffusion Models Are Innate One-Step Generators Bowen Zheng et.al. 2405.20750 link
2024-05-31 Cyclic image generation using chaotic dynamics Takaya Tanaka et.al. 2405.20717 link
2024-05-31 Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space Yukai Zhang et.al. 2405.20685 null
2024-05-31 Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling Kidist Amde Mekonnen et.al. 2405.20675 link
2024-05-31 Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation Shuzhou Yang et.al. 2405.20669 link
2024-05-31 Learning Gaze-aware Compositional GAN Nerea Aranjuelo et.al. 2405.20643 link
2024-05-30 SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow Chaoyang Wang et.al. 2405.20282 link
2024-05-30 ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections Massimo Bini et.al. 2405.20271 link
2024-05-30 Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Sanghyeon Na et.al. 2405.20216 null
2024-05-30 RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection Zhiyuan He et.al. 2405.20112 null
2024-05-30 Mitigating annotation shift in cancer classification using single image generative models Marta Buetas Arcas et.al. 2405.19754 link
2024-05-30 Text Guided Image Editing with Automatic Concept Locating and Forgetting Jia Li et.al. 2405.19708 null
2024-05-30 Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian Wei Sun et.al. 2405.19657 null
2024-05-30 Creating Language-driven Spatial Variations of Icon Images Xianghao Xu et.al. 2405.19636 null
2024-05-29 Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models Venkat Venkatasubramanian et.al. 2405.19561 null
2024-05-29 MemControl: Mitigating Memorization in Medical Diffusion Models via Automated Parameter Selection Raman Dutt et.al. 2405.19458 null
2024-05-29 ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning Ruchika Chavhan et.al. 2405.19237 link
2024-05-29 Going beyond compositional generalization, DDPMs can produce zero-shot interpolation Justin Deschenaux et.al. 2405.19201 link
2024-05-29 The ethical situation of DALL-E 2 Eduard Hogea et.al. 2405.19176 null
2024-05-29 Patch-enhanced Mask Encoder Prompt Image Generation Shusong Xu et.al. 2405.19085 null
2024-05-29 EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture Jiaqi Xu et.al. 2405.18991 link
2024-05-29 Topological Perspectives on Optimal Multimodal Embedding Spaces Abdul Aziz A. B et.al. 2405.18867 null
2024-05-29 Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching Yasi Zhang et.al. 2405.18816 null
2024-05-29 SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation Zhenbei Wu et.al. 2405.18801 null
2024-05-30 Inpaint Biases: A Pathway to Accurate and Unbiased Image Generation Jiyoon Myung et.al. 2405.18762 null
2024-05-29 SketchDeco: Decorating B&W Sketches with Colour Chaitat Utintu et.al. 2405.18716 link
2024-05-28 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting Qihang Zhang et.al. 2405.18424 null
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407 null
2024-05-28 Multi-modal Generation via Cross-Modal In-Context Learning Amandeep Kumar et.al. 2405.18304 link
2024-05-28 Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? Zebin You et.al. 2405.18029 null
2024-05-28 Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection Zhengji Li et.al. 2405.17905 null
2024-05-27 RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance Jiaojiao Fan et.al. 2405.17661 null
2024-05-27 Prompt Optimization with Human Feedback Xiaoqiang Lin et.al. 2405.17346 link
2024-05-27 From Text to Blueprint: Leveraging Text-to-Image Tools for Floor Plan Creation Xiaoyu Li et.al. 2405.17236 null
2024-05-27 Training-free Editioning of Text-to-Image Models Jinqi Wang et.al. 2405.17069 null
2024-05-27 The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models Saravanan Kandasamy et.al. 2405.17068 null
2024-05-27 Glauber Generative Model: Discrete Diffusion Models via Binary Classification Harshit Varma et.al. 2405.17035 null
2024-05-27 Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation Liang Shi et.al. 2405.16895 null
2024-05-27 Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks Yunqi Zhang et.al. 2405.16860 link
2024-05-27 Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection Gihyun Kwon et.al. 2405.16823 null
2024-05-27 TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing Xinyu Zhang et.al. 2405.16803 null
2024-05-27 PromptFix: You Prompt and We Fix the Photo Yongsheng Yu et.al. 2405.16785 null
2024-05-24 FastDrag: Manipulate Anything in One Step Xuanjia Zhao et.al. 2405.15769 null
2024-05-24 Learning to Discretize Denoising Diffusion ODEs Vinh Tong et.al. 2405.15506 null
2024-05-24 A Misleading Gallery of Fluid Motion by Generative Artificial Intelligence Ali Kashefi et.al. 2405.15406 link
2024-05-24 Stochastic SR for Gaussian microtextures Emile Pierret et.al. 2405.15399 null
2024-05-24 Challenges and Opportunities in 3D Content Generation Ke Zhao et.al. 2405.15335 null
2024-05-24 Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model Mingyang Yi et.al. 2405.15330 null
2024-05-24 SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance Guibao Shen et.al. 2405.15321 null
2024-05-24 Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion Aoxue Li et.al. 2405.15313 null
2024-05-24 Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient Yongliang Wu et.al. 2405.15304 null
2024-05-24 StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models Chengming Xu et.al. 2405.15287 null
2024-05-23 Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin et.al. 2405.14867 link
2024-05-23 Semantica: An Adaptable Image-Conditioned Diffusion Model Manoj Kumar et.al. 2405.14857 null
2024-05-23 TerDiT: Ternary Diffusion Models with Transformers Xudong Lu et.al. 2405.14854 link
2024-05-23 Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models Katherine Xu et.al. 2405.14828 null
2024-05-24 Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation Hongxu Jiang et.al. 2405.14802 link
2024-05-23 Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy Shengfang Zhai et.al. 2405.14800 null
2024-05-23 RetAssist: Facilitating Vocabulary Learners with Generative Images in Story Retelling Practices Qiaoyi Chen et.al. 2405.14794 null
2024-05-23 EditWorld: Simulating World Dynamics for Instruction-Following Image Editing Ling Yang et.al. 2405.14785 link
2024-05-23 OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance Shuheng Ge et.al. 2405.14709 null
2024-05-23 Learning Multi-dimensional Human Preference for Text-to-Image Generation Sixian Zhang et.al. 2405.14705 null
2024-05-21 Personalized Residuals for Concept-Driven Text-to-Image Generation Cusuh Ham et.al. 2405.12978 null
2024-05-21 An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation Zhiyu Tan et.al. 2405.12914 link
2024-05-21 Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations Antoine Legrand et.al. 2405.12728 null
2024-05-21 EmoEdit: Evoking Emotions through Image Manipulation Jingyuan Yang et.al. 2405.12661 null
2024-05-21 CustomText: Customized Textual Image Generation using Diffusion Models Shubham Paliwal et.al. 2405.12531 null
2024-05-21 Customize Your Own Paired Data via Few-shot Way Jinshu Chen et.al. 2405.12490 null
2024-05-20 Diffusion for World Modeling: Visual Details Matter in Atari Eloi Alonso et.al. 2405.12399 link
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211 link
2024-05-20 Diffusion Models for Generating Ballistic Spacecraft Trajectories Tyler Presser et.al. 2405.11738 link
2024-05-19 URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images Zoey Chen et.al. 2405.11656 null
2024-05-18 UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers Duo Peng et.al. 2405.11336 null
2024-05-18 On the Trajectory Regularity of ODE-based Diffusion Sampling Defang Chen et.al. 2405.11326 link
2024-05-18 TriLoRA: Integrating SVD for Advanced Style Personalization in Text-to-Image Generation Chengcheng Feng et.al. 2405.11236 null
2024-05-18 ReasonPix2Pix: Instruction Reasoning Dataset for Advanced Image Editing Ying Jin et.al. 2405.11190 null
2024-05-17 Improving face generation quality and prompt following with synthetic captions Michail Tarasiou et.al. 2405.10864 null
2024-05-17 Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image Jianshun Zeng et.al. 2405.10504 null
2024-05-17 Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers Rya Sanovar et.al. 2405.10480 null
2024-05-16 UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models Sahel Sharifymoghaddam et.al. 2405.10311 null
2024-05-16 VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing Binghui Chen et.al. 2405.09985 null
2024-05-16 KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment Zhengxu Shi et.al. 2405.09964 null
2024-05-16 Chameleon: Mixed-Modal Early-Fusion Foundation Models Chameleon Team et.al. 2405.09818 null
2024-05-16 MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis Joseph Cho et.al. 2405.09806 null
2024-05-16 Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images Memoona Aziz et.al. 2405.09426 null
2024-05-15 DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations Nima Fathi et.al. 2405.09288 link
2024-05-14 Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Zhimin Li et.al. 2405.08748 link
2024-05-15 Similarity Metrics for MR Image-To-Image Translation Melanie Dohmen et.al. 2405.08431 null
2024-05-14 Compositional Text-to-Image Generation with Dense Blob Representations Weili Nie et.al. 2405.08246 null
2024-05-13 RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations Chengde Lin et.al. 2405.08114 link
2024-05-13 CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models Nick Stracke et.al. 2405.07913 null
2024-05-13 SAR Image Synthesis with Diffusion Models Denisa Qosja et.al. 2405.07776 null
2024-05-12 Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning Jiarui Wang et.al. 2405.07346 link
2024-05-12 Stable Signature is Unstable: Removing Image Watermark from Diffusion Models Yuepeng Hu et.al. 2405.07145 null
2024-05-12 MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototyping Mingyue Yuan et.al. 2405.07131 null
2024-05-11 Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior Ce Wang et.al. 2405.07044 link
2024-05-11 Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation Shengyuan Liu et.al. 2405.06948 null
2024-05-10 Deep MMD Gradient Flow without adversarial training Alexandre Galashov et.al. 2405.06780 null
2024-05-10 Controllable Image Generation With Composed Parallel Token Prediction Jamie Stirling et.al. 2405.06535 null
2024-05-14 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461 null
2024-05-09 Photonic quantum generative adversarial networks for classical data Tigran Sedrakyan et.al. 2405.06023 null
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953 null
2024-05-09 Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models Zhe Ma et.al. 2405.05846 null
2024-05-10 MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation Yuxiang Wei et.al. 2405.05806 link
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800 null
2024-05-09 Exploring Text-Guided Single Image Editing for Remote Sensing Images Fangzhou Han et.al. 2405.05769 null
2024-05-09 End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base Shuling Li et.al. 2405.05738 null
2024-05-09 VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis Zhihan Ju et.al. 2405.05667 null
2024-05-09 A Survey on Personalized Content Synthesis with Diffusion Models Xulu Zhang et.al. 2405.05538 null
2024-05-08 Cross-Modality Translation with Generative Adversarial Networks to Unveil Alzheimer's Disease Biomarkers Reihaneh Hassanzadeh et.al. 2405.05462 null
2024-05-08 DrawL: Understanding the Effects of Non-Mainstream Dialects in Prompted Image Generation Joshua N. Williams et.al. 2405.05382 link
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255 link
2024-05-08 Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI Keqiang Fan et.al. 2405.04974 null
2024-05-08 HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis Zhihan Ju et.al. 2405.04902 null
2024-05-08 FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation Xuehai He et.al. 2405.04834 null
2024-05-07 TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model Yongming Zhang et.al. 2405.04675 null
2024-05-07 ResNCT: A Deep Learning Model for the Synthesis of Nephrographic Phase Images in CT Urography Syed Jamal Safdar Gardezi et.al. 2405.04629 null
2024-05-07 Towards Geographic Inclusion in the Evaluation of Text-to-Image Models Melissa Hall et.al. 2405.04457 null
2024-05-07 Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation Jihyun Kim et.al. 2405.04356 null
2024-05-08 Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer Zhuoyi Yang et.al. 2405.04312 link
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290 null
2024-05-07 SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing Yuying Ge et.al. 2405.04007 link
2024-05-07 Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model Joo Young Choi et.al. 2405.03958 null
2024-05-06 Generated Contents Enrichment Mahdi Naseri et.al. 2405.03650 null
2024-05-06 CCDM: Continuous Conditional Diffusion Models for Image Generation Xin Ding et.al. 2405.03546 link
2024-05-05 Data-Efficient Molecular Generation with Hierarchical Textual Inversion Seojin Kim et.al. 2405.02845 null
2024-05-05 ImageInWords: Unlocking Hyper-Detailed Image Descriptions Roopal Garg et.al. 2405.02793 link
2024-05-04 U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers Yuchuan Tian et.al. 2405.02730 link
2024-05-03 Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI Minhui Yu et.al. 2405.02504 null
2024-05-03 Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification Siqi Yin et.al. 2405.02155 null
2024-05-03 AI-generated art perceptions with GenFrame -- an image-generating picture frame Peter Kun et.al. 2405.01901 null
2024-05-03 Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition Yichun Tai et.al. 2405.01872 null
2024-05-02 Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning Rafael Elberg et.al. 2405.01705 link
2024-05-02 LocInv: Localization-aware Inversion for Text-Guided Image Editing Chuanming Tang et.al. 2405.01496 link
2024-05-02 Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance Kelvin C. K. Chan et.al. 2405.01356 null
2024-05-02 Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration Praveen Kumar Chandaliya et.al. 2405.01273 null
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248 null
2024-05-02 On Mechanistic Knowledge Localization in Text-to-Image Generative Models Samyadeep Basu et.al. 2405.01008 null
2024-05-01 SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models Burak Can Biner et.al. 2405.00878 null
2024-05-01 Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers Palawat Busaranuvong et.al. 2405.00858 null
2024-05-01 TexSliders: Diffusion-Based Texture Editing in CLIP Space Julia Guerrero-Viu et.al. 2405.00672 null
2024-05-01 RGB $\leftrightarrow$ X: Image decomposition and synthesis using material- and lighting-aware diffusion models Zheng Zeng et.al. 2405.00666 null
2024-05-01 UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement Ruiquan Ge et.al. 2405.00542 link
2024-05-01 Compressive Sensing Imaging Using Caustic Lens Mask Generated by Periodic Perturbation in a Ripple Tank Doğan Tunca Arık et.al. 2405.00407 null
2024-05-01 Streamlining Image Editing with Layered Diffusion Brushes Peyman Gholami et.al. 2405.00313 null
2024-04-30 DOCCI: Descriptions of Connected and Contrasting Images Yasumasa Onoe et.al. 2404.19753 null
2024-04-30 Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Yunhao Ge et.al. 2404.19752 null
2024-04-30 SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration Yuto Nakashima et.al. 2404.19693 null
2024-04-30 TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models Teng Zhou et.al. 2404.19475 null
2024-04-30 InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Chanran Kim et.al. 2404.19427 null
2024-05-01 FOTS: A Fast Optical Tactile Simulator for Sim2Real Learning of Tactile-motor Robot Manipulation Skills Yongqiang Zhao et.al. 2404.19217 link
2024-04-30 NeRF-Insert: 3D Local Editing with Multimodal Control Signals Benet Oriol Sabat et.al. 2404.19204 null
2024-04-29 DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing Minghao Chen et.al. 2404.18929 null
2024-04-29 TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation Junhao Cheng et.al. 2404.18919 link
2024-04-29 Hide and Seek: How Does Watermarking Impact Face Recognition? Yuguang Yao et.al. 2404.18890 null
2024-04-29 Learning Mixtures of Gaussians Using Diffusion Models Khashayar Gatmiry et.al. 2404.18869 null
2024-04-29 FlexiFilm: Long Video Generation with Flexible Conditions Yichen Ouyang et.al. 2404.18620 link
2024-04-29 Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting Tianyidan Xie et.al. 2404.18598 null
2024-04-29 SIDBench: A Python Framework for Reliably Assessing Synthetic Image Detection Methods Manos Schinas et.al. 2404.18552 link
2024-04-29 Towards Image Synthesis with Photon Counting Stellar Intensity Interferometry Alessia Spolon et.al. 2404.18507 null
2024-04-29 Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology Luzhe Huang et.al. 2404.18458 null
2024-04-29 PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images Jiquan Yuan et.al. 2404.18409 link
2024-04-26 Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement Zishu Yao et.al. 2404.17400 null
2024-04-26 Trinity Detector: text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection Jiawei Song et.al. 2404.17254 null
2024-04-26 Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis Shivangi Yadav et.al. 2404.17105 null
2024-04-25 REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao et.al. 2404.16767 link
2024-04-27 Denoising: from classical methods to deep CNNs Jean-Eric Campagne et.al. 2404.16617 link
2024-04-25 MuseumMaker: Continual Style Customization without Catastrophic Forgetting Chenxi Liu et.al. 2404.16612 null
2024-04-25 AudioScenic: Audio-Driven Video Scene Editing Kaixin Shen et.al. 2404.16581 null
2024-04-25 Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models Parul Gupta et.al. 2404.16556 null
2024-04-25 OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images Ye Mao et.al. 2404.16538 null
2024-04-25 Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series Aimi Okabayashi et.al. 2404.16409 link
2024-04-26 Guardians of the Quantum GAN Archisman Ghosh et.al. 2404.16156 null
2024-04-24 Spinning solar jets explained through the interplay between plasma sheets and vortex columns Sahel Dey et.al. 2404.16096 null
2024-04-24 Editable Image Elements for Controllable Synthesis Jiteng Mu et.al. 2404.16029 null
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449 null
2024-04-23 GLoD: Composing Global Contexts and Local Details in Image Generation Moyuru Yamada et.al. 2404.15447 null
2024-04-23 From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation Zehuan Huang et.al. 2404.15267 null
2024-04-23 Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment Tianwei Zhou et.al. 2404.15163 null
2024-04-23 Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation Xun Wu et.al. 2404.15100 null
2024-04-23 SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models Bo Lin et.al. 2404.14755 null
2024-04-23 FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction Hang Hua et.al. 2404.14715 null
2024-04-22 The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking Yuying Li et.al. 2404.14581 null
2024-04-22 GeoDiffuser: Geometry-Based Image Editing with Diffusion Models Rahul Sajnani et.al. 2404.14403 null
2024-04-22 SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation Yuying Ge et.al. 2404.14396 link
2024-04-22 MultiBooth: Towards Generating All Your Concepts in an Image from Text Chenyang Zhu et.al. 2404.14239 link
2024-04-22 RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance Chengrui Wang et.al. 2404.13984 null
2024-04-23 Accelerating Image Generation with Sub-path Linear Approximation Model Chen Xu et.al. 2404.13903 null
2024-04-22 Towards Better Text-to-Image Generation Alignment via Attention Modulation Yihang Wu et.al. 2404.13899 null
2024-04-21 Enforcing Conditional Independence for Fair Representation Learning and Causal Image Generation Jensen Hwa et.al. 2404.13798 null
2024-04-21 Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control Maria Mihaela Trusca et.al. 2404.13766 null
2024-04-21 ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis Zichen Tang et.al. 2404.13711 link
2024-04-21 Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models Vitali Petsiuk et.al. 2404.13706 null
2024-04-19 Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images Santosh et.al. 2404.12908 link
2024-04-19 Generative Modelling with High-Order Langevin Dynamics Ziqiang Shi et.al. 2404.12814 null
2024-04-19 PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy Zepeng Jiang et.al. 2404.12730 null
2024-04-19 How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples Dren Fazlija et.al. 2404.12653 null
2024-04-18 Lazy Diffusion Transformer for Interactive Image Editing Yotam Nitzan et.al. 2404.12382 null
2024-04-18 Customizing Text-to-Image Diffusion with Camera Viewpoint Control Nupur Kumari et.al. 2404.12333 null
2024-04-18 Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models Israel A. Laurensi et.al. 2404.12260 null
2024-04-18 StyleBooth: Image Style Editing with Multimodal Instruction Zhen Han et.al. 2404.12154 link
2024-04-18 First 2D electron density measurements using Coherence Imaging Spectroscopy in the MAST-U Super-X divertor N. Lonigro et.al. 2404.12021 null
2024-04-18 ©Plug-in Authorization for Human Content Copyright Protection in Text-to-Image Model Chao Zhou et.al. 2404.11962 null
2024-04-18 LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Thibault Castells et.al. 2404.11936 null
2024-04-18 EdgeFusion: On-Device Text-to-Image Generation Thibault Castells et.al. 2404.11925 null
2024-04-18 FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models Wei Wu et.al. 2404.11895 null
2024-04-18 Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans Lixing Tan et.al. 2404.11889 null
2024-04-17 On the Scalability of GNNs for Molecular Graphs Maciej Sypetkowski et.al. 2404.11568 null
2024-04-17 MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation Kuan-Chieh et.al. 2404.11565 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537 null
2024-04-17 Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt Zhanjie Zhang et.al. 2404.11474 link
2024-04-17 Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks Eri Hosonuma et.al. 2404.11280 null
2024-04-17 Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case João Gabriel Vinholi et.al. 2404.11243 null
2024-04-17 TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing Sherry X. Chen et.al. 2404.11120 link
2024-04-16 LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Yuchi Wang et.al. 2404.10763 link
2024-04-16 Adversarial Identity Injection for Semantic Face Image Synthesis Giuseppe Tarollo et.al. 2404.10408 null
2024-04-16 Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery Payal Varshney et.al. 2404.10356 null
2024-04-16 CanvasPic: An Interactive Tool for Freely Generating Facial Images Based on Spatial Layout Jiafu Wei et.al. 2404.10352 null
2024-04-17 OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model Runyi Li et.al. 2404.10312 null
2024-04-16 OneActor: Consistent Character Generation via Cluster-Conditioned Guidance Jiahao Wang et.al. 2404.10267 null
2024-04-16 Diffusion assisted image reconstruction in optoacoustic tomography M. G. González et.al. 2404.10239 null
2024-04-15 Multi-objective evolutionary GAN for tabular data synthesis Nian Ran et.al. 2404.10176 link
2024-04-15 ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis Aashish Anantha Ramakrishnan et.al. 2404.10141 link
2024-04-15 HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing Mude Hui et.al. 2404.09990 null
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732 link
2024-04-15 In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation Han Xue et.al. 2404.09633 null
2024-04-15 Magic Clothing: Controllable Garment-Driven Image Synthesis Weifeng Chen et.al. 2404.09512 link
2024-04-15 Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models Peifei Zhu et.al. 2404.09401 null
2024-04-14 DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling Xuening Yuan et.al. 2404.09227 null
2024-04-13 InverseVis: Revealing the Hidden with Curved Sphere Tracing Kai Lawonn et.al. 2404.09092 null
2024-04-13 Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives Yidan Liu et.al. 2404.08926 null
2024-04-12 E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data Aref Azizpour et.al. 2404.08814 link
2024-04-12 Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation Brinnae Bent et.al. 2404.08799 link
2024-04-12 Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts Yang Li et.al. 2404.08341 link
2024-04-11 Latent Guard: a Safety Framework for Text-to-image Generation Runtao Liu et.al. 2404.08031 link
2024-04-11 Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models Mazda Moayeri et.al. 2404.08030 null
2024-04-11 OpenBias: Open-set Bias Detection in Text-to-Image Generative Models Moreno D'Incà et.al. 2404.07990 link
2024-04-11 Taming Stable Diffusion for Text to 360° Panorama Image Generation Cheng Zhang et.al. 2404.07949 link
2024-04-11 Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification Tuong Vy Nguyen et.al. 2404.07754 null
2024-04-11 Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models Tuomas Kynkäänniemi et.al. 2404.07724 null
2024-04-11 Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis Marc Aubreville et.al. 2404.07676 link
2024-04-11 Implicit and Explicit Language Guidance for Diffusion-based Visual Perception Hefeng Wang et.al. 2404.07600 null
2024-04-11 ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation Stanislav Frolov et.al. 2404.07564 null
2024-04-11 CAT: Contrastive Adapter Training for Personalized Image Generation Jae Wan Park et.al. 2404.07554 link
2024-04-10 Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models Yasi Zhang et.al. 2404.07389 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199 null
2024-04-10 A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks Neel Mishra et.al. 2404.07172 link
2024-04-10 Fine color guidance in diffusion models and its application to image compression at extremely low bitrates Tom Bordin et.al. 2404.06865 null
2024-04-10 UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion Junsheng Zhou et.al. 2404.06851 null
2024-04-10 MedRG: Medical Report Grounding with Multi-modal Large Language Model Ke Zou et.al. 2404.06798 null
2024-04-10 Deep Generative Data Assimilation in Multimodal Setting Yongquan Qu et.al. 2404.06665 link
2024-04-09 GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis Srikumar Sastry et.al. 2404.06637 link
2024-04-09 High Noise Scheduling is a Must Mahmut S. Gokmen et.al. 2404.06353 null
2024-04-09 Hyperparameter-Free Medical Image Synthesis for Sharing Data and Improving Site-Specific Segmentation Alexander Chebykin et.al. 2404.06240 link
2024-04-09 DiffHarmony: Latent Diffusion Model Meets Image Harmonization Pengfei Zhou et.al. 2404.06139 null
2024-04-09 Tackling Structural Hallucination in Image Translation with Local Diffusion Seunghoi Kim et.al. 2404.05980 null
2024-04-09 StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion Ming Tao et.al. 2404.05979 link
2024-04-08 SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing Jing Gu et.al. 2404.05717 null
2024-04-08 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Kunpeng Song et.al. 2404.05674 link
2024-04-08 Automatic Controllable Colorization via Imagination Xiaoyan Cong et.al. 2404.05661 null
2024-04-08 UniFL: Improve Stable Diffusion via Unified Feedback Learning Jiacheng Zhang et.al. 2404.05595 null
2024-04-08 Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models Saman Motamed et.al. 2404.05519 null
2024-04-08 Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI Hugo Caselles-Dupré et.al. 2404.05468 null
2024-04-08 Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt Zhiqi Huang et.al. 2404.05331 null
2024-04-08 MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation Jiaxiu Jiang et.al. 2404.05268 link
2024-04-08 Text-to-Image Synthesis for Any Artistic Styles: Advancements in Personalized Artistic Image Generation via Subdivision and Dual Binding Junseo Park et.al. 2404.05256 null
2024-04-08 A secure and private ensemble matcher using multi-vault obfuscated templates Babak Poorebrahim Gilkalaye et.al. 2404.05205 null
2024-04-04 No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance Vishaal Udandarao et.al. 2404.04125 link
2024-04-05 3D Facial Expressions through Analysis-by-Neural-Synthesis George Retsinas et.al. 2404.04104 null
2024-04-05 Dynamic Prompt Optimizing for Text-to-Image Generation Wenyi Mo et.al. 2404.04095 link
2024-04-05 Physics-Inspired Synthesized Underwater Image Dataset Reina Kaneko et.al. 2404.03998 null
2024-04-05 Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Gihyun Kwon et.al. 2404.03913 null
2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Dongzhi Jiang et.al. 2404.03653 link
2024-04-04 Reference-Based 3D-Aware Image Editing with Triplane Bahri Batuhan Bilecen et.al. 2404.03632 null
2024-04-04 Robust Concept Erasure Using Task Vectors Minh Pham et.al. 2404.03631 null
2024-04-04 Multi Positive Contrastive Learning with Pose-Consistent Generated Images Sho Inayoshi et.al. 2404.03256 null
2024-04-04 Would Deep Generative Models Amplify Bias in Future Models? Tianwei Chen et.al. 2404.03242 null
2024-04-04 Diverse and Tailored Image Generation for Zero-shot Multi-label Classification Kaixin Zhang et.al. 2404.03144 null
2024-04-04 GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis Emmanouil Nikolakakis et.al. 2404.03126 null
2024-04-03 Many-to-many Image Generation with Auto-regressive Diffusion Models Ying Shen et.al. 2404.03109 null
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905 link
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899 null
2024-04-03 Deep Image Composition Meets Image Forgery Eren Tahir et.al. 2404.02897 link
2024-04-03 On the Scalability of Diffusion-based Text-to-Image Generation Hao Li et.al. 2404.02883 null
2024-04-03 MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation Petru-Daniel Tudosiu et.al. 2404.02790 null
2024-04-03 InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation Haofan Wang et.al. 2404.02733 link
2024-04-03 Model-agnostic Origin Attribution of Generated Images with Few-shot Examples Fengyuan Liu et.al. 2404.02697 null
2024-04-03 Severity Controlled Text-to-Image Generative Model Bias Manipulation Jordan Vice et.al. 2404.02530 null
2024-04-02 Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148 link
2024-04-02 3D Congealing: 3D-Aware Image Alignment in the Wild Yunzhi Zhang et.al. 2404.02125 null
2024-04-02 Fashion Style Editing with Generative Human Prior Chaerin Kong et.al. 2404.01984 null
2024-04-02 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959 link
2024-04-02 Real, fake and synthetic faces -- does the coin have three sides? Shahzeb Naeem et.al. 2404.01878 null
2024-04-02 Disentangled Pre-training for Human-Object Interaction Detection Zhuolong Li et.al. 2404.01725 link
2024-04-01 PlayFutures: Imagining Civic Futures with AI and Puppets Supratim Pait et.al. 2404.01527 null
2024-04-01 Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data Matthias Gerstgrasser et.al. 2404.01413 null
2024-04-01 An image speaks a thousand words, but can everyone listen? On translating images for cultural relevance Simran Khanuja et.al. 2404.01247 link
2024-04-01 Uncovering the Text Embedding in Text-to-Image Diffusion Models Hu Yu et.al. 2404.01154 null
2024-03-29 Benchmarking Counterfactual Image Generation Thomas Melistas et.al. 2403.20287 link
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105 null
2024-03-29 SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image Yunhao Li et.al. 2403.20018 link
2024-04-02 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964 null
2024-03-28 Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks Pooria Ashrafian et.al. 2403.19880 link
2024-03-28 Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization Yuhang Li et.al. 2403.19866 null
2024-03-28 CLoRA: A Contrastive Approach to Compose Multiple LoRA Models Tuna Han Salih Meral et.al. 2403.19776 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653 link
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645 null
2024-03-28 Collaborative Interactive Evolution of Art in the Latent Space of Deep Generative Models Ole Hall et.al. 2403.19620 link
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593 null
2024-03-28 Imperceptible Protection against Style Imitation from Diffusion Models Namhyuk Ahn et.al. 2403.19254 null
2024-03-28 QNCD: Quantization Noise Correction for Diffusion Models Huanpeng Chu et.al. 2403.19140 link
2024-03-28 Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs John R. McNulty et.al. 2403.19107 null
2024-03-28 Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation Yutong He et.al. 2403.19103 null
2024-03-28 Purposeful remixing with generative AI: Constructing designer voice in multimodal composing Xiao Tan et.al. 2403.19095 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818 null
2024-03-27 Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching Jannis Chemseddine et.al. 2403.18705 link
2024-03-27 InstructBrush: Learning Attention-based Instruction Optimization for Image Editing Ruoyu Zhao et.al. 2403.18660 null
2024-03-28 FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing Trong-Tung Nguyen et.al. 2403.18605 null
2024-03-27 Attention Calibration for Disentangled Text-to-Image Personalization Yanbing Zhang et.al. 2403.18551 link
2024-03-27 DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis Zhongxi Chen et.al. 2403.18471 link
2024-03-27 U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models Ilias Mitsouras et.al. 2403.18425 null
2024-03-27 ECNet: Effective Controllable Text-to-Image Diffusion Models Sicheng Li et.al. 2403.18417 null
2024-03-27 Ship in Sight: Diffusion Models for Ship-Image Super Resolution Luigi Sigillo et.al. 2403.18370 link
2024-03-26 Tutorial on Diffusion Models for Imaging and Vision Stanley H. Chan et.al. 2403.18103 null
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870 null
2024-03-26 CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation Yongrui Yu et.al. 2403.17770 null
2024-03-26 LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection Yunpeng Luo et.al. 2403.17465 null
2024-03-25 DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment Stella Bounareli et.al. 2403.17217 null
2024-03-25 FlashFace: Human Image Personalization with High-fidelity Identity Preservation Shilong Zhang et.al. 2403.17008 null
2024-03-25 SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer Rui Zhu et.al. 2403.17004 null
2024-03-25 Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation Omer Dahary et.al. 2403.16990 null
2024-03-25 Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance Jingyuan Zhu et.al. 2403.16954 null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790 null
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776 null
2024-03-25 Multi-Scale Texture Loss for CT denoising with GANs Francesco Di Feola et.al. 2403.16640 link
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627 link
2024-03-25 An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models Zizhao Hu et.al. 2403.16530 null
2024-03-25 Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Sanyam Lakhanpal et.al. 2403.16422 null
2024-03-25 Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation Yingshan Chang et.al. 2403.16394 null
2024-03-25 FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models Lin Zhao et.al. 2403.16379 null
2024-03-23 Feature Manipulation for DDPM based Change Detection Zhenglin Li et.al. 2403.15943 null
2024-03-23 Cognitive resilience: Unraveling the proficiency of image-captioning models to interpret masked visual content Zhicheng Du et.al. 2403.15876 link
2024-03-22 DragAPart: Learning a Part-Level Motion Prior for Articulated Objects Ruining Li et.al. 2403.15382 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 link
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309 null
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152 null
2024-03-22 MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration Zhichao Wei et.al. 2403.15059 null
2024-03-22 Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning Bumsoo Kim et.al. 2403.15048 null
2024-03-22 Generative Active Learning for Image Synthesis Personalization Xulu Zhang et.al. 2403.14987 link
2024-03-22 CLIP-VQDiffusion: Language Free Training of Text To Image generation using CLIP and vector quantized diffusion model Seungdae Han et.al. 2403.14944 link
2024-03-22 Geometric Generative Models based on Morphological Equivariant PDEs and GANs El Hadji S. Diop et.al. 2403.14897 null
2024-03-21 Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing Alberto Baldrati et.al. 2403.14828 link
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602 null
2024-03-21 DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing Yueru Jia et.al. 2403.14487 link
2024-03-22 AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks Max Ku et.al. 2403.14468 null
2024-03-21 Analysing Diffusion Segmentation for Medical Images Mathias Öttl et.al. 2403.14440 null
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429 null
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291 link
2024-03-21 Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations Xun Lin et.al. 2403.14250 null
2024-03-21 StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN Jongwoo Choi et.al. 2403.14186 link
2024-03-21 QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping Zhuang Xiong et.al. 2403.14070 null
2024-03-21 LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models Hantao Zhang et.al. 2403.14066 link
2024-03-20 Learning from Models and Data for Visual Grounding Ruozhen He et.al. 2403.13804 null
2024-03-20 Step-Calibrated Diffusion for Biomedical Optical Image Restoration Yiwei Lyu et.al. 2403.13680 link
2024-03-20 ReGround: Improving Textual and Spatial Grounding at No Cost Yuseung Lee et.al. 2403.13589 null
2024-03-20 Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing Hangeol Chang et.al. 2403.13551 link
2024-03-20 Diversity-aware Channel Pruning for StyleGAN Compression Jiwoo Chung et.al. 2403.13548 link
2024-03-21 IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Siying Cui et.al. 2403.13535 null
2024-03-20 Deepfake Detection without Deepfakes: Generalization via Synthetic Frequency Patterns Injection Davide Alessandro Coccomini et.al. 2403.13479 null
2024-03-20 S2DM: Sector-Shaped Diffusion Models for Video Generation Haoran Lang et.al. 2403.13408 null
2024-03-20 IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis Feng Liu et.al. 2403.13378 link
2024-03-20 AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Jingkun An et.al. 2403.13352 null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963 link
2024-03-19 Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties Efrain Torres-Lomas et.al. 2403.12935 null
2024-03-19 You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs Yihong Luo et.al. 2403.12931 link
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915 link
2024-03-19 Generative Enhancement for 3D Medical Images Lingting Zhu et.al. 2403.12852 link
2024-03-19 How Spammers and Scammers Leverage AI-Generated Images on Facebook for Audience Growth Renee DiResta et.al. 2403.12838 null
2024-03-19 Total Disentanglement of Font Images into Style and Character Class Features Daichi Haraguchi et.al. 2403.12784 null
2024-03-19 Towards Controllable Face Generation with Semantic Latent Diffusion Models Alex Ergasti et.al. 2403.12743 link
2024-03-19 Tuning-Free Image Customization with Image and Text Guidance Pengzhi Li et.al. 2403.12658 null
2024-03-19 LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing Yazeed Alharbi et.al. 2403.12585 null
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667 null
2024-03-18 QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation Zhizhen Zhou et.al. 2403.11626 null
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614 link
2024-03-18 EffiVED: Efficient Video Editing via Text-instruction Diffusion Models Zhenghao Zhang et.al. 2403.11568 link
2024-03-18 Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors Ruicheng Wang et.al. 2403.11503 null
2024-03-18 DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation Jeongsol Kim et.al. 2403.11415 null
2024-03-17 StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining Tushar Kataria et.al. 2403.11340 null
2024-03-17 Fast Personalized Text-to-Image Syntheses With Attention Injection Yuxuan Zhang et.al. 2403.11284 null
2024-03-17 Understanding Diffusion Models by Feynman's Path Integral Yuji Hirono et.al. 2403.11262 null
2024-03-15 Denoising Task Difficulty-based Curriculum for Training Diffusion Models Jin-Young Kim et.al. 2403.10348 null
2024-03-15 Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder Jinseok Kim et.al. 2403.10255 null
2024-03-15 SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation Peng Zheng et.al. 2403.10166 null
2024-03-15 E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance Tianrui Huang et.al. 2403.10133 null
2024-03-15 Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling Baoquan Zhang et.al. 2403.10071 null
2024-03-15 SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model Tao Wu et.al. 2403.10044 null
2024-03-15 ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images Xiangtian Xue et.al. 2403.10004 null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638 null
2024-03-14 Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Zeyu Liu et.al. 2403.09622 null
2024-03-14 PrompTHis: Visualizing the Process and Influence of Prompt Editing during Text-to-Image Creation Yuhan Guo et.al. 2403.09615 null
2024-03-14 Counterfactual contrastive learning: robust representations via causal image synthesis Melanie Roschewitz et.al. 2403.09605 link
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468 link
2024-03-14 Mitigating attribute amplification in counterfactual image generation Tian Xia et.al. 2403.09422 null
2024-03-14 Machine Learning Processes as Sources of Ambiguity: Insights from AI Art Christian Sivertsen et.al. 2403.09374 null
2024-03-14 Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction Hanyu Chen et.al. 2403.09355 null
2024-03-14 Video Editing via Factorized Diffusion Distillation Uriel Singer et.al. 2403.09334 null
2024-03-14 Noise Dimension of GAN: An Image Compression Perspective Ziran Zhu et.al. 2403.09196 null
2024-03-13 iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer Dinh-Khoi Vo et.al. 2403.08746 link
2024-03-13 HAIFIT: Human-Centered AI for Fashion Image Translation Jianan Jiang et.al. 2403.08651 link
2024-03-13 An Analysis of Human Alignment of Latent Diffusion Models Lorenz Linhardt et.al. 2403.08469 null
2024-03-13 Generating Synthetic Computed Tomography for Radiotherapy: SynthRAD2023 Challenge Report Evi M. C. Huijben et.al. 2403.08447 null
2024-03-13 Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification Shuhan Li et.al. 2403.08407 null
2024-03-13 Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation Tianyi Chu et.al. 2403.08294 null
2024-03-13 VIGFace: Virtual Identity Generation Model for Face Image Synthesis Minsoo Kim et.al. 2403.08277 null
2024-03-13 Make Me Happier: Evoking Emotions Through Image Diffusion Models Qing Lin et.al. 2403.08255 null
2024-03-12 Pix2Pix-OnTheFly: Leveraging LLMs for Instruction-Guided Image Editing Rodrigo Santos et.al. 2403.08004 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860 link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842 null
2024-03-12 BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives Ivo M. Baltruschat et.al. 2403.07800 null
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764 null
2024-03-12 Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings Sahand Sharifzadeh et.al. 2403.07750 null
2024-03-13 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721 link
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711 link
2024-03-12 Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation Michael Ogezi et.al. 2403.07605 null
2024-03-12 Block-wise LoRA: Revisiting Fine-grained LoRA for Effective Personalization and Stylization in Text-to-Image Generation Likun Li et.al. 2403.07500 null
2024-03-12 Backdoor Attack with Mode Mixture Latent Modification Hongwei Zhang et.al. 2403.07463 null
2024-03-11 Surface-aware Mesh Texture Synthesis with Pre-trained 2D CNNs Áron Samuel Kovács et.al. 2403.06855 null
2024-03-11 Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting Wenting Chen et.al. 2403.06835 null
2024-03-11 Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection Chuangchuang Tan et.al. 2403.06803 link
2024-03-11 FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation Pengchong Qiao et.al. 2403.06775 link
2024-03-11 Distribution-Aware Data Expansion with Diffusion Models Haowei Zhu et.al. 2403.06741 link
2024-03-11 Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback Adarsh N L et.al. 2403.06735 null
2024-03-11 FFAD: A Novel Metric for Assessing Generated Time Series Data Utilizing Fourier Transform and Auto-encoder Yang Chen et.al. 2403.06576 null
2024-03-11 Active Generation for Image Classification Tao Huang et.al. 2403.06517 null
2024-03-11 Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning Woojung Han et.al. 2403.06516 null
2024-03-11 3D-aware Image Generation and Editing with Multi-modal Conditions Bo Li et.al. 2403.06470 null
2024-03-08 Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola Yijiang Li et.al. 2403.05523 null
2024-03-08 A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN Cristiana Tiago et.al. 2403.05384 null
2024-03-08 Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation Juan I. Pisula et.al. 2403.05325 null
2024-03-08 Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Junyan Wang et.al. 2403.05239 null
2024-03-08 Synthetic Privileged Information Enhances Medical Image Representation Learning Lucas Farndale et.al. 2403.05220 null
2024-03-08 Denoising Autoregressive Representation Learning Yazhe Li et.al. 2403.05196 null
2024-03-08 ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Xiwei Hu et.al. 2403.05135 null
2024-03-08 Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation Joseph Cho et.al. 2403.05131 null
2024-03-08 Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis Muxi Chen et.al. 2403.05125 null
2024-03-08 CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion Wendi Zheng et.al. 2403.05121 null
2024-03-07 Photonic probabilistic machine learning using quantum vacuum noise Seou Choi et.al. 2403.04731 null
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692 null
2024-03-07 StableDrag: Stable Dragging for Point-based Image Editing Yutao Cui et.al. 2403.04437 null
2024-03-07 Discriminative Probing and Tuning for Text-to-Image Generation Leigang Qu et.al. 2403.04321 null
2024-03-06 PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement Zhijie Wang et.al. 2403.04014 link
2024-03-06 Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer Naifu Xue et.al. 2403.03736 null
2024-03-06 Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving He Li et.al. 2403.03541 null
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485 link
2024-03-06 FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion Hao Wang et.al. 2403.03463 null
2024-03-07 DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network Xiangquan Gui et.al. 2403.03456 null
2024-03-06 Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing Bingyan Liu et.al. 2403.03431 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206 null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181 link
2024-03-05 Doubly Abductive Counterfactual Inference for Text-based Image Editing Xue Song et.al. 2403.02981 link
2024-03-05 Bias in Generative AI Mi Zhou et.al. 2403.02726 null
2024-03-04 Transformer for Times Series: an Application to the S&P500 Pierre Brugiere et.al. 2403.02523 null
2024-03-04 NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function Abdullah Nazhat Abdullah et.al. 2403.02411 link
2024-03-04 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Jiaxiang Cheng et.al. 2403.02084 link
2024-03-04 PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis Zhengyao Lv et.al. 2403.01852 link
2024-03-04 ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models Lukas Höllein et.al. 2403.01807 link
2024-03-05 AtomoVideo: High Fidelity Image-to-Video Generation Litong Gong et.al. 2403.01800 null
2024-03-02 Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models Neta Shaul et.al. 2403.01329 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212 null
2024-03-01 Improving Android Malware Detection Through Data Augmentation Using Wasserstein Generative Adversarial Networks Kawana Stalin et.al. 2403.00890 null
2024-03-01 Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks Yuhao Liu et.al. 2403.00644 null
2024-03-01 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Ander Salaberria et.al. 2403.00587 link
2024-03-01 Rethinking cluster-conditioned diffusion models Nikolas Adaloglou et.al. 2403.00570 null
2024-03-01 VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Xiangxiang Chu et.al. 2403.00522 link
2024-03-01 An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels Shumpei Takezaki et.al. 2403.00452 null
2024-03-01 LoMOE: Localized Multi-Object Editing via Multi-Diffusion Goirik Chakrabarty et.al. 2403.00437 null
2024-03-01 ChartReformer: Natural Language-Driven Chart Image Editing Pengyu Yan et.al. 2403.00209 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330 link
2024-02-29 Disentangling representations of retinal images with generative models Sarah Müller et.al. 2402.19186 null
2024-02-29 Trajectory Consistency Distillation Jianbin Zheng et.al. 2402.19159 link
2024-02-29 Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection Christos Koutlis et.al. 2402.19091 null
2024-02-29 WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis Paul Friedrich et.al. 2402.19043 link
2024-02-29 ViewFusion: Towards Multi-View Consistency via Interpolated Denoising Xianghui Yang et.al. 2402.18842 link
2024-02-29 A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D Xiaohan Fei et.al. 2402.18780 null
2024-02-28 FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes Ziying Pan et.al. 2402.18331 link
2024-02-28 Balancing Act: Distribution-Guided Debiasing in Diffusion Models Rishubh Parihar et.al. 2402.18206 null
2024-02-28 VulMCI: Code Splicing-based Pixel-row Oversampling for More Continuous Vulnerability Image Generation Tao Peng et.al. 2402.18189 link
2024-02-28 Block and Detail: Scaffolding Sketch-to-Image Generation Vishnu Sarukkai et.al. 2402.18116 null
2024-02-28 Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis Yanzuo Lu et.al. 2402.18078 link
2024-02-28 SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model Bin Cao et.al. 2402.18068 null
2024-02-27 CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing Chufeng Xiao et.al. 2402.17624 null
2024-02-27 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563 null
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525 link
2024-02-28 DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models Shyam Marjit et.al. 2402.17412 null
2024-02-27 Accelerating Diffusion Sampling with Optimized Time Steps Shuchen Xue et.al. 2402.17376 null
2024-02-27 One-Shot Structure-Aware Stylized Image Synthesis Hansam Cho et.al. 2402.17275 link
2024-02-27 Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation Daiqing Li et.al. 2402.17245 null
2024-02-27 Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System Majid Memari et.al. 2402.17204 null
2024-02-27 Transparent Image Layer Diffusion using Latent Transparency Lvmin Zhang et.al. 2402.17113 link
2024-02-27 T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality Susan Epstein et.al. 2402.17101 null
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506 link
2024-02-26 Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion Xuantong Liu et.al. 2402.16305 null
2024-02-25 Towards Efficient Quantum Hybrid Diffusion Models Francesca De Falco et.al. 2402.16147 null
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504 link
2024-02-23 BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators Yu Tian et.al. 2402.15218 null
2024-02-23 The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling Jiajun Ma et.al. 2402.15170 null
2024-02-22 LLMBind: A Unified Modality-Task Integration Framework Bin Zhu et.al. 2402.14891 null
2024-02-22 Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Willi Menapace et.al. 2402.14797 null
2024-02-22 Consolidating Attention Features for Multi-view Image Editing Or Patashnik et.al. 2402.14792 null
2024-02-25 Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening Zhenrong Shen et.al. 2402.14707 null
2024-02-22 Visual Hallucinations of Multi-modal Large Language Models Wen Huang et.al. 2402.14683 link
2024-02-22 Semantic Image Synthesis with Unconditional Generator Jungwoo Chae et.al. 2402.14395 null
2024-02-22 MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion Xin-Yang Zheng et.al. 2402.14253 null
2024-02-21 T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching Zizheng Pan et.al. 2402.14167 link
2024-02-21 SDXL-Lightning: Progressive Adversarial Diffusion Distillation Shanchuan Lin et.al. 2402.13929 null
2024-02-21 SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model Xudong Ling et.al. 2402.13737 null
2024-02-21 Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving Mehdi Azarafza et.al. 2402.13602 link
2024-02-21 Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models Chen Wu et.al. 2402.13490 null
2024-02-20 Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control Denis Lukovnikov et.al. 2402.13404 null
2024-02-20 CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples Jianrui Zhang et.al. 2402.13254 link
2024-02-20 UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing Jianhong Bai et.al. 2402.13185 null
2024-02-21 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974 link
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908 link
2024-02-20 Two-stage Rainfall-Forecasting Diffusion Model XuDong Ling et.al. 2402.12779 link
2024-02-20 A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis Nailei Hei et.al. 2402.12760 link
2024-02-20 MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion Sen Li et.al. 2402.12741 link
2024-02-20 MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction Shitao Tang et.al. 2402.12712 null
2024-02-20 Robust-Wide: Robust Watermarking against Instruction-driven Image Editing Runyi Hu et.al. 2402.12688 null
2024-02-19 The (R)Evolution of Multimodal Large Language Models: A Survey Davide Caffagni et.al. 2402.12451 null
2024-02-19 Revisiting registration-based synthesis: A focus on unsupervised MR image synthesis Savannah P. Hays et.al. 2402.12288 null
2024-02-19 Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability Xuelin Qian et.al. 2402.12225 null
2024-02-19 Groot: Adversarial Testing for Generative Text-to-Image Models with Tree-based Semantic Transformation Yi Liu et.al. 2402.12100 null
2024-02-19 DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Chong Zeng et.al. 2402.11929 null
2024-02-18 SDiT: Spiking Diffusion Model with Transformer Shu Yang et.al. 2402.11588 null
2024-02-18 Visual Concept-driven Image Generation with Text-to-Image Diffusion Model Tanzila Rahman et.al. 2402.11487 null
2024-02-18 Deep learning methods for Hamiltonian parameter estimation and magnetic domain image generation in twisted van der Waals magnets Woo Seok Lee et.al. 2402.11434 null
2024-02-17 TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method Chenyan Zhang et.al. 2402.11274 link
2024-02-16 The Male CEO and the Female Assistant: Probing Gender Biases in Text-To-Image Models Through Paired Stereotype Test Yixin Wan et.al. 2402.11089 null
2024-02-16 Universal Prompt Optimizer for Safe Text-to-Image Generation Zongyu Wu et.al. 2402.10882 link
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821 link
2024-02-16 Exploring Precision and Recall to assess the quality and diversity of LLMs Le Bronnec Florian et.al. 2402.10693 null
2024-02-16 UMAIR-FPS: User-aware Multi-modal Animation Illustration Recommendation Fusion with Painting Style Yan Kang et.al. 2402.10381 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210 null
2024-02-15 Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning Euclid Collaboration et.al. 2402.10187 link
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095 null
2024-02-15 Accelerating Parallel Sampling of Diffusion Models Zhiwei Tang et.al. 2402.09970 link
2024-02-15 Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation Junjie Shentu et.al. 2402.09966 link
2024-02-15 Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community Arman Isajanyan et.al. 2402.09872 link
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368 link
2024-02-14 Switch EMA: A Free Lunch for Better Flatness and Sharpness Siyuan Li et.al. 2402.09240 link
2024-02-14 L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Yutaro Yamada et.al. 2402.09052 null
2024-02-14 Multi-modality transrectal ultrasound video classification for identification of clinically significant prostate cancer Hong Wu et.al. 2402.08987 link
2024-02-13 Towards the Detection of AI-Synthesized Human Face Images Yuhang Lu et.al. 2402.08750 null
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654 null
2024-02-13 Captions Are Worth a Thousand Words: Enhancing Product Retrieval with Pretrained Image-to-Text Models Jason Tang et.al. 2402.08532 null
2024-02-12 Using AI for Wavefront Estimation with the Rubin Observatory Active Optics System John Franklin Crenshaw et.al. 2402.08094 null
2024-02-14 Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions Frank Cole et.al. 2402.08082 null
2024-02-12 Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback Cansu Korkmaz et.al. 2402.07597 null
2024-02-12 Discovering Universal Semantic Triggers for Text-to-Image Synthesis Shengfang Zhai et.al. 2402.07562 null
2024-02-11 The Aleph & Other Metaphors for Image Generation Gonzalo Ramos et.al. 2402.07104 null
2024-02-10 Disentangled Latent Energy-Based Style Translation: An Image-Level Structural MRI Harmonization Framework Mengqi Wu et.al. 2402.06875 null
2024-02-09 Cardiac ultrasound simulation for autonomous ultrasound navigation Abdoul Aziz Amadou et.al. 2402.06463 null
2024-02-08 Collaborative Control for Geometry-Conditioned PBR Image Generation Shimon Vainer et.al. 2402.05919 null
2024-02-08 CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes Yi-Ting Pan et.al. 2402.05728 null
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608 link
2024-02-08 Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application Bumsoo Kim et.al. 2402.05448 null
2024-02-08 Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal Transport Jaemoo Choi et.al. 2402.05443 null
2024-02-08 MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis Dewei Zhou et.al. 2402.05408 link
2024-02-07 Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Nicholas Konz et.al. 2402.05210 link
2024-02-07 ChatScratch: An AI-Augmented System Toward Autonomous Visual Programming Learning for Children Aged 6-12 Liuqing Chen et.al. 2402.04975 null
2024-02-07 Noise Map Guidance: Inversion with Spatial Context for Real Image Editing Hansam Cho et.al. 2402.04625 link
2024-02-07 Text2Street: Controllable Text-to-image Generation for Street Views Jinming Su et.al. 2402.04504 null
2024-02-07 ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation Jirayu Burapacheep et.al. 2402.04492 link
2024-02-06 FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution Qi Zhou et.al. 2402.03705 null
2024-02-06 QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning Haoxuan Wang et.al. 2402.03666 link
2024-02-05 Assessing the Efficacy of Invisible Watermarks in AI-Generated Medical Images Xiaodan Xing et.al. 2402.03473 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290 link
2024-02-05 Training-Free Consistent Text-to-Image Generation Yoad Tewel et.al. 2402.03286 null
2024-02-05 IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images Vincent Roca et.al. 2402.03227 link
2024-02-05 InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions Yiyuan Zhang et.al. 2402.03040 link
2024-02-05 SynthVision -- Harnessing Minimal Input for Maximal Output in Computer Vision Models using Synthetic Image data Yudara Kularathne et.al. 2402.02826 null
2024-02-04 DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing Chong Mou et.al. 2402.02583 link
2024-02-04 M$^3$Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing Mohammadreza Mofayezi et.al. 2402.02369 null
2024-02-03 Diffusion Cross-domain Recommendation Yuner Xuan et.al. 2402.02182 link
2024-02-03 S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation Yurui Chen et.al. 2402.02112 null
2024-02-02 The galactic bubbles of starburst galaxies: The influence of galactic large-scale magnetic fields Z. Meliani et.al. 2402.01541 null
2024-02-02 Cross-view Masked Diffusion Transformers for Person Image Synthesis Trung X. Pham et.al. 2402.01516 link
2024-02-02 Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors Dingcheng Yang et.al. 2402.01369 link
2024-02-02 Can MLLMs Perform Text-to-Image In-Context Learning? Yuchen Zeng et.al. 2402.01293 link
2024-02-02 Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion? Cristian Sbrolli et.al. 2402.01241 null
2024-02-01 Unconditional Latent Diffusion Models Memorize Patient Imaging Data Salman Ul Hassan Dar et.al. 2402.01054 null
2024-02-01 AI-generated faces free from racial and gender stereotypes Nouar AlDahoul et.al. 2402.01002 link
2024-02-01 Examining the Influence of Digital Phantom Models in Virtual Imaging Trials for Tomographic Breast Imaging Amar Kavuri et.al. 2402.00812 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769 link
2024-01-31 SeFi-IDE: Semantic-Fidelity Identity Embedding for Personalized Diffusion-Based Generation Yang Li et.al. 2402.00631 null
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627 link
2024-02-01 Masked Conditional Diffusion Model for Enhancing Deepfake Detection Tiewen Chen et.al. 2402.00541 null
2024-02-01 High-Quality Medical Image Generation from Free-hand Sketch Quan Huu Cap et.al. 2402.00353 null
2024-02-01 Machine Unlearning for Image-to-Image Generative Models Guihong Li et.al. 2402.00351 link
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085 null
2024-01-31 Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation Yuanhuiyi Lyu et.al. 2401.17664 null
2024-01-31 Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model Yafei Dong et.al. 2401.17593 null
2024-01-31 Task-Oriented Diffusion Model Compression Geonung Kim et.al. 2401.17547 null
2024-01-31 Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels Negar Arabzadeh et.al. 2401.17543 null
2024-01-30 OmniSCV: An Omnidirectional Synthetic Image Generator for Computer Vision Bruno Berenguel-Baeta et.al. 2401.17061 link
2024-01-30 Repositioning the Subject within Image Yikai Wang et.al. 2401.16861 link
2024-01-30 X-ray Image Generation as a Method of Performance Prediction for Real-Time Inspection: a Case Study Vladyslav Andriiashen et.al. 2401.16847 link
2024-01-30 LATENTPATCH: A Non-Parametric Approach for Face Generation and Editing Benjamin Samuth et.al. 2401.16830 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459 null
2024-01-29 Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models Zhongjie Duan et.al. 2401.16224 null
2024-01-29 Spatial-Aware Latent Initialization for Controllable Image Generation Wenqiang Sun et.al. 2401.16157 null
2024-01-29 Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You Felix Friedrich et.al. 2401.16092 link
2024-01-29 Diffusion Facial Forgery Detection Harry Cheng et.al. 2401.15859 link
2024-01-29 2L3: Lifting Imperfect Generated 2D Images into Accurate 3D Yizheng Chen et.al. 2401.15841 null

Video Generation

Publish Date Title Authors PDF Code
2024-06-10 NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing Ting-Hsuan Chen et.al. 2406.06523 null
2024-06-10 FRAG: Frequency Adapting Group for Diffusion Video Editing Sunjae Yoon et.al. 2406.06044 null
2024-06-09 Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion Ge Ya Luo et.al. 2406.05630 null
2024-06-08 Training-Free Robust Interactive Video Object Segmentation Xiaoli Wei et.al. 2406.05485 null
2024-06-08 MotionClone: Training-Free Motion Cloning for Controllable Video Generation Pengyang Ling et.al. 2406.05338 null
2024-06-07 CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion Xingrui Wang et.al. 2406.05082 null
2024-06-07 Zero-Shot Video Editing through Adaptive Sliding Score Distillation Lianghan Zhu et.al. 2406.04888 null
2024-06-07 Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior Tanvir Mahmud et.al. 2406.04873 null
2024-06-07 Online Continual Learning of Video Diffusion Models From a Single Video Stream Jason Yoo et.al. 2406.04814 null
2024-06-06 GenAI Arena: An Open Evaluation Platform for Generative Models Dongfu Jiang et.al. 2406.04485 null
2024-06-06 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Lin Chen et.al. 2406.04325 null
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324 null
2024-06-06 VideoTetris: Towards Compositional Text-to-Video Generation Ye Tian et.al. 2406.04277 link
2024-06-05 VideoPhy: Evaluating Physical Commonsense for Video Generation Hritik Bansal et.al. 2406.03520 null
2024-06-05 Searching Priors Makes Text-to-Video Synthesis Better Haoran Cheng et.al. 2406.03215 null
2024-06-05 Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control Jingyun Xue et.al. 2406.03035 null
2024-06-05 Controllable Talking Face Generation by Implicit Facial Keypoints Editing Dong Zhao et.al. 2406.02880 null
2024-06-06 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Tianchen Zhao et.al. 2406.02540 null
2024-06-04 V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation Cong Wang et.al. 2406.02511 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-06-04 I4VGen: Image as Stepping Stone for Text-to-Video Generation Xiefan Guo et.al. 2406.02230 null
2024-06-04 Learning Temporally Consistent Video Depth from Video Diffusion Priors Jiahao Shao et.al. 2406.01493 null
2024-06-03 DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors Tianyu Huang et.al. 2406.01476 link
2024-06-04 Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation Enhui Ma et.al. 2406.01349 null
2024-06-03 UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation Xiang Wang et.al. 2406.01188 null
2024-06-03 ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation Shaoshu Yang et.al. 2406.00908 link
2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation Haiyu Zhang et.al. 2405.20674 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325 link
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320 link
2024-05-30 CV-VAE: A Compatible Video VAE for Latent Generative Video Models Sijie Zhao et.al. 2405.20279 link
2024-06-02 MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model Muyao Niu et.al. 2405.20222 link
2024-05-30 Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion Jiangkai Wu et.al. 2405.20032 null
2024-05-30 Streaming Video Diffusion: Online Video Editing with Diffusion Models Feng Chen et.al. 2405.19726 link
2024-05-30 DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark Haoxing Chen et.al. 2405.19707 link
2024-05-29 EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture Jiaqi Xu et.al. 2405.18991 link
2024-05-29 T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback Jiachen Li et.al. 2405.18750 null
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407 null
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406 link
2024-05-28 VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers Jun Zheng et.al. 2405.18326 null
2024-05-28 EG4D: Explicit Generation of 4D Object without Score Distillation Qi Sun et.al. 2405.18132 link
2024-05-28 MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling Bowen Zhang et.al. 2405.18003 link
2024-05-28 Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation Akio Hayakawa et.al. 2405.17842 null
2024-05-27 RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance Jiaojiao Fan et.al. 2405.17661 null
2024-05-27 ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance Jiannan Huang et.al. 2405.17532 link
2024-05-27 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control Zhengfei Kuang et.al. 2405.17414 null
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405 null
2024-05-27 Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability Shenyuan Gao et.al. 2405.17398 link
2024-05-28 Controllable Longer Image Animation with Diffusion Models Qiang Wang et.al. 2405.17306 null
2024-05-27 Sync4D: Video Guided Controllable Dynamics for Physics-Based 4D Generation Zhoujie Fu et.al. 2405.16849 null
2024-05-27 Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection Gihyun Kwon et.al. 2405.16823 null
2024-05-27 Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels Yikai Wang et.al. 2405.16822 null
2024-05-26 Towards Multi-Task Multi-Modal Models: A Video Generative Perspective Lijun Yu et.al. 2405.16728 null
2024-05-26 I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models Wenqi Ouyang et.al. 2405.16537 null
2024-05-28 Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation Jinlin Liu et.al. 2405.16393 null
2024-05-24 A Misleading Gallery of Fluid Motion by Generative Artificial Intelligence Ali Kashefi et.al. 2405.15406 link
2024-05-24 iVideoGPT: Interactive VideoGPTs are Scalable World Models Jialong Wu et.al. 2405.15223 link
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864 null
2024-05-24 Fisher Flow Matching for Generative Modeling over Discrete Data Oscar Davis et.al. 2405.14664 null
2024-05-24 PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control Yong Zhong et.al. 2405.14582 null
2024-05-23 MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes Ruiyuan Gao et.al. 2405.14475 null
2024-05-22 ReVideo: Remake a Video with Motion and Content Control Chong Mou et.al. 2405.13865 null
2024-05-22 MotionCraft: Physics-based Zero-Shot Video Generation Luca Savant Aira et.al. 2405.13557 null
2024-05-22 Enhanced Creativity and Ideation through Stable Video Synthesis Elijah Miller et.al. 2405.13357 null
2024-05-21 CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers Andrew Marmon et.al. 2405.13195 null
2024-05-21 OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models Zhaojian Yu et.al. 2405.12843 link
2024-05-21 DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control Hong Chen et.al. 2405.12796 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211 link
2024-05-20 ViViD: Video Virtual Try-on using Diffusion Models Zixun Fang et.al. 2405.11794 null
2024-05-19 FIFO-Diffusion: Generating Infinite Videos from Text without Training Jihwan Kim et.al. 2405.11473 link
2024-05-17 From Sora What We Can See: A Survey of Text-to-Video Generation Rui Sun et.al. 2405.10674 link
2024-05-16 MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis Joseph Cho et.al. 2405.09806 null
2024-05-15 Dance Any Beat: Blending Beats with Visuals in Dance Video Generation Xuanchen Wang et.al. 2405.09266 null
2024-05-13 The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective Andrew Shin et.al. 2405.08720 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547 link
2024-05-08 Reviewing Intelligent Cinematography: AI research for camera-based video production Adrian Azzarelli et.al. 2405.05039 null
2024-05-15 TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation Hritik Bansal et.al. 2405.04682 null
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496 null
2024-05-07 Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation Dogucan Yaman et.al. 2405.04327 null
2024-05-07 Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models Fan Bao et.al. 2405.04233 null
2024-05-07 Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models Zhixuan Chu et.al. 2405.04180 link
2024-05-07 Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method Peisong He et.al. 2405.04133 null
2024-05-06 Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond Zheng Zhu et.al. 2405.03520 link
2024-05-06 Video Diffusion Models: A Survey Andrew Melnik et.al. 2405.03150 null
2024-05-10 Matten: Video Generation with Mamba-Attention Yu Gao et.al. 2405.03025 null
2024-05-02 StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Yupeng Zhou et.al. 2405.01434 link
2024-05-05 VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization Yuliang Liu et.al. 2404.19652 link
2024-04-30 Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model Wentao Lei et.al. 2404.19277 null
2024-04-29 FlexiFilm: Long Video Generation with Flexible Conditions Yichen Ouyang et.al. 2404.18620 link
2024-04-25 Synthesizing Audio from Silent Video using Sequence to Sequence Modeling Hugo Garrido-Lestache Belinchon et.al. 2404.17608 link
2024-04-25 V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection Xuanyu Zhang et.al. 2404.16824 null
2024-04-25 TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models Haomiao Ni et.al. 2404.16306 null
2024-04-26 Semantically consistent Video-to-Audio Generation using Multimodal Language Large Model Gehui Chen et.al. 2404.16305 null
2024-04-24 Beyond Deepfake Images: Detecting AI-Generated Videos Danial Samadi Vahdati et.al. 2404.15955 null
2024-05-01 MotionMaster: Training-free Camera Motion Transfer For Video Generation Teng Hu et.al. 2404.15789 null
2024-04-23 ID-Animator: Zero-Shot Identity-Preserving Human Video Generation Xuanhua He et.al. 2404.15275 link
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381 link
2024-04-23 Accelerating Image Generation with Sub-path Linear Approximation Model Chen Xu et.al. 2404.13903 null
2024-04-27 Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap Bowen Qu et.al. 2404.13573 link
2024-04-21 Motion-aware Latent Diffusion Models for Video Frame Interpolation Zhilin Huang et.al. 2404.13534 null
2024-04-20 Music Consistency Models Zhengcong Fei et.al. 2404.13358 null
2024-04-19 PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation Tianyuan Zhang et.al. 2404.13026 null
2024-04-19 ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model Dingming Liu et.al. 2404.12903 null
2024-04-18 GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models Sai Sree Harsha et.al. 2404.12541 null
2024-04-18 On the Content Bias in Fréchet Video Distance Songwei Ge et.al. 2404.12391 null
2024-04-18 RoboDreamer: Learning Compositional World Models for Robot Imagination Siyuan Zhou et.al. 2404.12377 null
2024-04-18 AniClipart: Clipart Animation with Text-to-Video Priors Ronghuan Wu et.al. 2404.12347 null
2024-04-15 Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model Han Lin et.al. 2404.09967 null
2024-04-16 LoopAnimate: Loopable Salient Object Animation Fanyi Wang et.al. 2404.09172 null
2024-04-13 THQA: A Perceptual Quality Assessment Database for Talking Heads Yingjie Zhou et.al. 2404.09003 link
2024-04-16 LoopGaussian: Creating 3D Cinemagraph with Multi-view Images via Eulerian Motion Field Jiyang Li et.al. 2404.08966 link
2024-04-11 S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing Guangzhi Wang et.al. 2404.08111 null
2024-04-10 A Transformer-Based Model for the Prediction of Human Gaze Behavior on Videos Suleyman Ozdel et.al. 2404.07351 null
2024-04-16 Deep Generative Data Assimilation in Multimodal Setting Yongquan Qu et.al. 2404.06665 link
2024-04-10 Flying with Photons: Rendering Novel Views of Propagating Light Anagh Malik et.al. 2404.06493 null
2024-04-08 Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models Saman Motamed et.al. 2404.05519 null
2024-04-08 Action-conditioned video data improves predictability Meenakshi Sarkar et.al. 2404.05439 null
2024-04-07 MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators Shenghai Yuan et.al. 2404.05014 link
2024-04-07 AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment Yuanfeng Xu et.al. 2404.04946 null
2024-04-03 Translation-based Video-to-Video Synthesis Pratim Saha et.al. 2404.04283 null
2024-04-03 MeshBrush: Painting the Anatomical Mesh with Neural Stylization for Endoscopy John J. Han et.al. 2404.02999 null
2024-04-02 CameraCtrl: Enabling Camera Control for Text-to-Video Generation Hao He et.al. 2404.02101 link
2024-04-02 Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model Xu He et.al. 2404.01862 link
2024-03-28 A Review of Multi-Modal Large Language and Vision Models Kilian Carolan et.al. 2404.01322 null
2024-04-01 Evaluating Text-to-Visual Generation with Image-to-Text Generation Zhiqiu Lin et.al. 2404.01291 link
2024-03-30 Grid Diffusion Models for Text-to-Video Generation Taegyeong Lee et.al. 2404.00234 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193 null
2024-04-03 MI-NeRF: Learning a Single Face NeRF from Multiple Identities Aggelina Chatziagapi et.al. 2403.19920 null
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593 null
2024-03-27 TextCraftor: Your Text Encoder Can be Image Quality Controller Yanyu Li et.al. 2403.18978 null
2024-03-26 Tutorial on Diffusion Models for Imaging and Vision Stanley H. Chan et.al. 2403.18103 null
2024-03-26 TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani et.al. 2403.17920 null
2024-03-26 Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields Rüveyda Yilmaz et.al. 2403.17808 null
2024-03-26 ExpressEdit: Video Editing with Natural Language and Sketching Bekzat Tilekbay et.al. 2403.17693 null
2024-03-25 TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models Zhongwei Zhang et.al. 2403.17005 null
2024-03-25 A Survey on Long Video Generation: Challenges, Methods, and Prospects Chengxuan Li et.al. 2403.16407 null
2024-03-24 Opportunities and challenges in the application of large artificial intelligence models in radiology Liangrui Pan et.al. 2403.16112 null
2024-03-24 EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing Xiangpeng Yang et.al. 2403.16111 null
2024-03-24 Edit3K: Universal Representation Learning for Video Editing Components Xin Gu et.al. 2403.16048 null
2024-03-23 Adaptive Super Resolution For One-Shot Talking-Head Generation Luchuan Song et.al. 2403.15944 link
2024-03-22 Spectral Motion Alignment for Video Motion Transfer using Diffusion Models Geon Yeong Park et.al. 2403.15249 null
2024-03-21 StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text Roberto Henschel et.al. 2403.14773 link
2024-03-22 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617 null
2024-03-21 Explorative Inbetweening of Time and Space Haiwen Feng et.al. 2403.14611 null
2024-03-22 AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks Max Ku et.al. 2403.14468 null
2024-03-21 Enabling Visual Composition and Animation in Unsupervised Video Generation Aram Davtyan et.al. 2403.14368 null
2024-03-21 StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN Jongwoo Choi et.al. 2403.14186 link
2024-03-21 Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition Sihyun Yu et.al. 2403.14148 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745 link
2024-03-20 VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis Yumeng Li et.al. 2403.13501 link
2024-03-20 S2DM: Sector-Shaped Diffusion Models for Video Generation Haoran Lang et.al. 2403.13408 null
2024-03-20 Mora: Enabling Generalist Video Generation via A Multi-Agent Framework Zhengqing Yuan et.al. 2403.13248 link
2024-03-19 AnimateDiff-Lightning: Cross-Model Diffusion Distillation Shanchuan Lin et.al. 2403.12706 null
2024-03-18 CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility Bojia Zi et.al. 2403.12035 null
2024-03-18 Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation Axel Sauer et.al. 2403.12015 null
2024-03-18 VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model Qi Zuo et.al. 2403.12010 null
2024-03-18 DreamMotion: Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing Hyeonho Jeong et.al. 2403.12002 null
2024-03-19 Subjective-Aligned Dateset and Metric for Text-to-Video Quality Assessment Tengchuan Kou et.al. 2403.11956 link
2024-03-18 Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing Juan Zhang et.al. 2403.11700 null
2024-03-18 EffiVED:Efficient Video Editing via Text-instruction Diffusion Models Zhenghao Zhang et.al. 2403.11568 link
2024-03-17 Endora: Video Generation Models as Endoscopy Simulators Chenxin Li et.al. 2403.11050 null
2024-03-15 DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers Xuanlei Zhao et.al. 2403.10266 link
2024-03-15 Animate Your Motion: Turning Still Images into Dynamic Videos Mingxiao Li et.al. 2403.10179 null
2024-03-14 Video Editing via Factorized Diffusion Distillation Uriel Singer et.al. 2403.09334 null
2024-03-17 Intention-driven Ego-to-Exo Video Generation Hongchen Luo et.al. 2403.09194 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764 null
2024-03-13 Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts Yue Ma et.al. 2403.08268 link
2024-03-12 AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production Jiuniu Wang et.al. 2403.07952 null
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711 link
2024-03-15 DragAnything: Motion Control for Anything using Entity Representation Weijia Wu et.al. 2403.07420 link
2024-03-11 Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions Lan Wang et.al. 2403.07198 null
2024-03-11 DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation Guosheng Zhao et.al. 2403.06845 null
2024-03-11 A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos Weixia Zhang et.al. 2403.06421 link
2024-03-11 Video Generation with Consistency Tuning Chaoyi Wang et.al. 2403.06356 null
2024-03-10 FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing Youyuan Zhang et.al. 2403.06269 null
2024-03-10 BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering Xinmin Qiu et.al. 2403.06243 null
2024-03-10 VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models Wenhao Wang et.al. 2403.06098 link
2024-03-10 Reframe Anything: LLM Agent for Open World Video Reframing Jiawang Cao et.al. 2403.06070 null
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438 link
2024-03-08 Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation Joseph Cho et.al. 2403.05131 null
2024-03-07 A spatiotemporal style transfer algorithm for dynamic visual stimulus generation Antonino Greco et.al. 2403.04940 null
2024-03-08 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634 null
2024-03-05 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation Weijie Li et.al. 2403.02827 null
2024-03-06 UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control Xuweiyi Chen et.al. 2403.02332 link
2024-03-05 AtomoVideo: High Fidelity Image-to-Video Generation Litong Gong et.al. 2403.01800 null
2024-03-02 SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code Ziniu Hu et.al. 2403.01248 null
2024-03-01 Abductive Ego-View Accident Video Understanding for Safe Driving Perception Jianwu Fang et.al. 2403.00436 null
2024-02-29 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Tsai-Shien Chen et.al. 2402.19479 null
2024-02-28 Context-aware Talking Face Video Generation Meidai Xuanyuan et.al. 2402.18092 null
2024-02-27 EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485 null
2024-02-27 Sora Generates Videos with Stunning Geometrical Consistency Xuanyi Li et.al. 2402.17403 null
2024-02-28 Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Yixin Liu et.al. 2402.17177 link
2024-02-27 Video as the New Language for Real-World Decision Making Sherry Yang et.al. 2402.17139 null
2024-03-04 Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing Ling Yang et.al. 2402.16627 link
2024-02-22 Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Willi Menapace et.al. 2402.14797 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780 null
2024-02-22 Place Anything into Any Video Ziling Liu et.al. 2402.14316 null
2024-02-21 Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation Kihong Kim et.al. 2402.13729 null
2024-02-24 UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing Jianhong Bai et.al. 2402.13185 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144 link
2024-02-20 VGMShield: Mitigating Misuse of Video Generative Models Yan Pang et.al. 2402.13126 link
2024-02-19 Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same Sungjun Ahn et.al. 2402.12412 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099 null
2024-02-16 Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation Lanqing Guo et.al. 2402.10491 link
2024-02-15 LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing Bryan Wang et.al. 2402.10294 null
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368 link
2024-02-10 Denoising Diffusion Probabilistic Models in Six Simple Steps Richard E. Turner et.al. 2402.04384 null
2024-02-06 ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation Weiming Ren et.al. 2402.04324 link
2024-02-05 Projected Generative Diffusion Models for Constraint Satisfaction Jacob K Christopher et.al. 2402.03559 null
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162 null
2024-02-05 InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions Yiyuan Zhang et.al. 2402.03040 link
2024-02-04 Video Editing for Video Retrieval Bin Zhu et.al. 2402.02335 null
2024-02-06 DeCoF: Generated Video Detection via Frame Consistency Long Ma et.al. 2402.02085 null
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769 link
2024-02-01 DRSM: efficient neural 4d decomposition for dynamic reconstruction in stationary monocular cameras Weixing Xie et.al. 2402.00740 null
2024-01-30 Anything in Any Scene: Photorealistic Video Object Insertion Chen Bai et.al. 2401.17509 null
2024-01-31 Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling Xiaoyu Shi et.al. 2401.15977 null
2024-01-28 Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes Weifeng Liu et.al. 2401.15668 link
2024-01-29 Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval Dezhao Luo et.al. 2401.13329 null
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945 null
2024-01-19 Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion Zuoyue Li et.al. 2401.10786 null
2024-01-18 Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution Xin Yuan et.al. 2401.10404 null
2024-01-22 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150 null
2024-01-18 WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens Xiaofeng Wang et.al. 2401.09985 null
2024-01-18 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects Zhao Wang et.al. 2401.09962 null
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414 link

(back to top)

cv-arxiv-daily's People

Contributors

vincentqyw, asdasdad738

