Git Product home page Git Product logo

rl-arxiv-daily's Introduction

Contributors Forks Stargazers Issues

Updated on 2024.07.16

Usage instructions: here

Table of Contents
  1. RL
  2. SLAM
  3. NeRF

RL

Publish Date Title Authors PDF Code
2024-07-15 Walking the Values in Bayesian Inverse Reinforcement Learning Ondrej Bajgar et.al. 2407.10971 null
2024-07-15 BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning Haohong Lin et.al. 2407.10967 null
2024-07-15 Hedging Beyond the Mean: A Distributional Reinforcement Learning Perspective for Hedging Portfolios with Structured Products Anil Sharma et.al. 2407.10903 null
2024-07-15 Offline Reinforcement Learning with Imputed Rewards Carlo Romeo et.al. 2407.10839 null
2024-07-15 Exploration in Knowledge Transfer Utilizing Reinforcement Learning Adam Jedlička et.al. 2407.10835 null
2024-07-15 GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents Haoyuan Jiang et.al. 2407.10811 null
2024-07-15 Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning Alessandro Montenegro et.al. 2407.10775 null
2024-07-15 Balancing the Scales: Reinforcement Learning for Fair Classification Leon Eshuijs et.al. 2407.10629 null
2024-07-15 Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena Haipeng Luo et.al. 2407.10627 null
2024-07-15 Three Dogmas of Reinforcement Learning David Abel et.al. 2407.10583 null
2024-07-12 Learning Coordinated Maneuver in Adversarial Environments Zechen Hu et.al. 2407.09469 null
2024-07-12 ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts Amelia F. Hardy et.al. 2407.09447 null
2024-07-12 A Benchmark Environment for Offline Reinforcement Learning in Racing Games Girolamo Macaluso et.al. 2407.09415 link
2024-07-12 Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments Zoya Volovikova et.al. 2407.09287 null
2024-07-12 GNN with Model-based RL for Multi-agent Systems Hanxiao Chen et.al. 2407.09249 null
2024-07-12 Constrained Intrinsic Motivation for Reinforcement Learning Xiang Zheng et.al. 2407.09247 null
2024-07-12 Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network Shun Kotoku et.al. 2407.09124 null
2024-07-12 New Desiderata for Direct Preference Optimization Xiangkun Hu et.al. 2407.09072 null
2024-07-12 Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control Huayu Chen et.al. 2407.09024 null
2024-07-12 Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control Sicong Jiang et.al. 2407.08964 null
2024-07-11 MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces Wayne Wu et.al. 2407.08725 null
2024-07-11 RoboMorph: Evolving Robot Morphology using Large Language Models Kevin Qiu et.al. 2407.08626 null
2024-07-11 A Review of Nine Physics Engines for Reinforcement Learning Research Michael Kaup et.al. 2407.08590 null
2024-07-11 HACMan++: Spatially-Grounded Motion Primitives for Manipulation Bowen Jiang et.al. 2407.08585 null
2024-07-11 TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations Junik Bae et.al. 2407.08464 null
2024-07-11 Distributed Deep Reinforcement Learning Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing Cui Zhang et.al. 2407.08462 null
2024-07-11 Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning Shulin Song et.al. 2407.08458 link
2024-07-11 A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning Adrien Banse et.al. 2407.08324 null
2024-07-11 A Deep Reinforcement Learning Framework and Methodology for Reducing the Sim-to-Real Gap in ASV Navigation Luis F W Batista et.al. 2407.08263 null
2024-07-11 Gradient Boosting Reinforcement Learning Benjamin Fuhrer et.al. 2407.08250 link
2024-07-10 Learning In-Hand Translation Using Tactile Skin With Shear and Normal Force Sensing Jessica Yin et.al. 2407.07885 null
2024-07-10 Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation Eugene Teoh et.al. 2407.07868 null
2024-07-10 Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems Gianluigi Silvestri et.al. 2407.07794 null
2024-07-11 BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark Nikita Chernyadev et.al. 2407.07788 null
2024-07-10 Continuous Control with Coarse-to-fine Reinforcement Learning Younggyo Seo et.al. 2407.07787 null
2024-07-10 Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control Elahe Delavari et.al. 2407.07684 null
2024-07-10 Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning Dake Zhang et.al. 2407.07631 null
2024-07-10 Resource Allocation for Twin Maintenance and Computing Task Processing in Digital Twin Vehicular Edge Computing Network Yu Xie et.al. 2407.07575 link
2024-07-10 CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias Jiacheng Shen et.al. 2407.07454 link
2024-07-10 Real-time system optimal traffic routing under uncertainties -- Can physics models boost reinforcement learning? Zemian Ke et.al. 2407.07364 null
2024-07-09 Safe and Reliable Training of Learning-Based Aerospace Controllers Udayan Mandal et.al. 2407.07088 link
2024-07-09 Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Logan Cross et.al. 2407.07086 link
2024-07-09 Can Learned Optimization Make Reinforcement Learning Less Difficult? Alexander David Goldie et.al. 2407.07082 link
2024-07-09 A Unified Approach to Multi-task Legged Navigation: Temporal Logic Meets Reinforcement Learning Jesse Jiang et.al. 2407.06931 null
2024-07-09 Intercepting Unauthorized Aerial Robots in Controlled Airspace Using Reinforcement Learning Francisco Giral et.al. 2407.06909 null
2024-07-09 Learning From Crowdsourced Noisy Labels: A Signal Processing Perspective Shahana Ibrahim et.al. 2407.06902 null
2024-07-09 Energy Efficient Fair STAR-RIS for Mobile Users Ashok S. Kumar et.al. 2407.06868 null
2024-07-09 Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning Augustine N. Mavor-Parker et.al. 2407.06756 null
2024-07-09 Hierarchical Average-Reward Linearly-solvable Markov Decision Processes Guillermo Infante et.al. 2407.06690 null
2024-07-09 Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning Fanyue Wei et.al. 2407.06642 link
2024-07-08 Periodic agent-state based Q-learning for POMDPs Amit Sinha et.al. 2407.06121 null
2024-07-08 QTRL: Toward Practical Quantum Reinforcement Learning via Quantum-Train Chen-Yu Liu et.al. 2407.06103 null
2024-07-08 Stranger Danger! Identifying and Avoiding Unpredictable Pedestrians in RL-based Social Robot Navigation Sara Pohland et.al. 2407.06056 link
2024-07-08 iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement Aoyu Pang et.al. 2407.06025 link
2024-07-08 On Bellman equations for continuous-time policy evaluation I: discretization and approximation Wenlong Mou et.al. 2407.05966 null
2024-07-08 Graph Anomaly Detection with Noisy Labels by Reinforcement Learning Zhu Wang et.al. 2407.05934 null
2024-07-08 FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging Pranab Sahoo et.al. 2407.05800 link
2024-07-08 Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning Jakob Nyberg et.al. 2407.05775 link
2024-07-08 Multi-agent Reinforcement Learning-based Network Intrusion Detection System Amine Tellache et.al. 2407.05766 null
2024-07-08 $\mathrm{E^{2}CFD}$ : Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model Zepeng Wang et.al. 2407.05580 null
2024-07-05 Graph Reinforcement Learning in Power Grids: A Survey Mohamed Hassouna et.al. 2407.04522 null
2024-07-05 Using Petri Nets as an Integrated Constraint Mechanism for Reinforcement Learning Tasks Timon Sachweh et.al. 2407.04481 null
2024-07-05 Hindsight Preference Learning for Offline Preference-based Reinforcement Learning Chen-Xiao Gao et.al. 2407.04451 link
2024-07-05 Enhancing Safety for Autonomous Agents in Partly Concealed Urban Traffic Environments Through Representation-Based Shielding Pierre Haritz et.al. 2407.04343 link
2024-07-05 Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning I Lee et.al. 2407.04315 null
2024-07-05 Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling Jiawei Xu et.al. 2407.04285 null
2024-07-05 Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator Mehryar Abbasi et.al. 2407.04258 null
2024-07-05 PA-LOCO: Learning Perturbation-Adaptive Locomotion for Quadruped Robots Zhiyuan Xiao et.al. 2407.04224 null
2024-07-05 Autoverse: An Evolvable Game Langugage for Learning Robust Embodied Agents Sam Earle et.al. 2407.04221 null
2024-07-04 Orchestrating LLMs with Different Personalizations Jin Peng Zhou et.al. 2407.04181 null
2024-07-03 Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations Trevor Ablett et.al. 2407.03311 link
2024-07-03 A Review of the Applications of Deep Learning-Based Emergent Communication Brendon Boldt et.al. 2407.03302 null
2024-07-03 Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks Mintae Kim et.al. 2407.03280 null
2024-07-03 Policy-guided Monte Carlo on general state spaces: Application to glass-forming mixtures Leonardo Galliano et.al. 2407.03275 null
2024-07-03 PPO-based Dynamic Control of Uncertain Floating Platforms in the Zero-G Environment Mahya Ramezani et.al. 2407.03224 null
2024-07-03 Combining AI Control Systems and Human Decision Support via Robustness and Criticality Walt Woods et.al. 2407.03210 null
2024-07-03 Reinforcement Learning for Sequence Design Leveraging Protein Language Models Jithendaraa Subramanian et.al. 2407.03154 null
2024-07-03 Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes Asaf Cassel et.al. 2407.03065 null
2024-07-03 Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment Janghwan Lee et.al. 2407.03051 null
2024-07-03 On the Client Preference of LLM Fine-tuning in Federated Learning Feijie Wu et.al. 2407.03038 null
2024-07-03 PWM: Policy Learning with Large World Models Ignat Georgiev et.al. 2407.02466 null
2024-07-02 Predicting Visual Attention in Graphic Design Documents Souradeep Chakraborty et.al. 2407.02439 null
2024-07-02 Reinforcement Learning and Machine ethics:a systematic review Ajay Vishwanath et.al. 2407.02425 null
2024-07-02 Talking to Machines: do you read me? Lina M. Rojas-Barahona et.al. 2407.02354 null
2024-07-02 DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics Tyler Ga Wei Lum et.al. 2407.02274 null
2024-07-02 Safe CoR: A Dual-Expert Approach to Integrating Imitation Learning and Safe Reinforcement Learning Using Constraint Rewards Hyeokjin Kwon et.al. 2407.02245 null
2024-07-02 Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization Yuchen Hu et.al. 2407.02243 null
2024-07-02 Safety-Driven Deep Reinforcement Learning Framework for Cobots: A Sim2Real Approach Ammar N. Abbas et.al. 2407.02231 link
2024-07-02 Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning Zakariae El Asri et.al. 2407.02217 null
2024-07-02 Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning Yifang Chen et.al. 2407.02119 null
2024-06-28 PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators Kuo-Hao Zeng et.al. 2406.20083 null
2024-06-28 Applying RLAIF for Code Generation with API-usage in Lightweight LLMs Sujan Dutta et.al. 2406.20060 null
2024-06-28 HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid Xinyu Xu et.al. 2406.19972 link
2024-06-28 Operator World Models for Reinforcement Learning Pietro Novelli et.al. 2406.19861 null
2024-06-28 3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints Yoonkyu Yoo et.al. 2406.19848 null
2024-06-28 Reinforcement Learning for Efficient Design and Control Co-optimisation of Energy Systems Marine Cauz et.al. 2406.19825 null
2024-06-28 Identifying Ordinary Differential Equations for Data-efficient Model-based Reinforcement Learning Tobias Nagel et.al. 2406.19817 null
2024-06-28 Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs Shiyu Zhang et.al. 2406.19812 link
2024-06-28 Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels Jie Zhang et.al. 2406.19769 null
2024-07-01 Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors Emma Cramer et.al. 2406.19768 link
2024-06-27 Efficient World Models with Context-Aware Tokenization Vincent Micheli et.al. 2406.19320 link
2024-06-27 Averaging log-likelihoods in direct alignment Nathan Grinsztajn et.al. 2406.19188 null
2024-06-27 Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion Yannis Flet-Berliac et.al. 2406.19185 null
2024-06-27 Learning Pareto Set for Multi-Objective Continuous Robot Control Tianye Shu et.al. 2406.18924 link
2024-06-27 Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning Nishesh Singh et.al. 2406.18899 null
2024-06-27 State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems Tochukwu Elijah Ogri et.al. 2406.18804 null
2024-06-26 Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks Emanuel Figetakis et.al. 2406.18741 null
2024-06-26 Confident Natural Policy Gradient for Local Planning in $q_π$ -realizable Constrained MDPs Tian Tian et.al. 2406.18529 null
2024-06-26 Mental Modeling of Reinforcement Learning Agents by Language Models Wenhao Lu et.al. 2406.18505 null
2024-06-26 Preference Elicitation for Offline Reinforcement Learning Alizée Pace et.al. 2406.18450 null
2024-06-26 Mixture of Experts in a Mixture of RL settings Timon Willi et.al. 2406.18420 null
2024-06-26 AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors Hao Shi et.al. 2406.18394 null
2024-06-26 Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control Zifan Liu et.al. 2406.18351 null
2024-06-26 AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations Adam Dahlgren Lindström et.al. 2406.18346 null
2024-06-26 Spatial-temporal Hierarchical Reinforcement Learning for Interpretable Pathology Image Super-Resolution Wenting Chen et.al. 2406.18310 link
2024-06-26 Combining Automated Optimisation of Hyperparameters and Reward Shape Julian Dierkes et.al. 2406.18293 link
2024-06-27 Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems Italo Luis da Silva et.al. 2406.18245 link
2024-06-25 EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data Jesse Zhang et.al. 2406.17768 null
2024-06-25 When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning Claas Voelcker et.al. 2406.17718 link
2024-06-25 Privacy Preserving Reinforcement Learning for Population Processes Samuel Yang-Zhao et.al. 2406.17649 null
2024-06-25 KANQAS: Kolmogorov Arnold Network for Quantum Architecture Search Akash Kundu et.al. 2406.17630 link
2024-06-25 Leveraging Reinforcement Learning in Red Teaming for Advanced Ransomware Attack Simulations Cheng Wang et.al. 2406.17576 null
2024-06-25 On the consistency of hyper-parameter selection in value-based deep reinforcement learning Johan Obando-Ceron et.al. 2406.17523 link
2024-06-25 BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO Sebastian Dittert et.al. 2406.17490 null
2024-06-25 CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems Zhen Chen et.al. 2406.17425 null
2024-06-25 Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning Tianfu Wang et.al. 2406.17334 link
2024-06-25 The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game Lanyu Yang et.al. 2406.17326 null
2024-06-24 Confidence Aware Inverse Constrained Reinforcement Learning Sriram Ganapathi Subramanian et.al. 2406.16782 link
2024-06-24 WARP: On the Benefits of Weight Averaged Rewarded Policies Alexandre Ramé et.al. 2406.16768 null
2024-06-24 The MRI Scanner as a Diagnostic: Image-less Active Sampling Yuning Du et.al. 2406.16754 null
2024-06-24 OCALM: Object-Centric Assessment with Language Models Timo Kaufmann et.al. 2406.16748 null
2024-06-24 Adversarial Contrastive Decoding: Boosting Safety Alignment of Large Language Models via Opposite Prompt Optimization Zhengyue Zhao et.al. 2406.16743 null
2024-06-24 Probabilistic Subgoal Representations for Hierarchical Reinforcement learning Vivienne Huiling Wang et.al. 2406.16707 null
2024-06-24 Decentralized RL-Based Data Transmission Scheme for Energy Efficient Harvesting Rafaela Scaciota et.al. 2406.16624 null
2024-06-24 Towards Physically Talented Aerial Robots with Tactically Smart Swarm Behavior thereof: An Efficient Co-design Approach Prajit KrisshnaKumar et.al. 2406.16612 null
2024-06-24 $\text{Alpha}^2$ : Discovering Logical Formulaic Alphas using Deep Reinforcement Learning Feng Xu et.al. 2406.16505 link
2024-06-24 Towards Comprehensive Preference Data Collection for Reward Modeling Yulan Hu et.al. 2406.16486 null
2024-06-21 MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Xuan He et.al. 2406.15252 null
2024-06-21 Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning Sattar Vakili et.al. 2406.15250 null
2024-06-21 Deep UAV Path Planning with Assured Connectivity in Dense Urban Setting Jiyong Oh et.al. 2406.15225 null
2024-06-21 KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertainty Philipp Becker et.al. 2406.15131 null
2024-06-21 A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning Gianluca Drappo et.al. 2406.15124 null
2024-06-21 Towards General Negotiation Strategies with End-to-End Reinforcement Learning Bram M. Renting et.al. 2406.15096 null
2024-06-21 KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning Jiahan Chen et.al. 2406.15073 null
2024-06-21 Behaviour Distillation Andrei Lupu et.al. 2406.15042 link
2024-06-21 SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning Matthias Weissenbacher et.al. 2406.15025 link
2024-06-21 Evolution of Rewards for Food and Motor Action by Simulating Birth and Death Yuji Kanagawa et.al. 2406.15016 null
2024-06-20 CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics Jiawei Gao et.al. 2406.14558 null
2024-06-20 MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading Chuqiao Zong et.al. 2406.14537 link
2024-06-20 RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold Amrith Setlur et.al. 2406.14532 link
2024-06-20 Learning telic-controllable state representations Nadav Amir et.al. 2406.14476 null
2024-06-20 Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue Huifang Du et.al. 2406.14457 null
2024-06-20 Revealing the learning process in reinforcement learning agents through attention-oriented metrics Charlotte Beylier et.al. 2406.14324 null
2024-06-20 Resource Optimization for Tail-Based Control in Wireless Networked Control Systems Rasika Vijithasena et.al. 2406.14301 null
2024-06-21 REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability Shuang Ao et.al. 2406.14214 link
2024-06-20 Optimizing Novelty of Top-k Recommendations using Large Language Models and Reinforcement Learning Amit Sharma et.al. 2406.14169 null
2024-06-20 Tractable Equilibrium Computation in Markov Games through Risk Aversion Eric Mazumdar et.al. 2406.14156 null
2024-06-18 Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts Haoxiang Wang et.al. 2406.12845 link
2024-06-18 Injection Optimization at Particle Accelerators via Reinforcement Learning: From Simulation to Real-World Application Awal Awal et.al. 2406.12735 null
2024-06-18 A Systematization of the Wagner Framework: Graph Theory Conjectures and Reinforcement Learning Flora Angileri et.al. 2406.12667 null
2024-06-18 Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry A. L. García Navarro et.al. 2406.12602 link
2024-06-18 Discovering Minimal Reinforcement Learning Environments Jarek Liesen et.al. 2406.12589 link
2024-06-18 RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation Shuting Wang et.al. 2406.12566 null
2024-06-18 A Super-human Vision-based Reinforcement Learning Agent for Autonomous Racing in Gran Turismo Miguel Vasco et.al. 2406.12563 null
2024-06-18 Offline Imitation Learning with Model-based Reverse Augmentation Jie-Jing Shao et.al. 2406.12550 null
2024-06-18 Demonstrating Agile Flight from Pixels without State Estimation Ismail Geles et.al. 2406.12505 null
2024-06-18 Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning Harry Robertshaw et.al. 2406.12499 null
2024-06-17 WPO: Enhancing RLHF with Weighted Preference Optimization Wenxuan Zhou et.al. 2406.11827 link
2024-06-17 Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics Runzhe Wu et.al. 2406.11810 null
2024-06-17 Run Time Assured Reinforcement Learning for Six Degree-of-Freedom Spacecraft Inspection Kyle Dunlap et.al. 2406.11795 null
2024-06-17 Optimal Transport-Assisted Risk-Sensitive Q-Learning Zahra Shahrooei et.al. 2406.11774 null
2024-06-17 Measuring memorization in RLHF for code completion Aneesh Pappu et.al. 2406.11715 null
2024-06-18 The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation Noah Golowich et.al. 2406.11686 null
2024-06-17 Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs Min Hua et.al. 2406.11653 null
2024-06-18 Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions Noah Golowich et.al. 2406.11640 null
2024-06-17 Style Transfer with Multi-iteration Preference Optimization Shuai Liu et.al. 2406.11581 null
2024-06-17 Intersymbolic AI: Interlinking Symbolic AI and Subsymbolic AI André Platzer et.al. 2406.11563 null
2024-06-14 Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Rui Yang et.al. 2406.10216 null
2024-06-14 A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors Naaman Tan et.al. 2406.10203 link
2024-06-14 Misam: Using ML in Dataflow Selection of Sparse-Sparse Matrix Multiplication Sanjali Yadav et.al. 2406.10166 null
2024-06-14 Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models Carson Denison et.al. 2406.10162 link
2024-06-14 Bridging the Communication Gap: Artificial Agents Learning Sign Language through Imitation Federico Tavella et.al. 2406.10043 null
2024-06-14 ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR Vishwanath Pratap Singh et.al. 2406.09999 null
2024-06-14 Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model Siemen Herremans et.al. 2406.09976 link
2024-06-14 InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning Tiancheng Li et.al. 2406.09973 null
2024-06-14 Finite-Time Analysis of Simultaneous Double Q-learning Hyunjun Na et.al. 2406.09946 null
2024-06-14 I Know How: Combining Prior Policies to Solve New Tasks Malio Li et.al. 2406.09835 link
2024-06-13 Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms Miaosen Zhang et.al. 2406.09397 null
2024-06-13 Is Value Learning Really the Main Bottleneck in Offline RL? Seohong Park et.al. 2406.09329 null
2024-06-13 AutomaChef: A Physics-informed Demonstration-guided Learning Framework for Granular Material Manipulation Minglun Wei et.al. 2406.09178 null
2024-06-13 Adaptive Actor-Critic Based Optimal Regulation for Drift-Free Uncertain Nonlinear Systems Ashwin P. Dani et.al. 2406.09097 null
2024-06-13 DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning Xuemin Hu et.al. 2406.09089 null
2024-06-13 Data-driven modeling and supervisory control system optimization for plug-in hybrid electric vehicles Hao Zhang et.al. 2406.09082 null
2024-06-13 Latent Assistance Networks: Rediscovering Hyperbolic Tangents in RL Jacob E. Kooi et.al. 2406.09079 null
2024-06-13 Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation Claude Formanek et.al. 2406.09068 null
2024-06-13 CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms Arda Sarp Yenicesu et.al. 2406.09030 null
2024-06-13 XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Alexander Nikulin et.al. 2406.08973 null
2024-06-12 RILe: Reinforced Imitation Learning Mert Albaba et.al. 2406.08472 null
2024-06-12 Adaptive Swarm Mesh Refinement using Deep Reinforcement Learning with Local Rewards Niklas Freymuth et.al. 2406.08440 null
2024-06-12 RRLS : Robust Reinforcement Learning Suite Adil Zouitine et.al. 2406.08406 link
2024-06-12 Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning Yuhui Wang et.al. 2406.08404 null
2024-06-12 Time-Constrained Robust MDPs Adil Zouitine et.al. 2406.08395 null
2024-06-12 Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning Mohammadreza Nakhaei et.al. 2406.08238 link
2024-06-12 Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning Max Weltevrede et.al. 2406.08069 null
2024-06-12 Deep reinforcement learning with positional context for intraday trading Sven Goluža et.al. 2406.08013 null
2024-06-12 Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning Yizhe Huang et.al. 2406.08002 null
2024-06-12 Semantic-Aware Resource Allocation Based on Deep Reinforcement Learning for 5G-V2X HetNets Zhiyu Shao et.al. 2406.07996 link
2024-06-11 CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning Zeyuan Liu et.al. 2406.07541 null
2024-06-11 Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang et.al. 2406.07455 null
2024-06-11 Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization Weiliang Zhang et.al. 2406.07418 null
2024-06-11 Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks Bjarke Madsen et.al. 2406.07383 null
2024-06-11 World Models with Hints of Large Language Models for Goal Achieving Zeyuan Liu et.al. 2406.07381 null
2024-06-11 EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning Yijun Hao et.al. 2406.07342 null
2024-06-11 Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling Constantin Waubert de Puiseau et.al. 2406.07325 null
2024-06-12 Multi-objective Reinforcement learning from AI Feedback Marcus Williams et.al. 2406.07295 link
2024-06-11 Hybrid Reinforcement Learning from Offline Observation Alone Yuda Song et.al. 2406.07253 null
2024-06-11 A generic and robust quantum agent inspired by deep meta-reinforcement learning Zibo Miao et.al. 2406.07225 null
2024-06-10 Verification-Guided Shielding for Deep Reinforcement Learning Davide Corsi et.al. 2406.06507 null
2024-06-10 Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation Mohidul Haque Mridul et.al. 2406.06500 null
2024-06-10 Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity Calarina Muslimani et.al. 2406.06495 null
2024-06-10 Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots Bahador Beigomi et.al. 2406.06460 link
2024-06-10 Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? Denis Tarasov et.al. 2406.06309 link
2024-06-10 Learning-based cognitive architecture for enhancing coordination in human groups Antonio Grotta et.al. 2406.06297 null
2024-06-10 Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization Jesse van Remmerden et.al. 2406.06184 null
2024-06-10 Mastering truss structure optimization with tree search Gabriel E. Garayalde et.al. 2406.06145 null
2024-06-10 EXPIL: Explanatory Predicate Invention for Learning in Games Jingyuan Sha et.al. 2406.06107 link
2024-06-10 Sim-To-Real Transfer for Visual Reinforcement Learning of Deformable Object Manipulation for Robot-Assisted Surgery Paul Maria Scheikl et.al. 2406.06092 null
2024-06-07 LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration Tavor Lipman et.al. 2406.05107 null
2024-06-07 Massively Multiagent Minigames for Training Generalist Agents Kyoung Whan Choe et.al. 2406.05071 link
2024-06-07 Online Frequency Scheduling by Learning Parallel Actions Anastasios Giovanidis et.al. 2406.05041 null
2024-06-07 Optimizing Automatic Differentiation with Deep Reinforcement Learning Jamie Lohoff et.al. 2406.05027 null
2024-06-07 Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable Systems Rohan Paleja et.al. 2406.05003 null
2024-06-07 SLOPE: Search with Learned Optimal Pruning-based Expansion Davor Bokan et.al. 2406.04935 link
2024-06-07 Sim-to-real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning Arvi Jonnarth et.al. 2406.04920 null
2024-06-07 Stabilizing Extreme Q-learning by Maclaurin Expansion Motoki Omura et.al. 2406.04896 null
2024-06-07 Primitive Agentic First-Order Optimization R. Sala et.al. 2406.04841 null
2024-06-07 Algorithms for learning value-aligned policies considering admissibility relaxation Andrés Holgado-Sánchez et.al. 2406.04838 null
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323 null
2024-06-06 Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models Xiang Ji et.al. 2406.04274 null
2024-06-06 MARLander: A Local Path Planning for Drone Swarms using Multiagent Deep Reinforcement Learning Demetros Aschu et.al. 2406.04159 null
2024-06-06 Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning Abdullah Akgül et.al. 2406.04088 null
2024-06-06 Bootstrapping Expectiles in Reinforcement Learning Pierre Clavier et.al. 2406.04081 null
2024-06-06 Spatio-temporal Early Prediction based on Multi-objective Reinforcement Learning Wei Shao et.al. 2406.04035 link
2024-06-06 Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents Yoann Poupart et.al. 2406.04028 link
2024-06-06 HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning Quentin Delfosse et.al. 2406.03997 link
2024-06-06 AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control Rudolf Reiter et.al. 2406.03995 null
2024-06-06 Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning Lin Liu et.al. 2406.03978 link
2024-06-05 Automating Turkish Educational Quiz Generation Using Large Language Models Kamyar Zeinalipour et.al. 2406.03397 link
2024-06-05 LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback Timon Ziegenbein et.al. 2406.03363 null
2024-06-05 UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning Yu Zhang et.al. 2406.03324 null
2024-06-05 Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning Mohamed Elsayed et.al. 2406.03276 link
2024-06-05 Prompt-based Visual Alignment for Zero-shot Policy Transfer Haihan Gao et.al. 2406.03250 null
2024-06-05 Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning Inwoo Hwang et.al. 2406.03234 link
2024-06-05 CommonPower: Supercharging Machine Learning for Smart Grids Michael Eichelbeck et.al. 2406.03231 link
2024-06-05 Object Manipulation in Marine Environments using Reinforcement Learning Ahmed Nader et.al. 2406.03223 null
2024-06-05 Adaptive Distance Functions via Kelvin Transformation Rafael I. Cabral Muchacho et.al. 2406.03200 null
2024-06-05 DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays Bo Xia et.al. 2406.03102 null
2024-06-04 Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and Posterior Value Optimisation in Finite-State MDPs Filippo Valdettaro et.al. 2406.02456 link
2024-06-04 A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies Md Mirajul Islam et.al. 2406.02450 null
2024-06-04 Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning Shidi Deng et.al. 2406.02437 null
2024-06-04 Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Philip Anastassiou et.al. 2406.02430 link
2024-06-04 Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning Jiaxu Wang et.al. 2406.02370 null
2024-06-04 How to Explore with Belief: State Entropy Maximization in POMDPs Riccardo Zamboni et.al. 2406.02295 null
2024-06-04 Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling Arthur Müller et.al. 2406.02294 null
2024-06-04 Test-Time Regret Minimization in Meta Reinforcement Learning Mirco Mutti et.al. 2406.02282 null
2024-06-04 Reinforcement Learning with Lookahead Information Nadav Merlis et.al. 2406.02258 null
2024-06-04 Quantum Computing in Wireless Communications and Networking: A Tutorial-cum-Survey Wei Zhao et.al. 2406.02240 null
2024-05-31 Exploratory Preference Optimization: Harnessing Implicit Q-Approximation for Sample-Efficient RLHF* Tengyang Xie et.al. 2405.21046 null
2024-05-31 Direct Alignment of Language Models via Quality-Aware Self-Refinement Runsheng Yu et.al. 2405.21040 null
2024-06-03 Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles Jiesong Lian et.al. 2405.21027 null
2024-05-31 Generating Triangulations and Fibrations with Reinforcement Learning Per Berglund et.al. 2405.21017 null
2024-05-31 Bayesian Design Principles for Offline-to-Online Reinforcement Learning Hao Hu et.al. 2405.20984 link
2024-05-31 Goal-Oriented Sensor Reporting Scheduling for Non-linear Dynamic System Monitoring Prasoon Raghuwanshi et.al. 2405.20983 null
2024-05-31 SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales Tianyang Xu et.al. 2405.20974 link
2024-05-31 Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman et.al. 2405.20971 link
2024-05-31 Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation Shangding Gu et.al. 2405.20860 null
2024-05-31 Improving Reward Models with Synthetic Critiques Zihuiwen Ye et.al. 2405.20850 null
2024-05-30 Group Robust Preference Optimization in Reward-free RLHF Shyam Sundhar Ramesh et.al. 2405.20304 link
2024-05-30 Evaluating Large Language Model Biases in Persona-Steered Generation Andy Liu et.al. 2405.20253 link
2024-05-30 InstructionCP: A fast approach to transfer Large Language Models into target language Kuang-Ming Chen et.al. 2405.20175 null
2024-05-30 Enhancing Battlefield Awareness: An Aerial RIS-assisted ISAC System with Deep Reinforcement Learning Hyunsang Cho et.al. 2405.20168 null
2024-05-30 Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation Wooseong Cho et.al. 2405.20165 null
2024-05-31 NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models Kai Wu et.al. 2405.20081 null
2024-05-30 Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads Avelina Asada Hadji-Kyriacou et.al. 2405.20053 link
2024-05-30 Deep Reinforcement Learning for Intrusion Detection in IoT: A Survey Afrah Gueriani et.al. 2405.20038 null
2024-05-30 Safe Multi-agent Reinforcement Learning with Natural Language Constraints Ziyan Wang et.al. 2405.20018 null
2024-05-30 LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning Hyungho Na et.al. 2405.19998 link
2024-05-29 Self-Exploring Language Models: Active Preference Elicitation for Online Alignment Shenao Zhang et.al. 2405.19332 link
2024-05-29 Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF Shicong Cen et.al. 2405.19320 null
2024-05-29 Robust Preference Optimization through Reward Model Distillation Adam Fisch et.al. 2405.19316 null
2024-05-29 Rich-Observation Reinforcement Learning with Continuous Latent Dynamics Yuda Song et.al. 2405.19269 null
2024-05-29 Exploring the impact of traffic signal control and connected and automated vehicles on intersections safety: A deep reinforcement learning approach Amir Hossein Karbasi et.al. 2405.19236 null
2024-05-29 Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning Hanye Zhao et.al. 2405.19189 link
2024-05-29 A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning Arthur Juliani et.al. 2405.19153 null
2024-05-29 Learning Interpretable Scheduling Algorithms for Data Processing Clusters Zhibo Hu et.al. 2405.19131 null
2024-05-29 Offline Regularised Reinforcement Learning for Large Language Models Alignment Pierre Harvey Richemond et.al. 2405.19107 null
2024-05-29 OMPO: A Unified Framework for RL under Policy and Dynamics Shifts Yu Luo et.al. 2405.19080 link
2024-05-28 Hierarchical World Models as Visual Whole-Body Humanoid Controllers Nicklas Hansen et.al. 2405.18418 null
2024-05-28 Value Alignment and Trust in Human-Robot Interaction: Insights from Simulation and User Study Shreyas Bhat et.al. 2405.18324 null
2024-05-28 Highway Reinforcement Learning Yuhui Wang et.al. 2405.18289 null
2024-05-28 Extreme Value Monte Carlo Tree Search Masataro Asai et.al. 2405.18248 null
2024-05-28 Recurrent Natural Policy Gradient for POMDPs Semih Cayci et.al. 2405.18221 null
2024-05-28 Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving Zhi Zheng et.al. 2405.18209 link
2024-05-28 Mutation-Bias Learning in Games Johann Bauer et.al. 2405.18190 null
2024-05-28 Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding Daniel Bethell et.al. 2405.18180 link
2024-05-28 Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing Wei Zhao et.al. 2405.18166 link
2024-05-28 PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning Martin Balla et.al. 2405.18123 link
2024-05-27 A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning Abdulaziz Almuzairee et.al. 2405.17416 link
2024-05-27 Rethinking Transformers in Solving POMDPs Chenhao Lu et.al. 2405.17358 link
2024-05-27 Opinion-Guided Reinforcement Learning Kyanna Dagenais et.al. 2405.17287 null
2024-05-27 DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems Zhi Zheng et.al. 2405.17272 link
2024-05-27 Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning Adriana Hugessen et.al. 2405.17243 null
2024-05-27 InsigHTable: Insight-driven Hierarchical Table Visualization with Reinforcement Learning Guozheng Li et.al. 2405.17229 null
2024-05-27 Learning Generic and Dynamic Locomotion of Humanoids Across Discrete Terrains Shangqun Yu et.al. 2405.17227 null
2024-05-27 Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning P. Suárez et.al. 2405.17210 null
2024-05-27 CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control Jingqing Ruan et.al. 2405.17152 link
2024-05-27 Q-value Regularized Transformer for Offline Reinforcement Learning Shengchao Hu et.al. 2405.17098 null
2024-05-24 Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment Hao Sun et.al. 2405.15624 null
2024-05-24 Neuromorphic dreaming: A pathway to efficient learning in artificial agents Ingo Blakowski et.al. 2405.15616 link
2024-05-24 OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code Maxence Faldor et.al. 2405.15568 null
2024-05-24 Learning Generalizable Human Motion Generator with Reinforcement Learning Yunyao Mao et.al. 2405.15541 null
2024-05-24 Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces Angeliki Kamoutsi et.al. 2405.15509 link
2024-05-24 Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments Olivia Jullian Parra et.al. 2405.15508 null
2024-05-24 TD3 Based Collision Free Motion Planning for Robot Navigation Hao Liu et.al. 2405.15460 null
2024-05-24 Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics David Boetius et.al. 2405.15430 null
2024-05-24 Model-free reinforcement learning with noisy actions for automated experimental control in optics Lea Richtmann et.al. 2405.15421 link
2024-05-24 Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate Fan-Ming Luo et.al. 2405.15384 link
2024-05-23 Privileged Sensing Scaffolds Reinforcement Learning Edward S. Hu et.al. 2405.14853 null
2024-05-23 Axioms for AI Alignment from Human Feedback Luise Ge et.al. 2405.14758 null
2024-05-23 AGILE: A Novel Framework of LLM Agents Peiyuan Feng et.al. 2405.14751 link
2024-05-23 Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence Minheng Xiao et.al. 2405.14749 null
2024-05-23 SimPO: Simple Preference Optimization with a Reference-Free Reward Yu Meng et.al. 2405.14734 link
2024-05-23 Multi-turn Reinforcement Learning from Preference Human Feedback Lior Shani et.al. 2405.14655 null
2024-05-23 Reinforcement Learning for Fine-tuning Text-to-speech Diffusion Models Jingyi Chen et.al. 2405.14632 null
2024-05-23 Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences Takuya Hiraoka et.al. 2405.14629 link
2024-05-23 Closed-form Symbolic Solutions: A New Perspective on Solving Partial Differential Equations Shu Wei et.al. 2405.14620 null
2024-05-23 Discretization of continuous input spaces in the hippocampal autoencoder Adrian F. Amil et.al. 2405.14600 link
2024-05-21 Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale Shriram Chennakesavalu et.al. 2405.12961 link
2024-05-21 Effect of Synthetic Jets Actuator Parameters on Deep Reinforcement Learning-Based Flow Control Performance in a Square Cylinder Wang Jia et.al. 2405.12834 null
2024-05-22 Deep Reinforcement Learning for Time-Critical Wilderness Search And Rescue Using Drones Jan-Hendrik Ewers et.al. 2405.12800 null
2024-05-21 Generative AI and Large Language Models for Cyber Security: All Insights You Need Mohamed Amine Ferrag et.al. 2405.12750 null
2024-05-21 Reinforcement Learning Enabled Peer-to-Peer Energy Trading for Dairy Farms Mian Ibad Ali Shah et.al. 2405.12716 null
2024-05-21 A Multimodal Learning-based Approach for Autonomous Landing of UAV Francisco Neves et.al. 2405.12681 null
2024-05-21 Learning Causal Dynamics Models in Object-Oriented Environments Zhongwei Yu et.al. 2405.12615 link
2024-05-21 PhiBE: A PDE-based Bellman Equation for Continuous Time Policy Evaluation Yuhua Zhu et.al. 2405.12535 null
2024-05-21 GASE: Graph Attention Sampling with Edges Fusion for Solving Vehicle Routing Problems Zhenwei Wang et.al. 2405.12475 null
2024-05-21 Physics-based Scene Layout Generation from Human Motion Jianan Li et.al. 2405.12460 null
2024-05-20 Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? Yang Dai et.al. 2405.12094 null
2024-05-20 PARALLELGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation Zhuobin Huang et.al. 2405.12079 null
2024-05-20 Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning Hai Zhang et.al. 2405.12001 null
2024-05-20 Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space Qianmei Liu et.al. 2405.11982 null
2024-05-20 A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers Tom Roth et.al. 2405.11904 null
2024-05-20 Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process Ermo Hua et.al. 2405.11870 link
2024-05-20 Reward-Punishment Reinforcement Learning with Maximum Entropy Jiexin Wang et.al. 2405.11784 null
2024-05-20 Efficient Multi-agent Reinforcement Learning by Planning Qihan Liu et.al. 2405.11778 link
2024-05-20 Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning Xin Liu et.al. 2405.11740 null
2024-05-20 Highway Graph to Accelerate Reinforcement Learning Zidu Yin et.al. 2405.11727 link
2024-05-17 Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: Systematic Literature Review Hongyi Yang et.al. 2405.10883 null
2024-05-17 Automated Radiology Report Generation: A Review of Recent Advances Phillip Sloan et.al. 2405.10842 null
2024-05-17 Combining Teacher-Student with Representation Learning: A Concurrent Teacher-Student Reinforcement Learning Paradigm for Legged Locomotion Hongxi Wang et.al. 2405.10830 null
2024-05-17 Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities Hao Zhou et.al. 2405.10825 null
2024-05-17 A Functional Model Method for Nonconvex Nonsmooth Conditional Stochastic Optimization Andrzej Ruszczyński et.al. 2405.10815 null
2024-05-17 SignLLM: Sign Languages Production Large Language Models Sen Fang et.al. 2405.10718 null
2024-05-17 Sample-Efficient Constrained Reinforcement Learning with General Parameterization Washim Uddin Mondal et.al. 2405.10624 null
2024-05-17 An Efficient Learning Control Framework With Sim-to-Real for String-Type Artificial Muscle-Driven Robotic Systems Jiyue Tao et.al. 2405.10576 null
2024-05-17 Time-Varying Constraint-Aware Reinforcement Learning for Energy Storage Control Jaeik Jeong et.al. 2405.10536 null
2024-05-17 Towards Better Question Generation in QA-Based Event Extraction Zijin Hong et.al. 2405.10517 link
2024-05-16 Stochastic Q-learning for Large Discrete Action Spaces Fares Fourati et.al. 2405.10310 null
2024-05-17 Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Yuexiang Zhai et.al. 2405.10292 null
2024-05-16 Keep It Private: Unsupervised Privatization of Online Text Calvin Bao et.al. 2405.10260 link
2024-05-16 A Design Trajectory Map of Human-AI Collaborative Reinforcement Learning Systems: Survey and Taxonomy Zhaoxing Li et.al. 2405.10214 null
2024-05-16 Continuous Transfer Learning for UAV Communication-aware Trajectory Design Chenrui Sun et.al. 2405.10087 null
2024-05-16 Optimizing Search and Rescue UAV Connectivity in Challenging Terrain through Multi Q-Learning Mohammed M. H. Qazzaz et.al. 2405.10042 null
2024-05-16 Reward Centering Abhishek Naik et.al. 2405.09999 null
2024-05-16 Combining RL and IL using a dynamic, performance-based modulation over learning signals and its application to local planning Francisco Leiva et.al. 2405.09760 null
2024-05-16 NIFTY Financial News Headlines Dataset Raeid Saqur et.al. 2405.09747 null
2024-05-15 Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning Sihan Zeng et.al. 2405.09660 null
2024-05-15 Reinforcement Learning-Based Framework for the Intelligent Adaptation of User Interfaces Daniel Gaspar-Figueiredo et.al. 2405.09255 null
2024-05-15 DVS-RG: Differential Variable Speed Limits Control using Deep Reinforcement Learning with Graph State Representation Jingwen Yang et.al. 2405.09163 null
2024-05-15 CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving Dechen Gao et.al. 2405.09111 link
2024-05-15 Chaos-based reinforcement learning with TD3 Toshitaka Matsuki et.al. 2405.09086 null
2024-05-15 Deep Learning in Earthquake Engineering: A Comprehensive Review Yazhou Xie et.al. 2405.09021 null
2024-05-14 Large Language Models for Human-Machine Collaborative Particle Accelerator Tuning through Natural Language Jan Kaiser et.al. 2405.08888 null
2024-05-14 Stable Inverse Reinforcement Learning: Policies from Control Lyapunov Landscapes Samuel Tesfazgi et.al. 2405.08756 null
2024-05-14 Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach Urvij Saroliya et.al. 2405.08754 null
2024-05-14 Reinformer: Max-Return Sequence Modeling for offline RL Zifeng Zhuang et.al. 2405.08740 link
2024-05-14 I-CTRL: Imitation to Control Humanoid Robots Through Constrained Reinforcement Learning Yashuai Yan et.al. 2405.08726 null
2024-05-15 Enhancing Reinforcement Learning in Sensor Fusion: A Comparative Analysis of Cubature and Sampling-based Integration Methods for Rover Search Planning Jan-Hendrik Ewers et.al. 2405.08691 null
2024-05-14 A Distributed Approach to Autonomous Intersection Management via Multi-Agent Reinforcement Learning Matteo Cederle et.al. 2405.08655 link
2024-05-14 vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement Yiwen Zhu et.al. 2405.08638 null
2024-05-14 Optimizing Deep Reinforcement Learning for American Put Option Hedging Reilly Pickard et.al. 2405.08602 null
2024-05-14 Python-Based Reinforcement Learning on Simulink Models Georg Schäfer et.al. 2405.08567 null
2024-05-14 Growing Artificial Neural Networks for Control: the Role of Neuronal Diversity Eleni Nisioti et.al. 2405.08510 link
2024-05-13 RLHF Workflow: From Reward Modeling to Online RLHF Hanze Dong et.al. 2405.07863 link
2024-05-13 Adaptive Exploration for Data-Efficient General Value Function Evaluations Arushi Jain et.al. 2405.07838 link
2024-05-13 Fixed Point Theory Analysis of a Lambda Policy Iteration with Randomization for the Ćirić Contraction Operator Abdelkader Belhenniche et.al. 2405.07824 null
2024-05-13 Hamiltonian-based Quantum Reinforcement Learning for Neural Combinatorial Optimization Georg Kruse et.al. 2405.07790 null
2024-05-13 Hype or Heuristic? Quantum Reinforcement Learning for Join Order Optimisation Maja Franz et.al. 2405.07770 link
2024-05-13 CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization Wei-Ting Tang et.al. 2405.07760 null
2024-05-13 MADRL-Based Rate Adaptation for 360 $\degree$ Video Streaming with Multi-Viewpoint Prediction Haopeng Wang et.al. 2405.07759 null
2024-05-13 Neural Network Compression for Reinforcement Learning Tasks Dmitry A. Ivanov et.al. 2405.07748 null
2024-05-13 Backdoor Removal for Generative Large Language Models Haoran Li et.al. 2405.07667 null
2024-05-14 Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback Asaf Cassel et.al. 2405.07637 null
2024-05-10 Value Augmented Sampling for Language Model Alignment and Personalization Seungwook Han et.al. 2405.06639 link
2024-05-10 EcoEdgeTwin: Enhanced 6G Network via Mobile Edge Computing and Digital Twin Integration Synthia Hossain Karobi et.al. 2405.06507 null
2024-05-10 Advantageous and disadvantageous inequality aversion can be taught through vicarious learning of others' preferences Shen Zhang et.al. 2405.06500 null
2024-05-10 Contextual Affordances for Safe Exploration in Robotic Scenarios William Z. Ye et.al. 2405.06422 null
2024-05-10 Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs Davide Maran et.al. 2405.06363 null
2024-05-10 Learning Latent Dynamic Robust Representations for World Models Ruixiang Sun et.al. 2405.06263 link
2024-05-10 Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning Xiaoyu Wen et.al. 2405.06192 link
2024-05-10 (A Partial Survey of) Decentralized, Cooperative Multi-Agent Reinforcement Learning Christopher Amato et.al. 2405.06161 null
2024-05-09 An RNN-policy gradient approach for quantum architecture search Gang Wang et.al. 2405.05892 null
2024-05-09 Safe Exploration Using Bayesian World Models and Log-Barrier Optimization Yarden As et.al. 2405.05890 null
2024-05-09 Policy Gradient with Active Importance Sampling Matteo Papini et.al. 2405.05630 null
2024-05-09 An Automatic Prompt Generation System for Tabular Data Tasks Ashlesha Akella et.al. 2405.05618 null
2024-05-09 Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning Yuchen Shi et.al. 2405.05542 link
2024-05-08 Model-Free Robust $φ$ -Divergence Reinforcement Learning Using Both Offline and Online Data Kishan Panaganti et.al. 2405.05468 null
2024-05-08 Markowitz Meets Bellman: Knowledge-distilled Reinforcement Learning for Portfolio Management Gang Hu et.al. 2405.05449 null
2024-05-08 Learning to Play Pursuit-Evasion with Dynamic and Sensor Constraints Burak M. Gonultas et.al. 2405.05372 null
2024-05-08 Offline Model-Based Optimization via Policy-Guided Gradient Search Yassine Chemingui et.al. 2405.05349 link
2024-05-08 Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models Aylin Gunal et.al. 2405.05060 null
2024-05-08 Fault Identification Enhancement with Reinforcement Learning (FIERL) Valentina Zaccaria et.al. 2405.04938 link
2024-05-07 RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes Kyle Stachowicz et.al. 2405.04714 null
2024-05-07 Proximal Policy Optimization with Adaptive Exploration Andrei Lixandru et.al. 2405.04664 null
2024-05-07 ACEGEN: Reinforcement learning of generative chemical agents for drug discovery Albert Bou et.al. 2405.04657 link
2024-05-07 TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters Jonathan Wilder Lavington et.al. 2405.04491 null
2024-05-07 Designing, Developing, and Validating Network Intelligence for Scaling in Service-Based Architectures based on Deep Reinforcement Learning Paola Soto et.al. 2405.04441 null
2024-05-08 DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model DeepSeek-AI et.al. 2405.04434 link
2024-05-07 The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin et.al. 2405.04342 link
2024-05-07 Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation Atharvan Dogra et.al. 2405.04325 null
2024-05-07 Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies Paul Templier et.al. 2405.04322 null
2024-05-07 Improving Offline Reinforcement Learning with Inaccurate Simulators Yiwen Hou et.al. 2405.04307 null
2024-05-07 Deep Reinforcement Learning for Multi-User RF Charging with Non-linear Energy Harvesters Amirhossein Azarbahram et.al. 2405.04218 null
2024-05-07 In-context Learning for Automated Driving Scenarios Ziqi Zhou et.al. 2405.04135 link
2024-05-07 Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning Teng Xue et.al. 2405.04082 null
2024-05-06 $ε$ -Policy Gradient for Online Pricing Lukasz Szpruch et.al. 2405.03624 null
2024-05-06 Position Paper: Leveraging Foundational Models for Black-Box Optimization: Benefits, Challenges, and Future Directions Xingyou Song et.al. 2405.03547 null
2024-05-06 ReinWiFi: A Reinforcement-Learning-Based Framework for the Application-Layer QoS Optimization of WiFi Networks Qianren Li et.al. 2405.03526 link
2024-05-06 Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning Stone Tao et.al. 2405.03379 link
2024-05-06 Enhancing Q-Learning with Large Language Model Heuristics Xiefeng Wu et.al. 2405.03341 null
2024-05-06 Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic Review Harry Robertshaw et.al. 2405.03305 null
2024-05-06 End-to-End Reinforcement Learning of Curative Curtailment with Partial Measurement Availability Hinrikus Wolf et.al. 2405.03262 null
2024-05-06 Federated Reinforcement Learning with Constraint Heterogeneity Hao Jin et.al. 2405.03236 null
2024-05-06 Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning Caleb Chuck et.al. 2405.03113 null
2024-05-05 Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning Tianchen Zhou et.al. 2405.03082 null
2024-05-03 Geometric Fabrics: a Safe Guiding Medium for Policy Learning Karl Van Wyk et.al. 2405.02250 null
2024-05-03 Learning Optimal Deterministic Policies with Stochastic Policy Gradients Alessandro Montenegro et.al. 2405.02235 null
2024-05-03 The Cambridge RoboMaster: An Agile Multi-Robot Research Platform Jan Blumenkamp et.al. 2405.02198 null
2024-05-03 Simulating the economic impact of rationality through reinforcement learning and agent-based modelling Simone Brusatin et.al. 2405.02161 link
2024-05-03 Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach Anton Plaksin et.al. 2405.02044 null
2024-05-03 Model-based reinforcement learning for protein backbone design Frederic Renard et.al. 2405.01983 null
2024-05-03 Rescale-Invariant Federated Reinforcement Learning for Resource Allocation in V2X Networks Kaidi Xu et.al. 2405.01961 null
2024-05-03 Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization Changliang Zhou et.al. 2405.01906 null
2024-05-03 Reinforcement Learning control strategies for Electric Vehicles and Renewable energy sources Virtual Power Plants Francesco Maldonato et.al. 2405.01889 link
2024-05-03 A Model-based Multi-Agent Personalized Short-Video Recommender System Peilun Zhou et.al. 2405.01847 null
2024-05-02 Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal et.al. 2405.01534 null
2024-05-02 FLAME: Factuality-Aware Alignment for Large Language Models Sheng-Chieh Lin et.al. 2405.01525 null
2024-05-02 NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Gerald Shen et.al. 2405.01481 link
2024-05-02 Goal-conditioned reinforcement learning for ultrasound navigation guidance Abdoul Aziz Amadou et.al. 2405.01409 null
2024-05-02 Learning Force Control for Legged Manipulation Tifanny Portela et.al. 2405.01402 null
2024-05-03 Constrained Reinforcement Learning Under Model Mismatch Zhongchang Sun et.al. 2405.01327 null
2024-05-02 Non-iterative Optimization of Trajectory and Radio Resource for Aerial Network Hyeonsu Lyu et.al. 2405.01314 null
2024-05-02 Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning Liu Qiyuan et.al. 2405.01284 null
2024-05-02 Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation Hao Wang et.al. 2405.01280 null
2024-05-02 Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies Finn Rietz et.al. 2405.01198 null
2024-05-01 Self-Play Preference Optimization for Language Model Alignment Yue Wu et.al. 2405.00675 link
2024-05-01 No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO Skander Moalla et.al. 2405.00662 link
2024-05-01 HUGO -- Highlighting Unseen Grid Options: Combining Deep Reinforcement Learning with a Heuristic Target Topology Approach Malte Lehna et.al. 2405.00629 null
2024-05-01 Koopman-based Deep Learning for Nonlinear System Estimation Zexin Sun et.al. 2405.00627 null
2024-05-01 Queue-based Eco-Driving at Roundabouts with Reinforcement Learning Anna-Lena Schlamp et.al. 2405.00625 null
2024-05-01 The Real, the Better: Aligning Large Language Models with Online Human Behaviors Guanying Jiang et.al. 2405.00578 null
2024-05-01 Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment Zhili Liu et.al. 2405.00557 null
2024-05-01 Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning Lucas-Andreï Thil et.al. 2405.00516 null
2024-05-01 MetaRM: Shifted Distributions Alignment via Meta-Learning Shihan Dou et.al. 2405.00438 null
2024-05-01 UCB-driven Utility Function Search for Multi-objective Reinforcement Learning Yucheng Shi et.al. 2405.00410 link
2024-04-30 Collaborative Control Method of Transit Signal Priority Based on Cooperative Game and Reinforcement Learning Hao Qin et.al. 2404.19683 null
2024-04-30 Towards Generalist Robot Learning from Internet Video: A Survey Robert McCarthy et.al. 2404.19664 null
2024-04-30 Short term vs. long term: optimization of microswimmer navigation on different time horizons Navid Mousavi et.al. 2404.19561 null
2024-04-30 Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation Cengis Hasan et.al. 2404.19462 null
2024-04-30 Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning Mathieu Rita et.al. 2404.19409 link
2024-04-30 Numeric Reward Machines Kristina Levina et.al. 2404.19370 null
2024-04-30 Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning Chenjia Bai et.al. 2404.19346 link
2024-04-30 Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning Qiaosheng Zhang et.al. 2404.19292 null
2024-04-30 DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets Xiaoyu Huang et.al. 2404.19264 null
2024-04-30 Bias Mitigation via Compensation: A Reinforcement Learning Perspective Nandhini Swaminathan et.al. 2404.19256 null
2024-04-29 DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong et.al. 2404.18922 null
2024-04-29 Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty Laixi Shi et.al. 2404.18909 null
2024-04-29 More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness Aaron J. Li et.al. 2404.18870 link
2024-04-29 Performance-Aligned LLMs for Generating Fast Code Daniel Nichols et.al. 2404.18864 null
2024-04-30 Winning the Social Media Influence Battle: Uncertainty-Aware Opinions to Understand and Spread True Information via Competitive Influence Maximization Qi Zhang et.al. 2404.18826 null
2024-04-30 Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies Seyed Soroush Karimi Madahi et.al. 2404.18821 null
2024-04-29 Multi-Agent Synchronization Tasks Rolando Fernandez et.al. 2404.18798 null
2024-04-29 Resource-rational reinforcement learning and sensorimotor causal states Sarah Marzen et.al. 2404.18775 null
2024-04-29 Self-training superconducting neuromorphic circuits using reinforcement learning rules M. L. Schneider et.al. 2404.18774 null
2024-04-29 Adaptive Reinforcement Learning for Robot Control Yu Tang Liu et.al. 2404.18713 link
2024-04-26 Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo Stephen Zhao et.al. 2404.17546 link
2024-04-26 Quantum Multi-Agent Reinforcement Learning for Aerial Ad-hoc Networks Theodora-Augustina Drăgan et.al. 2404.17499 null
2024-04-26 Q-Learning to navigate turbulence without a map Marco Rando et.al. 2404.17495 null
2024-04-26 Adaptive speed planning for Unmanned Vehicle Based on Deep Reinforcement Learning Hao Liu et.al. 2404.17379 null
2024-04-26 When to Trust LLMs: Aligning Confidence with Response Quality Shuchang Tao et.al. 2404.17287 null
2024-04-26 Enhancing Privacy and Security of Autonomous UAV Navigation Vatsal Aggarwal et.al. 2404.17225 null
2024-04-26 An Explainable Deep Reinforcement Learning Model for Warfarin Maintenance Dosing Using Policy Distillation and Action Forging Sadjad Anzabi Zadeh et.al. 2404.17187 null
2024-04-25 Compiler for Distributed Quantum Computing: a Reinforcement Learning Approach Panagiotis Promponas et.al. 2404.17077 link
2024-04-25 Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey Lingfan Bao et.al. 2404.17070 null
2024-04-25 Evaluating Collaborative Autonomy in Opposed Environments using Maritime Capture-the-Flag Competitions Jordan Beason et.al. 2404.17038 null
2024-04-25 REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao et.al. 2404.16767 link
2024-04-25 Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods Min Kyu Shin et.al. 2404.16721 null
2024-04-25 RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments Diego Martinez-Baselga et.al. 2404.16672 null
2024-04-25 Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare Emre Can Acikgoz et.al. 2404.16621 link
2024-04-25 Exploring the Dynamics of Data Transmission in 5G Networks: A Conceptual Analysis Nikita Smirnov et.al. 2404.16508 null
2024-04-25 A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints Bram De Cooman et.al. 2404.16468 null
2024-04-25 Offline Reinforcement Learning with Behavioral Supervisor Tuning Padmanaba Srinivasan et.al. 2404.16399 null
2024-04-25 SwarmRL: Building the Future of Smart Active Systems Samuel Tovey et.al. 2404.16388 link
2024-04-25 Reinforcement Learning with Generative Models for Compact Support Sets Nico Schiavone et.al. 2404.16300 link
2024-04-24 ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling Arjun Somayazulu et.al. 2404.16216 null
2024-04-24 DPO: Differential reinforcement learning with application to optimal configuration search Chandrajit Bajaj et.al. 2404.15617 null
2024-04-24 GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL Lang Qin et.al. 2404.15597 null
2024-04-24 Multi-Agent Reinforcement Learning for Energy Networks: Computational Challenges, Progress and Open Problems Sarah Keren et.al. 2404.15583 null
2024-04-23 An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan et.al. 2404.15518 null
2024-04-23 The Power of Resets in Online Reinforcement Learning Zakaria Mhammedi et.al. 2404.15417 null
2024-04-23 Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League Environments Mateus G. Machado et.al. 2404.15410 link
2024-04-23 Reinforcement Learning with Adaptive Control Regularization for Safe Control of Critical Systems Haozhe Tian et.al. 2404.15199 null
2024-04-23 Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation Xun Wu et.al. 2404.15100 null
2024-04-23 Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot Neil Guan et.al. 2404.15096 null
2024-04-23 Using deep reinforcement learning to promote sustainable human behaviour on a common pool resource problem Raphael Koster et.al. 2404.15059 null
2024-04-23 Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems Xiaoshuang Chen et.al. 2404.14961 null
2024-04-23 Multi-Objective Deep Reinforcement Learning for 5G Base Station Placement to Support Localisation for Future Sustainable Traffic Ahmed Al-Tahmeesschi et.al. 2404.14954 null
2024-04-23 MultiSTOP: Solving Functional Equations with Reinforcement Learning Alessandro Trenta et.al. 2404.14909 null
2024-04-23 Unitary Synthesis of Clifford+T Circuits with Reinforcement Learning Sebastian Rietsch et.al. 2404.14865 null
2024-04-23 Evolutionary Reinforcement Learning via Cooperative Coevolution Chengpeng Hu et.al. 2404.14763 null
2024-04-23 Rank2Reward: Learning Shaped Reward Functions from Passive Video Daniel Yang et.al. 2404.14735 null
2024-04-23 Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data Fahim Tajwar et.al. 2404.14367 link
2024-04-22 Multi-Agent Hybrid SAC for Joint SS-DSA in CRNs David R. Nickel et.al. 2404.14319 null
2024-04-22 Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories Ning Yang et.al. 2404.14238 null
2024-04-22 Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems Yiyang Zhu et.al. 2404.14092 null
2024-04-22 Mechanistic Interpretability for AI Safety -- A Review Leonard Bereska et.al. 2404.14082 null
2024-04-22 Research on Robot Path Planning Based on Reinforcement Learning Wang Ruiqi et.al. 2404.14077 link
2024-04-22 Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras Mhairi Dunion et.al. 2404.14064 link
2024-04-22 A survey of air combat behavior modeling using machine learning Patrick Ribu Gorton et.al. 2404.13954 null
2024-04-22 Generating Attractive and Authentic Copywriting from Customer Reviews Yu-Xiang Lin et.al. 2404.13906 null
2024-04-22 Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation Xulin Chen et.al. 2404.13879 null
2024-04-19 Mapping Social Choice Theory to RLHF Jessica Dai et.al. 2404.13038 null
2024-04-19 Deep Reinforcement Learning-Based Active Flow Control of an Elliptical Cylinder: Transitioning from an Elliptical Cylinder to a Circular Cylinder and a Flat Plate Wang Jia et.al. 2404.13003 null
2024-04-19 Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning Lisheng Wu et.al. 2404.12999 null
2024-04-19 MM-PhyRLHF: Reinforcement Learning Framework for Multimodal Physics Question-Answering Avinash Anand et.al. 2404.12926 null
2024-04-19 Zero-Shot Stitching in Reinforcement Learning using Relative Representations Antonio Pio Ricciardi et.al. 2404.12917 null
2024-04-19 MAexp: A Generic Platform for RL-based Multi-Agent Exploration Shaohao Zhu et.al. 2404.12824 link
2024-04-19 Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation Qiang He et.al. 2404.12754 link
2024-04-19 Demonstration of quantum projective simulation on a single-photon-based quantum computer Giacomo Franceschetto et.al. 2404.12729 null
2024-04-19 Energy Conserved Failure Detection for NS-IoT Systems Guojin Liu et.al. 2404.12713 null
2024-04-19 Single-Task Continual Offline Reinforcement Learning Sibo Gai et.al. 2404.12639 null
2024-04-18 From $r$ to $Q^*$ : Your Language Model is Secretly a Q-Function Rafael Rafailov et.al. 2404.12358 null
2024-04-18 Improving the interpretability of GNN predictions through conformal-based graph sparsification Pablo Sanchez-Martin et.al. 2404.12356 link
2024-04-18 Practical Considerations for Discrete-Time Implementations of Continuous-Time Control Barrier Function-Based Safety Filters Lukas Brunke et.al. 2404.12329 null
2024-04-18 ASID: Active Exploration for System Identification in Robotic Manipulation Marius Memmel et.al. 2404.12308 null
2024-04-18 Privacy-Preserving UCB Decision Process Verification via zk-SNARKs Xikun Jiang et.al. 2404.12186 null
2024-04-18 Aligning language models with human preferences Tomasz Korbak et.al. 2404.12150 link
2024-04-19 Robust and Adaptive Deep Reinforcement Learning for Enhancing Flow Control around a Square Cylinder with Varying Reynolds Numbers Wang Jia et.al. 2404.12123 null
2024-04-18 X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner Haoyuan Jiang et.al. 2404.12090 link
2024-04-18 Trajectory Planning for Autonomous Vehicle Using Iterative Reward Prediction in Reinforcement Learning Hyunwoo Park et.al. 2404.12079 null
2024-04-18 Exploring the landscape of large language models: Foundations, techniques, and challenges Milad Moradi et.al. 2404.11973 null
2024-04-17 Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Zezhong Fan et.al. 2404.11589 null
2024-04-17 Deep Policy Optimization with Temporal Logic Constraints Ameesh Shah et.al. 2404.11578 null
2024-04-17 VC Theory for Inventory Policies Yaqi Xie et.al. 2404.11509 null
2024-04-17 Learn to Tour: Operator Design For Solution Feasibility Mapping in Pickup-and-delivery Traveling Salesman Problem Bowen Fang et.al. 2404.11458 null
2024-04-17 What-if Analysis Framework for Digital Twins in 6G Wireless Network Management Elif Ak et.al. 2404.11394 null
2024-04-18 Convergence of Policy Gradient for Stochastic Linear-Quadratic Control Problem in Infinite Horizon Xinpei Zhang et.al. 2404.11382 null
2024-04-17 Following the Human Thread in Social Navigation Luca Scofano et.al. 2404.11327 link
2024-04-17 On Learning Parities with Dependent Noise Noah Golowich et.al. 2404.11325 null
2024-04-17 Physics-informed Actor-Critic for Coordination of Virtual Inertia from Power Distribution Systems Simon Stock et.al. 2404.11149 null
2024-04-17 Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs Kang Wang et.al. 2404.11014 null
2024-04-16 Settling Constant Regrets in Linear Markov Decision Processes Weitong Zhang et.al. 2404.10745 null
2024-04-16 N-Agent Ad Hoc Teamwork Caroline Wang et.al. 2404.10740 null
2024-04-16 Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning Hao-Lun Hsu et.al. 2404.10728 null
2024-04-16 Automatic re-calibration of quantum devices by reinforcement learning T. Crosta et.al. 2404.10726 null
2024-04-16 Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study Shusheng Xu et.al. 2404.10719 null
2024-04-16 Simplex Decomposition for Portfolio Allocation Constraints in Reinforcement Learning David Winkel et.al. 2404.10683 null
2024-04-16 SCALE: Self-Correcting Visual Navigation for Mobile Robots via Anti-Novelty Estimation Chang Chen et.al. 2404.10675 null
2024-04-16 Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay Jinmei Liu et.al. 2404.10662 link
2024-04-16 Trajectory Planning using Reinforcement Learning for Interactive Overtaking Maneuvers in Autonomous Racing Scenarios Levent Ögretmen et.al. 2404.10658 null
2024-04-16 Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms Zehao Zhou et.al. 2404.10645 null
2024-04-15 Effective Reinforcement Learning Based on Structural Information Principles Xianghua Zeng et.al. 2404.09760 link
2024-04-15 Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning Linjie Xu et.al. 2404.09715 null
2024-04-15 Learn Your Reference Model for Real Good Alignment Alexey Gorbatovski et.al. 2404.09656 null
2024-04-15 Reliability Estimation of News Media Sources: Birds of a Feather Flock Together Sergio Burdisso et.al. 2404.09565 link
2024-04-15 Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning Tidiane Camaret Ndir et.al. 2404.09521 link
2024-04-14 Egret: Reinforcement Mechanism for Sequential Computation Offloading in Edge Computing Haosong Peng et.al. 2404.09285 null
2024-04-14 A Reinforcement Learning Based Backfilling Strategy for HPC Batch Jobs Elliot Kolker-Hicks et.al. 2404.09264 null
2024-04-14 Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts Jing-Cheng Pang et.al. 2404.09248 null
2024-04-14 Advanced Intelligent Optimization Algorithms for Multi-Objective Optimal Power Flow in Future Power Systems: A Review Yuyan Li et.al. 2404.09203 null
2024-04-14 On Joint Convergence of Traffic State and Weight Vector in Learning-Based Dynamic Routing with Value Function Approximation Yidan Wu et.al. 2404.09188 null
2024-04-14 Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation Ruixin Yang et.al. 2404.09127 link
2024-04-12 Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation Hanlin Tian et.al. 2404.08570 link
2024-04-12 RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs Shreyas Chaudhari et.al. 2404.08555 null
2024-04-12 Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement Lucas Murray et.al. 2404.08523 null
2024-04-12 Prescribing Optimal Health-Aware Operation for Urban Air Mobility with Deep Reinforcement Learning Mina Montazeri et.al. 2404.08497 null
2024-04-12 Dataset Reset Policy Optimization for RLHF Jonathan D. Chang et.al. 2404.08495 link
2024-04-12 Anti-Byzantine Attacks Enabled Vehicle Selection for Asynchronous Federated Learning in Vehicular Edge Computing Cui Zhang et.al. 2404.08444 null
2024-04-12 SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies Maeghal Jain et.al. 2404.08423 null
2024-04-12 TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability Shiwei Lian et.al. 2404.08353 null
2024-04-12 Agile and versatile bipedal robot tracking control through reinforcement learning Jiayi Li et.al. 2404.08246 null
2024-04-12 RLEMMO: Evolutionary Multimodal Optimization Assisted By Deep Reinforcement Learning Hongqiao Lian et.al. 2404.08242 null
2024-04-11 High-Dimension Human Value Representation in Large Language Models Samuel Cahyawijaya et.al. 2404.07900 link
2024-04-11 Data-Driven System Identification of Quadrotors Subject to Motor Delays Jonas Eschmann et.al. 2404.07837 null
2024-04-11 On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning Giuseppe Canonaco et.al. 2404.07826 null
2024-04-11 An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization Minshuo Chen et.al. 2404.07771 null
2024-04-11 Differentially Private Reinforcement Learning with Self-Play Dan Qiao et.al. 2404.07559 null
2024-04-11 Enhancing Policy Gradient with the Polyak Step-Size Adaption Yunxiang Li et.al. 2404.07525 null
2024-04-11 Generative Probabilistic Planning for Optimizing Supply Chain Networks Hyung-il Ahn et.al. 2404.07511 null
2024-04-11 Neural Fault Injection: Generating Software Faults from Natural Language Domenico Cotroneo et.al. 2404.07491 null
2024-04-11 Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains Soichiro Nishimori et.al. 2404.07465 null
2024-04-11 UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning Saichao Liu et.al. 2404.07453 null
2024-04-10 Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery Zohre Karimi et.al. 2404.07185 null
2024-04-10 Adaptive behavior with stable synapses Cristiano Capone et.al. 2404.07150 link
2024-04-10 How Consistent are Clinicians? Evaluating the Predictability of Sepsis Disease Progression with Dynamics Models Unnseo Park et.al. 2404.07148 link
2024-04-10 Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection Linas Nasvytis et.al. 2404.07099 link
2024-04-10 Improving Language Model Reasoning with Self-motivated Learning Yunlong Feng et.al. 2404.07017 null
2024-04-10 Agent-driven Generative Semantic Communication for Remote Surveillance Wanting Yang et.al. 2404.06997 null
2024-04-10 Deep Reinforcement Learning for Mobile Robot Path Planning Hao Liu et.al. 2404.06974 null
2024-04-10 UAV-Assisted Enhanced Coverage and Capacity in Dynamic MU-mMIMO IoT Systems: A Deep Reinforcement Learning Approach MohammadMahdi Ghadaksaz et.al. 2404.06726 null
2024-04-10 Dual Ensemble Kalman Filter for Stochastic Optimal Control Anant A. Joshi et.al. 2404.06696 null
2024-04-09 Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective Victor-Alexandru Darvariu et.al. 2404.06492 null
2024-04-09 Deep Reinforcement Learning-Based Approach for a Single Vehicle Persistent Surveillance Problem with Fuel Constraints Hritik Bana et.al. 2404.06423 null
2024-04-09 The Power in Communication: Power Regularization of Communication for Autonomy in Cooperative Multi-Agent Reinforcement Learning Nancirose Piazza et.al. 2404.06387 null
2024-04-09 Policy-Guided Diffusion Matthew Thomas Jackson et.al. 2404.06356 link
2024-04-09 Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning Yanjie Li et.al. 2404.06330 null
2024-04-09 Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning Xudong Yu et.al. 2404.06188 null
2024-04-09 A quantum information theoretic analysis of reinforcement learning-assisted quantum architecture search Abhishek Sadhu et.al. 2404.06174 null
2024-04-09 Adaptable Recovery Behaviors in Robotics: A Behavior Trees and Motion Generators(BTMG) Approach for Failure Management Faseeh Ahmad et.al. 2404.06129 null
2024-04-09 Automatic Configuration Tuning on Cloud Database: A Survey Limeng Zhang et.al. 2404.06043 null
2024-04-09 Commute with Community: Enhancing Shared Travel through Social Networks Tian Siyuan et.al. 2404.05987 null
2024-04-08 Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer Xinyang Gu et.al. 2404.05695 null
2024-04-08 YaART: Yet Another ART Rendering Technology Sergey Kastryulin et.al. 2404.05666 null
2024-04-08 Dynamic Backtracking in GFlowNet: Enhancing Decision Steps with Reward-Dependent Adjustment Mechanisms Shuai Guo et.al. 2404.05576 null
2024-04-08 Optimal Flow Admission Control in Edge Computing via Safe Reinforcement Learning A. Fox et.al. 2404.05564 null
2024-04-08 Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data Tim Baumgärtner et.al. 2404.05530 null
2024-04-08 CNN-based Game State Detection for a Foosball Table David Hagens et.al. 2404.05357 null
2024-04-08 Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models Yutao Ouyang et.al. 2404.05291 null
2024-04-08 MeSA-DRL: Memory-Enhanced Deep Reinforcement Learning for Advanced Socially Aware Robot Navigation in Crowded Environments Mannan Saeed Muhammad et.al. 2404.05203 null
2024-04-08 Decision Transformer for Wireless Communications: A New Paradigm of Resource Management Jie Zhang et.al. 2404.05199 null
2024-04-07 On the Uniqueness of Solution for the Bellman Equation of LTL Objectives Zetong Xuan et.al. 2404.05074 null
2024-04-05 Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution Tim Seyde et.al. 2404.04253 null
2024-04-05 Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation Lanpei Li et.al. 2404.04219 link
2024-04-05 Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology Gaith Rjoub et.al. 2404.04205 null
2024-04-05 Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report Jerrod Wigmore et.al. 2404.04106 null
2024-04-05 Dynamic Prompt Optimizing for Text-to-Image Generation Wenyi Mo et.al. 2404.04095 link
2024-04-05 Demonstration Guided Multi-Objective Reinforcement Learning Junlin Lu et.al. 2404.03997 null
2024-04-05 A proximal policy optimization based intelligent home solar management Kode Creer et.al. 2404.03888 null
2024-04-05 Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration Xudong Guo et.al. 2404.03869 null
2024-04-04 Exploration is Harder than Prediction: Cryptographically Separating Reinforcement Learning from Supervised Learning Noah Golowich et.al. 2404.03774 null
2024-04-04 A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Chunxiao Li et.al. 2404.03753 null
2024-04-04 AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Hanyu Lai et.al. 2404.03648 link
2024-04-04 Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention Ziru Liu et.al. 2404.03637 link
2024-04-04 Laser Learning Environment: A new environment for coordination-critical multi-agent tasks Yannick Molinghen et.al. 2404.03596 link
2024-04-04 Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm Miao Lu et.al. 2404.03578 null
2024-04-04 AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale Adam Pardyl et.al. 2404.03482 link
2024-04-04 Integrating Hyperparameter Search into GramML Hernán Ceferino Vázquez et.al. 2404.03419 link
2024-04-04 Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Jooyoung Lee et.al. 2404.03414 null
2024-04-04 Elementary Analysis of Policy Gradient Methods Jiacai Liu et.al. 2404.03372 null
2024-04-04 REACT: Revealing Evolutionary Action Consequence Trajectories for Interpretable Reinforcement Learning Philipp Altmann et.al. 2404.03359 null
2024-04-04 Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation Asad Ali Shahid et.al. 2404.03336 null
2024-04-03 Learning Quadrupedal Locomotion via Differentiable Simulation Clemens Schwarke et.al. 2404.02887 null
2024-04-03 Unsupervised Learning of Effective Actions in Robotics Marko Zaric et.al. 2404.02728 link
2024-04-03 Reinforcement Learning in Categorical Cybernetics Jules Hedges et.al. 2404.02688 null
2024-04-03 Solving a Real-World Optimization Problem Using Proximal Policy Optimization with Curriculum Learning and Reward Engineering Abhijeet Pendyala et.al. 2404.02577 null
2024-04-03 SliceIt! -- A Dual Simulator Framework for Learning Robot Food Slicing Cristian C. Beltran-Hernandez et.al. 2404.02569 link
2024-04-03 Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning Yi Shen et.al. 2404.02545 link
2024-04-03 Joint Optimization on Uplink OFDMA and MU-MIMO for IEEE 802.11ax: Deep Hierarchical Reinforcement Learning Approach Hyeonho Noh et.al. 2404.02486 null
2024-04-03 Deep Reinforcement Learning for Traveling Purchaser Problems Haofeng Yuan et.al. 2404.02476 null
2024-04-03 Electric Vehicle Routing Problem for Emergency Power Supply: Towards Telecom Base Station Relief Daisuke Kikuta et.al. 2404.02448 link
2024-04-03 AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset Dongsu Lee et.al. 2404.02429 null
2024-04-02 Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL Golnaz Mesbahi et.al. 2404.02113 null
2024-04-02 Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning Samuel Tovey et.al. 2404.01999 null
2024-04-02 VLRM: Vision-Language Models act as Reward Models for Image Captioning Maksim Dzabraev et.al. 2404.01911 null
2024-04-02 Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation Carlos Plou et.al. 2404.01867 null
2024-04-02 Keeping Behavioral Programs Alive: Specifying and Executing Liveness Requirements Tom Yaacov et.al. 2404.01858 null
2024-04-02 EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking Stavros Orfanoudakis et.al. 2404.01849 link
2024-04-02 Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy Kyungbok Lee et.al. 2404.01830 null
2024-04-02 Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid Eric MSP Veith et.al. 2404.01794 null
2024-04-02 Unifying Qualitative and Quantitative Safety Verification of DNN-Controlled Systems Dapeng Zhi et.al. 2404.01769 null
2024-04-02 Asymptotics of Language Model Alignment Joy Qiping Yang et.al. 2404.01730 null
2024-03-29 Learning Visual Quadrupedal Loco-Manipulation from Demonstrations Zhengmao He et.al. 2403.20328 null
2024-03-29 Active flow control of a turbulent separation bubble through deep reinforcement learning Bernat Font et.al. 2403.20295 link
2024-03-29 Functional Bilevel Optimization for Machine Learning Ieva Petrulionyte et.al. 2403.20233 null
2024-03-29 Decentralized Multimedia Data Sharing in IoV: A Learning-based Equilibrium of Supply and Demand Jiani Fan et.al. 2403.20218 null
2024-03-29 Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning Duzhen Zhang et.al. 2403.20163 null
2024-03-29 CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening Hei Yi Mak et.al. 2403.20156 link
2024-03-29 A Learning-based Incentive Mechanism for Mobile AIGC Service in Decentralized Internet of Vehicles Jiani Fan et.al. 2403.20151 null
2024-03-29 Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-directed Molecular Generation Jinyeong Park et.al. 2403.20109 link
2024-03-29 Reinforcement learning for graph theory, II. Small Ramsey numbers Mohammad Ghebleh et.al. 2403.20055 link
2024-03-29 Nonparametric Bellman Mappings for Reinforcement Learning: Application to Robust Adaptive Filtering Yuki Akiyama et.al. 2403.20020 null
2024-03-28 Human-compatible driving partners through data-regularized self-play reinforcement learning Daphne Cornelisse et.al. 2403.19648 link
2024-03-28 Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment Alireza Ganjdanesh et.al. 2403.19490 null
2024-03-28 Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization Teodor V. Marinov et.al. 2403.19462 null
2024-03-28 EDA-Driven Preprocessing for SAT Solving Zhengyuan Shi et.al. 2403.19446 null
2024-03-28 Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model Qi Gou et.al. 2403.19443 null
2024-03-28 Fine-Tuning Language Models with Reward Learning on Policy Hao Lang et.al. 2403.19279 link
2024-03-28 Removing the need for ground truth UWB data collection: self-supervised ranging error correction using deep reinforcement learning Dieter Coppens et.al. 2403.19262 null
2024-03-28 Inferring Latent Temporal Sparse Coordination Graph for Multi-Agent Reinforcement Learning Wei Duan et.al. 2403.19253 link
2024-03-28 Disentangling Length from Quality in Direct Preference Optimization Ryan Park et.al. 2403.19159 null
2024-03-27 GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning Hsin-Jung Yang et.al. 2403.19062 null
2024-03-27 Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment Li Siyao et.al. 2403.18811 null
2024-03-27 CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning Elliot Chane-Sane et.al. 2403.18765 null
2024-03-27 Probabilistic Model Checking of Stochastic Reinforcement Learning Policies Dennis Gross et.al. 2403.18725 null
2024-03-28 FPGA-Based Neural Thrust Controller for UAVs Sharif Azem et.al. 2403.18703 null
2024-03-27 Safe and Robust Reinforcement-Learning: Principles and Practice Taku Yamagata et.al. 2403.18539 null
2024-03-27 Bridging the Gap: Regularized Reinforcement Learning for Improved Classical Motion Planning with Safety Modules Elias Goldsztejn et.al. 2403.18524 null
2024-03-27 VersaT2I: Improving Text-to-Image Models with Versatile Reward Jianshu Guo et.al. 2403.18493 null
2024-03-27 Scaling Vision-and-Language Navigation With Offline RL Valay Bundele et.al. 2403.18454 null
2024-03-27 FRESCO: Federated Reinforcement Energy System for Cooperative Optimization Nicolas Mauricio Cuadrado et.al. 2403.18444 null
2024-03-27 Reinforcement learning for graph theory, I. Reimplementation of Wagner's approach Salem Al-Yakoob et.al. 2403.18429 link
2024-03-26 TractOracle: towards an anatomically-informed reward function for RL-based tractography Antoine Théberge et.al. 2403.17845 link
2024-03-26 Learning the Optimal Power Flow: Environment Design Matters Thomas Wolgast et.al. 2403.17831 link
2024-03-26 Depending on yourself when you should: Mentoring LLM with RL agents to become the master in cybersecurity games Yikuan Yan et.al. 2403.17674 null
2024-03-26 Learning Goal-Directed Object Pushing in Cluttered Scenes with Location-Based Attention Nils Dengler et.al. 2403.17667 null
2024-03-26 Uncertainty-aware Distributional Offline Reinforcement Learning Xiaocong Chen et.al. 2403.17646 null
2024-03-26 PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning Frederico Metelo et.al. 2403.17637 link
2024-03-26 Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems Siyu Wang et.al. 2403.17634 null
2024-03-26 Towards a Zero-Data, Controllable, Adaptive Dialog System Dirk Väth et.al. 2403.17582 null
2024-03-26 VDSC: Enhancing Exploration Timing with Value Discrepancy and State Counts Marius Captari et.al. 2403.17542 null
2024-03-26 BVR Gym: A Reinforcement Learning Environment for Beyond-Visual-Range Air Combat Edvards Scukins et.al. 2403.17533 link
2024-03-25 An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems Hanqing Yang et.al. 2403.16809 link
2024-03-25 Enhancing Software Effort Estimation through Reinforcement Learning-based Project Management-Oriented Feature Selection Haoyang Chen et.al. 2403.16749 null
2024-03-25 Deep Reinforcement Learning and Mean-Variance Strategies for Responsible Portfolio Optimization Fernando Acero et.al. 2403.16667 null
2024-03-25 Skill Q-Network: Learning Adaptive Skill Ensemble for Mapless Navigation in Unknown Environments Hyunki Seong et.al. 2403.16664 null
2024-03-25 Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL Osama Ahmad et.al. 2403.16652 null
2024-03-26 CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment Feiteng Fang et.al. 2403.16649 link
2024-03-25 Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot Zifan Wang et.al. 2403.16535 link
2024-03-25 Towards Cooperative Maneuver Planning in Mixed Traffic at Urban Intersections Marvin Klimke et.al. 2403.16478 null
2024-03-25 If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions Reza Esfandiarpoor et.al. 2403.16442 link
2024-03-25 Physics-informed RL for Maximal Safety Probability Estimation Hikaru Hoshino et.al. 2403.16391 link
2024-03-25 Learning Action-based Representations Using Invariance Max Rudolph et.al. 2403.16369 null
2024-03-24 Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly Network Yao Kang et.al. 2403.16301 null
2024-03-22 Can large language models explore in-context? Akshay Krishnamurthy et.al. 2403.15371 null
2024-03-22 Planning with a Learned Policy Basis to Optimally Solve Complex Tasks Guillermo Infante et.al. 2403.15301 null
2024-03-22 Blockchain-based Pseudonym Management for Vehicle Twin Migrations in Vehicular Edge Metaverse Jiawen Kang et.al. 2403.15285 null
2024-03-22 Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies Nicolò Botteghi et.al. 2403.15267 null
2024-03-22 Self-Improvement for Neural Combinatorial Optimization: Sample without Replacement, but Improvement Jonathan Pirnay et.al. 2403.15180 link
2024-03-22 Subequivariant Reinforcement Learning Framework for Coordinated Motion Control Haoyu Wang et.al. 2403.15100 null
2024-03-22 Improved Long Short-Term Memory-based Wastewater Treatment Simulators for Deep Reinforcement Learning Esmaeel Mohammadi et.al. 2403.15091 null
2024-03-22 Automated Feature Selection for Inverse Reinforcement Learning Daulet Baimukashev et.al. 2403.15079 null
2024-03-22 Testing for Fault Diversity in Reinforcement Learning Quentin Mazouni et.al. 2403.15065 link
2024-03-22 Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation Zhenrui Yue et.al. 2403.14952 null
2024-03-21 Rethinking Adversarial Inverse Reinforcement Learning: From the Angles of Policy Imitation and Transferable Reward Recovery Yangchun Zhang et.al. 2403.14593 link
2024-03-21 A Mathematical Introduction to Deep Reinforcement Learning for 5G/6G Applications Farhad Rezazadeh et.al. 2403.14516 null
2024-03-21 Constrained Reinforcement Learning with Smoothed Log Barrier Function Baohe Zhang et.al. 2403.14508 null
2024-03-21 On the continuity and smoothness of the value function in reinforcement learning and optimal control Hans Harder et.al. 2403.14432 null
2024-03-21 Emergent communication and learning pressures in language models: a language evolution perspective Lukas Galke et.al. 2403.14427 null
2024-03-21 Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization Daniel Mayfrank et.al. 2403.14425 null
2024-03-21 A reinforcement learning guided hybrid evolutionary algorithm for the latency location routing problem Yuji Zou et.al. 2403.14405 link
2024-03-21 Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression Fernando Acero et.al. 2403.14328 null
2024-03-21 Reactor Optimization Benchmark by Reinforcement Learning Deborah Schwarcz et.al. 2403.14273 link
2024-03-21 Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection Kyungjae Lee et.al. 2403.14238 null
2024-03-20 Towards Principled Representation Learning from Videos for Reinforcement Learning Dipendra Misra et.al. 2403.13765 link
2024-03-20 Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study Luca Giamattei et.al. 2403.13729 null
2024-03-20 Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections Zengqi Peng et.al. 2403.13674 null
2024-03-20 Multi-agent Reinforcement Traffic Signal Control based on Interpretable Influence Mechanism and Biased ReLU Approximation Zhiyue Luo et.al. 2403.13639 null
2024-03-20 Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation Do June Min et.al. 2403.13578 link
2024-03-20 GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot Wenxuan Song et.al. 2403.13358 null
2024-03-20 Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks Shaunak A. Mehta et.al. 2403.13281 link
2024-03-20 Federated reinforcement learning for robot motion planning with zero-shot generalization Zhenyuan Yuan et.al. 2403.13245 null
2024-03-20 Graph Attention Network-based Block Propagation with Optimal AoI and Reputation in Web 3.0 Jiana Liao et.al. 2403.13237 null
2024-03-20 Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network Jiarong Fan et.al. 2403.13236 null
2024-03-19 Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes He Wang et.al. 2403.12946 null
2024-03-19 HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning Fucai Ke et.al. 2403.12884 null
2024-03-19 Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning Mirco Theile et.al. 2403.12856 null
2024-03-20 Policy Bifurcation in Safe Reinforcement Learning Wenjun Zou et.al. 2403.12847 link
2024-03-19 Oriented and Non-oriented Cubical Surfaces in The Penteract Manuel Estevez et.al. 2403.12825 null
2024-03-19 Automated Contrastive Learning Strategy Search for Time Series Baoyu Jing et.al. 2403.12641 null
2024-03-19 FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting Clément Gaspard et.al. 2403.12589 null
2024-03-19 INSIGHT: End-to-End Neuro-Symbolic Visual Reinforcement Learning with Language Explanations Lirui Luo et.al. 2403.12451 link
2024-03-19 Bin Packing Optimization via Deep Reinforcement Learning Baoying Wang et.al. 2403.12420 null
2024-03-19 Understanding Training-free Diffusion Guidance: Mechanisms and Limitations Yifei Shen et.al. 2403.12404 link
2024-03-18 The Value of Reward Lookahead in Reinforcement Learning Nadav Merlis et.al. 2403.11637 null
2024-03-18 Offline Multitask Representation Learning for Reinforcement Learning Haque Ishfaq et.al. 2403.11574 null
2024-03-18 Reinforcement Learning with Token-level Feedback for Controllable Text Generation Wendi Li et.al. 2403.11558 link
2024-03-18 TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling Weiran Chen et.al. 2403.11550 null
2024-03-18 State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards Yuto Tanimoto et.al. 2403.11520 link
2024-03-18 Demystifying Deep Reinforcement Learning-Based Autonomous Vehicle Decision-Making Hanxi Wan et.al. 2403.11432 null
2024-03-18 Variational Sampling of Temporal Trajectories Jurijs Nazarovs et.al. 2403.11418 null
2024-03-17 Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective Muhammad Aneeq uz Zaman et.al. 2403.11345 null
2024-03-17 Causality from Bottom to Top: A Survey Abraham Itzhak Weinberg et.al. 2403.11219 null
2024-03-17 Continuous Jumping of a Parallel Wire-Driven Monopedal Robot RAMIEL Using Reinforcement Learning Kento Kawaharazuka et.al. 2403.11205 null
2024-03-15 HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation Carmelo Sferrazza et.al. 2403.10506 null
2024-03-15 Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness Aidan Curtis et.al. 2403.10454 null
2024-03-15 Regret Minimization via Saddle Point Optimization Johannes Kirschner et.al. 2403.10379 null
2024-03-15 Cooperative Jamming for Physical Layer Security Enhancement Using Deep Reinforcement Learning Sayed Amir Hoseini et.al. 2403.10342 null
2024-03-15 Application of machine learning to experimental design in quantum mechanics Federico Belliardo et.al. 2403.10317 null
2024-03-15 Offline Goal-Conditioned Reinforcement Learning for Shape Control of Deformable Linear Objects Rita Laezza et.al. 2403.10290 null
2024-03-15 Grasp Anything: Combining Teacher-Augmented Policy Gradient Learning with Instance Segmentation to Grasp Arbitrary Objects Malte Mosbach et.al. 2403.10187 null
2024-03-15 Online Policy Learning from Offline Preferences Guoxi Zhang et.al. 2403.10160 null
2024-03-15 Belief Aided Navigation using Bayesian Reinforcement Learning for Avoiding Humans in Blind Spots Jinyeob Kim et.al. 2403.10105 link
2024-03-15 Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF Amey Hengle et.al. 2403.10088 null
2024-03-14 Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning Zhishuai Liu et.al. 2403.09621 null
2024-03-15 ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models Runyu Ma et.al. 2403.09583 null
2024-03-14 A Reinforcement Learning Approach to Dairy Farm Battery Management using Q Learning Nawazish Ali et.al. 2403.09499 null
2024-03-14 Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Zhiqing Sun et.al. 2403.09472 link
2024-03-14 A Deep Reinforcement Learning Approach for Autonomous Reconfigurable Intelligent Surfaces Hyuckjin Choi et.al. 2403.09270 null
2024-03-14 Leveraging Constraint Programming in a Deep Learning Approach for Dynamically Solving the Flexible Job-Shop Scheduling Problem Imanol Echeverria et.al. 2403.09249 null
2024-03-14 Rumor Mitigation in Social Media Platforms with Deep Reinforcement Learning Hongyuan Su et.al. 2403.09217 link
2024-03-14 MetroGNN: Metro Network Expansion with Reinforcement Learning Hongyuan Su et.al. 2403.09197 link
2024-03-14 SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning Nicholas Zolman et.al. 2403.09110 link
2024-03-14 CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language Models to Coding Preferences Martin Weyssow et.al. 2403.09032 link
2024-03-13 TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning Shangding Gu et.al. 2403.08694 null
2024-03-13 Digital Twin-assisted Reinforcement Learning for Resource-aware Microservice Offloading in Edge Computing Xiangchun Chen et.al. 2403.08687 null
2024-03-13 Meta Reinforcement Learning for Resource Allocation in Aerial Active-RIS-assisted Networks with Rate-Splitting Multiple Access Sajad Faramarzi et.al. 2403.08648 null
2024-03-13 Human Alignment of Large Language Models through Online Preference Optimisation Daniele Calandriello et.al. 2403.08635 null
2024-03-13 Specification Overfitting in Artificial Intelligence Benjamin Roth et.al. 2403.08425 null
2024-03-13 Optimizing Risk-averse Human-AI Hybrid Teams Andrew Fuchs et.al. 2403.08386 null
2024-03-13 Learning to Describe for Predicting Zero-shot Drug-Drug Interactions Fangqi Zhu et.al. 2403.08377 link
2024-03-13 LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments Maonan Wang et.al. 2403.08337 link
2024-03-14 HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback Ang Li et.al. 2403.08309 null
2024-03-13 SpaceOctopus: An Octopus-inspired Motion Planning Framework for Multi-arm Space Robot Wenbo Zhao et.al. 2403.08219 null
2024-03-12 Exploring Safety Generalization Challenges of Large Language Models via Code Qibing Ren et.al. 2403.07865 link
2024-03-12 Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards Wei Shen et.al. 2403.07708 null
2024-03-12 Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning Motoki Omura et.al. 2403.07704 null
2024-03-12 Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation Michael Ogezi et.al. 2403.07605 null
2024-03-12 An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning Weiwei Gu et.al. 2403.07566 null
2024-03-12 Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding Huijie Tang et.al. 2403.07559 link
2024-03-12 Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement Learning Approach Shuchang Yan et.al. 2403.07503 null
2024-03-12 Optimization of Pressure Management Strategies for Geological CO2 Sequestration Using Surrogate Model-based Reinforcement Learning Jungang Chen et.al. 2403.07360 link
2024-03-12 Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer Dipesh Tamboli et.al. 2403.07309 link
2024-03-12 Advantage-Aware Policy Optimization for Offline Reinforcement Learning Yunpeng Qing et.al. 2403.07262 null
2024-03-11 Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts Onur Celik et.al. 2403.06966 null
2024-03-11 Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning Junseok Park et.al. 2403.06880 null
2024-03-11 Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification Joar Skalse et.al. 2403.06854 null
2024-03-11 In-context Exploration-Exploitation for Reinforcement Learning Zhenwen Dai et.al. 2403.06826 null
2024-03-11 ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment Hao-Lun Hsu et.al. 2403.06814 null
2024-03-11 From Factor Models to Deep Learning: Machine Learning in Reshaping Empirical Asset Pricing Junyi Ye et.al. 2403.06779 null
2024-03-11 ALaRM: Align Language Models via Hierarchical Rewards Modeling Yuhang Lai et.al. 2403.06754 link
2024-03-11 Generalising Multi-Agent Cooperation through Task-Agnostic Communication Dulhan Jayalath et.al. 2403.06750 link
2024-03-11 Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback Adarsh N L et.al. 2403.06735 null
2024-03-11 Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning Zijian Zhou et.al. 2403.06728 null
2024-03-08 Will GPT-4 Run DOOM? Adrian de Wynter et.al. 2403.05468 null
2024-03-08 Switching the Loss Reduces the Cost in Batch Reinforcement Learning Alex Ayoub et.al. 2403.05385 null
2024-03-08 Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation Xiaoying Zhang et.al. 2403.05171 null
2024-03-08 Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem Ceyao Zhang et.al. 2403.05149 null
2024-03-08 ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models Jun Xu et.al. 2403.05132 null
2024-03-08 RLPeri: Accelerating Visual Perimetry Test with Reinforcement Learning and Convolutional Feature Extraction Tanvi Verma et.al. 2403.05112 null
2024-03-08 Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection Jared M. Ping et.al. 2403.05106 null
2024-03-08 Reset & Distill: A Recipe for Overcoming Negative Transfer in Continual Reinforcement Learning Hongjoon Ahn et.al. 2403.05066 null
2024-03-08 Aligning Large Language Models for Controllable Recommendations Wensheng Lu et.al. 2403.05063 null
2024-03-08 Provable Multi-Party Reinforcement Learning with Diverse Human Feedback Huiying Zhong et.al. 2403.05006 null
2024-03-07 Teaching Large Language Models to Reason with Reinforcement Learning Alex Havrilla et.al. 2403.04642 null
2024-03-07 Zero-shot cross-modal transfer of Reinforcement Learning policies through a Global Workspace Léopold Maytié et.al. 2403.04588 null
2024-03-07 Learning Agility Adaptation for Flight in Clutter Guangyu Zhao et.al. 2403.04586 null
2024-03-07 Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition Long-Fei Li et.al. 2403.04568 null
2024-03-07 Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation Fabian Otto et.al. 2403.04453 null
2024-03-07 Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation Tairan He et.al. 2403.04436 null
2024-03-07 iTRPL: An Intelligent and Trusted RPL Protocol based on Multi-Agent Reinforcement Learning Debasmita Dey et.al. 2403.04416 null
2024-03-07 Model-free $H_{\infty}$ control of Itô stochastic system via off-policy reinforcement learning Jing Guo Jing Guo et.al. 2403.04412 null
2024-03-07 Model-Free Load Frequency Control of Nonlinear Power Systems Based on Deep Reinforcement Learning Xiaodi Chen et.al. 2403.04374 null
2024-03-07 Symmetry Considerations for Learning Task Symmetric Robot Policies Mayank Mittal et.al. 2403.04359 null
2024-03-06 Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Jesse Farebrother et.al. 2403.03950 null
2024-03-06 Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation Marcel Torne et.al. 2403.03949 null
2024-03-06 Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning Zifan Xu et.al. 2403.03848 null
2024-03-06 A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation Di Zhang et.al. 2403.03643 null
2024-03-06 Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem Yuhong Sun et.al. 2403.03558 link
2024-03-06 Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning Zida Wu et.al. 2403.03552 null
2024-03-05 RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging Jordan Poots et.al. 2403.03359 null
2024-03-05 Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination Liangzhou Wang et.al. 2403.03172 null
2024-03-05 Leveraging Federated Learning and Edge Computing for Recommendation Systems within Cloud Computing Networks Yaqian Qi et.al. 2403.03165 null
2024-03-05 Language Guided Exploration for RL Agents in Text Environments Hitesh Golchha et.al. 2403.03141 null
2024-03-05 SplAgger: Split Aggregation for Meta-Reinforcement Learning Jacob Beck et.al. 2403.03020 link
2024-03-05 Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization Yuan Lin et.al. 2403.02882 null
2024-03-05 SpaceHopper: A Small-Scale Legged Robot for Exploring Low-Gravity Celestial Bodies Alexander Spiridonov et.al. 2403.02831 null
2024-03-05 A Zero-Shot Reinforcement Learning Strategy for Autonomous Guidewire Navigation Valentina Scarponi et.al. 2403.02777 null
2024-03-05 Fighting Game Adaptive Background Music for Improved Gameplay Ibrahim Khan et.al. 2403.02701 null
2024-03-05 PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement Learning Ke Zhang et.al. 2403.02635 link
2024-03-04 DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation Xueqing Wu et.al. 2403.02528 link
2024-03-02 Improving the Validity of Automatically Generated Feedback via Reinforcement Learning Alexander Scarlatos et.al. 2403.01304 link
2024-03-02 Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey Hamza Kheddar et.al. 2403.01255 null
2024-03-02 Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding Ha-Thanh Nguyen et.al. 2403.01185 null
2024-03-02 Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning Hyungho Na et.al. 2403.01112 link
2024-03-02 Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) Noah Ford et.al. 2403.01059 null
2024-03-01 A Holistic Power Optimization Approach for Microgrid Control Based on Deep Reinforcement Learning Fulong Yao et.al. 2403.01013 link
2024-03-01 Policy Optimization for PDE Control with a Warm Start Xiangyuan Zhang et.al. 2403.01005 null
2024-03-01 On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games Awni Altabaa et.al. 2403.00993 null
2024-03-01 SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation Noriaki Hirose et.al. 2403.00991 null
2024-03-01 Scale-free Adversarial Reinforcement Learning Mingyu Chen et.al. 2403.00930 null
2024-02-29 Curiosity-driven Red-teaming for Large Language Models Zhang-Wei Hong et.al. 2402.19464 link
2024-02-29 ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Yifei Zhou et.al. 2402.19446 link
2024-02-29 Understanding Iterative Combinatorial Auction Designs via Multi-Agent Reinforcement Learning Greg d'Eon et.al. 2402.19420 null
2024-02-29 RL-GPT: Integrating Reinforcement Learning and Code-as-policy Shaoteng Liu et.al. 2402.19299 null
2024-02-29 StiefelGen: A Simple, Model Agnostic Approach for Time Series Data Augmentation over Riemannian Manifolds Prasad Cheema et.al. 2402.19287 null
2024-02-29 Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning Jingxuan Yang et.al. 2402.19275 null
2024-02-29 Deep Reinforcement Learning: A Convex Optimization Approach Ather Gattami et.al. 2402.19212 null
2024-02-29 ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration Angelo Caregnato-Neto et.al. 2402.19128 null
2024-02-29 Temporal-Aware Deep Reinforcement Learning for Energy Storage Bidding in Energy and Contingency Reserve Markets Jinhao Li et.al. 2402.19110 null
2024-02-29 How to Train your Antivirus: RL-based Hardening through the Problem-Space Jacopo Cortellazzi et.al. 2402.19027 null
2024-02-28 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang et.al. 2402.18571 link
2024-02-28 Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks Benjamin David Evans et.al. 2402.18558 link
2024-02-28 Human-Centric Aware UAV Trajectory Planning in Search and Rescue Missions Employing Multi-Objective Reinforcement Learning with AHP and Similarity-Based Experience Replay Mahya Ramezani et.al. 2402.18487 null
2024-02-28 FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist Wentao Zhang et.al. 2402.18485 null
2024-02-28 Implementing Online Reinforcement Learning with Clustering Neural Networks James E. Smith et.al. 2402.18472 null
2024-02-28 Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning Jin Hwa Lee et.al. 2402.18361 null
2024-02-28 Solving Multi-Entity Robotic Problems Using Permutation Invariant Neural Networks Tianxu An et.al. 2402.18345 null
2024-02-28 Whole-body Humanoid Robot Locomotion with Human Reference Qiang Zhang et.al. 2402.18294 null
2024-02-28 Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization Shuo Yang et.al. 2402.18284 null
2024-02-28 Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment Joachim Grimstad et.al. 2402.18246 null
2024-02-27 Quantum Circuit Discovery for Fault-Tolerant Logical State Preparation with Reinforcement Learning Remmy Zen et.al. 2402.17761 link
2024-02-27 Learning to Program Variational Quantum Circuits with Fast Weights Samuel Yen-Chi Chen et.al. 2402.17760 null
2024-02-27 When Your AI Deceives You: Challenges with Partial Observability of Human Evaluators in Reward Learning Leon Lang et.al. 2402.17747 null
2024-02-27 reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use Susobhan Ghosh et.al. 2402.17739 link
2024-02-27 Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-minimum Phase Systems Fatemeh Tavakkoli et.al. 2402.17703 null
2024-02-27 Multi-Agent Deep Reinforcement Learning for Distributed Satellite Routing Federico Lozano-Cuadra et.al. 2402.17666 null
2024-02-27 Emergency Caching: Coded Caching-based Reliable Map Transmission in Emergency Networks Zeyu Tian et.al. 2402.17550 null
2024-02-27 Intensive Care as One Big Sequence Modeling Problem Vadim Liventsev et.al. 2402.17501 link
2024-02-27 Reinforced In-Context Black-Box Optimization Lei Song et.al. 2402.17423 link
2024-02-27 Beacon, a lightweight deep reinforcement learning benchmark library for flow control Jonathan Viquerat et.al. 2402.17402 link
2024-02-26 Q-FOX Learning: Breaking Tradition in Reinforcement Learning Mahmood Alqaseer et.al. 2402.16562 null
2024-02-26 Model-based deep reinforcement learning for accelerated learning from flow simulations Andre Weiner et.al. 2402.16543 link
2024-02-26 Discovering Artificial Viscosity Models for Discontinuous Galerkin Approximation of Conservation Laws using Physics-Informed Machine Learning Matteo Caldana et.al. 2402.16517 null
2024-02-26 AI-enabled STAR-RIS aided MISO ISAC Secure Communications Zhengyu Zhu et.al. 2402.16413 null
2024-02-26 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359 null
2024-02-26 C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory Tianjiao Luo et.al. 2402.16349 null
2024-02-26 Achieving $\tilde{O}(1/ε)$ Sample Complexity for Constrained Markov Decision Process Jiashuo Jiang et.al. 2402.16324 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302 link
2024-02-25 How Can LLM Guide RL? A Value-Based Approach Shenao Zhang et.al. 2402.16181 link
2024-02-25 GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction Xiao Chen et.al. 2402.16174 null
2024-02-25 Citation-Enhanced Generation for LLM-based Chatbot Weitao Li et.al. 2402.16063 null
2024-02-25 LLMs with Chain-of-Thought Are Non-Causal Reasoners Guangsheng Bao et.al. 2402.16048 link
2024-02-25 Harnessing the Synergy between Pushing, Grasping, and Throwing to Enhance Object Manipulation in Cluttered Scenarios Hamidreza Kasaei et.al. 2402.16045 null
2024-02-23 Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization Swaroop Nath et.al. 2402.15473 link
2024-02-23 PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning Simon Holk et.al. 2402.15420 null
2024-02-23 Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation Zhishuai Liu et.al. 2402.15399 link
2024-02-23 Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms Filippo Lazzati et.al. 2402.15392 null
2024-02-23 Shapley Value Based Multi-Agent Reinforcement Learning: Theory, Method and Its Application to Energy Network Jianhong Wang et.al. 2402.15324 null
2024-02-23 When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination Martin Benfeghoul et.al. 2402.15283 null
2024-02-23 Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization Homayoun Honari et.al. 2402.15197 null
2024-02-23 EasyRL4Rec: A User-Friendly Code Library for Reinforcement Learning Based Recommender Systems Yuanqing Yu et.al. 2402.15164 link
2024-02-23 Spatially-Aware Transformer Memory for Embodied Agents Junmo Cho et.al. 2402.15160 link
2024-02-23 Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding Haoming Li et.al. 2402.15102 null
2024-02-22 Generalizing Reward Modeling for Out-of-Distribution Preference Learning Chen Jia et.al. 2402.14760 link
2024-02-22 SHM-Traffic: DRL and Transfer learning based UAV Control for Structural Health Monitoring of Bridges with Traffic Divija Swetha Gadiraju et.al. 2402.14757 null
2024-02-22 Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Arash Ahmadian et.al. 2402.14740 null
2024-02-22 Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning Jinyeob Kim et.al. 2402.14569 link
2024-02-22 MR-ARL: Model Reference Adaptive Reinforcement Learning for Robustly Stable On-Policy Data-Driven LQR Marco Borghesi et.al. 2402.14483 null
2024-02-22 Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems Christina Schenk et.al. 2402.14446 null
2024-02-22 Quantum Circuit Optimization with AlphaTensor Francisco J. R. Ruiz et.al. 2402.14396 null
2024-02-22 Optimal Mechanism in a Dynamic Stochastic Knapsack Environment Jihyeok Jung et.al. 2402.14269 null
2024-02-22 MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint Xinglin Zhou et.al. 2402.14244 null
2024-02-22 Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning Peng Gao et.al. 2402.14236 null
2024-02-21 Generating Realistic Arm Movements in Reinforcement Learning: A Quantitative Comparison of Reward Terms and Task Requirements Jhon Charaja et.al. 2402.13949 null
2024-02-21 AttackGNN: Red-Teaming GNNs in Hardware Security Using Reinforcement Learning Vasudev Gohil et.al. 2402.13946 null
2024-02-21 Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning Antoine Chaffin et.al. 2402.13936 link
2024-02-21 Enhancing Reinforcement Learning Agents with Local Guides Paul Daoudi et.al. 2402.13930 link
2024-02-21 Dealing with unbounded gradients in stochastic saddle-point optimization Gergely Neu et.al. 2402.13903 null
2024-02-21 Synthesis of Hierarchical Controllers Based on Deep Reinforcement Learning Policies Florent Delgrange et.al. 2402.13785 null
2024-02-21 Weakly supervised localisation of prostate cancer using reinforcement learning for bi-parametric MR images Martynas Pocius et.al. 2402.13778 null
2024-02-21 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777 link
2024-02-21 Reinforcement learning-assisted quantum architecture search for variational quantum algorithms Akash Kundu et.al. 2402.13754 null
2024-02-21 Privacy-Preserving Instructions for Aligning Large Language Models Da Yu et.al. 2402.13659 link
2024-02-20 Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies Ammar N. Abbas et.al. 2402.13219 link
2024-02-20 Bayesian Reward Models for LLM Alignment Adam X. Yang et.al. 2402.13210 null
2024-02-20 SONATA: Self-adaptive Evolutionary Framework for Hardware-aware Neural Architecture Search Halima Bouzidi et.al. 2402.13204 null
2024-02-20 Tiny Reinforcement Learning for Quadruped Locomotion using Decision Transformers Orhan Eren Akgün et.al. 2402.13201 link
2024-02-20 Align Your Intents: Offline Imitation Learning via Optimal Transport Maksim Bobrin et.al. 2402.13037 null
2024-02-20 Multi-Level ML Based Burst-Aware Autoscaling for SLO Assurance and Cost Efficiency Chunyang Meng et.al. 2402.12962 link
2024-02-20 Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space Sindre Benjamin Remman et.al. 2402.12939 null
2024-02-20 Large Language Model-based Human-Agent Collaboration for Complex Task Solving Xueyang Feng et.al. 2402.12914 link
2024-02-20 Skill or Luck? Return Decomposition via Advantage Functions Hsiao-Ru Pan et.al. 2402.12874 null
2024-02-20 MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces Tianyu Zheng et.al. 2402.12845 link
2024-02-19 A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma et.al. 2402.12366 link
2024-02-19 Refining Minimax Regret for Unsupervised Environment Design Michael Beukman et.al. 2402.12284 link
2024-02-19 CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement Learning for LLM-based Mutation Jueon Eom et.al. 2402.12222 null
2024-02-19 Revisiting Data Augmentation in Deep Reinforcement Learning Jianshu Hu et.al. 2402.12181 link
2024-02-19 BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence Jiajie Jin et.al. 2402.12174 null
2024-02-19 Joint mode switching and resource allocation in wireless-powered RIS-aided multiuser communication systems Mingang Yuan et.al. 2402.12143 null
2024-02-19 Interpretable Brain-Inspired Representations Improve RL Performance on Visual Navigation Tasks Moritz Lange et.al. 2402.12067 null
2024-02-19 All Language Models Large and Small Zhixun Chen et.al. 2402.12061 null
2024-02-19 Reinforcement Learning for Optimal Execution when Liquidity is Time-Varying Andrea Macrì et.al. 2402.12049 null
2024-02-19 When Do Off-Policy and On-Policy Policy Gradient Methods Align? Davide Mambelli et.al. 2402.12034 null
2024-02-16 RLVF: Learning from Verbal Feedback without Overgeneralization Moritz Stephan et.al. 2402.10893 link
2024-02-16 Pedipulate: Enabling Manipulation Skills using a Quadruped Robot's Leg Philip Arm et.al. 2402.10837 null
2024-02-16 Goal-Conditioned Offline Reinforcement Learning via Metric Learning Alfredo Reichlin et.al. 2402.10820 null
2024-02-16 Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning Zihao Li et.al. 2402.10810 null
2024-02-16 Modelling crypto markets by multi-agent reinforcement learning Johann Lussange et.al. 2402.10803 link
2024-02-16 Policy Learning for Off-Dynamics RL with Deficient Support Linh Le Pham Van et.al. 2402.10765 link
2024-02-16 OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models Yuxuan Kuang et.al. 2402.10670 link
2024-02-16 Direct Preference Optimization with an Offset Afra Amini et.al. 2402.10571 link
2024-02-16 Discovery of an exchange-only gate sequence for CNOT with record-low gate time using reinforcement learning Violeta N. Ivanova-Rohling et.al. 2402.10559 null
2024-02-16 Provably Sample Efficient RLHF via Active Preference Optimization Nirjhar Das et.al. 2402.10500 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210 null
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207 link
2024-02-15 Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective Tianyi Qiu et.al. 2402.10184 null
2024-02-15 Large Scale Constrained Clustering With Reinforcement Learning Benedikt Schesch et.al. 2402.10177 null
2024-02-15 GraphCBAL: Class-Balanced Active Learning for Graph Neural Networks via Reinforcement Learning Chengcheng Yu et.al. 2402.10074 null
2024-02-15 RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models Saeed Khaki et.al. 2402.10038 null
2024-02-15 Neural Network Approaches for Parameterized Optimal Control Deepanshu Verma et.al. 2402.10033 null
2024-02-15 Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts Tobias Enders et.al. 2402.09992 link
2024-02-15 Enhancing Courier Scheduling in Crowdsourced Last-Mile Delivery through Dynamic Shift Extensions: A Deep Reinforcement Learning Approach Zead Saleh et.al. 2402.09961 null
2024-02-15 Revisiting Recurrent Reinforcement Learning with Memory Monoids Steven Morad et.al. 2402.09900 link
2024-02-14 Reinforcement Learning from Human Feedback with Active Queries Kaixuan Ji et.al. 2402.09401 null
2024-02-14 LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning Adithya Raman et.al. 2402.09392 null
2024-02-14 Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning Allen M. Wang et.al. 2402.09387 null
2024-02-14 Single-Reset Divide & Conquer Imitation Learning Alexandre Chenu et.al. 2402.09355 null
2024-02-14 Mitigating Reward Hacking via Information-Theoretic Reward Modeling Yuchun Miao et.al. 2402.09345 null
2024-02-14 Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning Michael Lanier et.al. 2402.09290 null
2024-02-14 Uncertainty-Aware Transient Stability-Constrained Preventive Redispatch: A Distributional Reinforcement Learning Approach Zhengcheng Wang et.al. 2402.09263 null
2024-02-14 Discovering Command and Control (C2) Channels on Tor and Public Networks Using Reinforcement Learning Cheng Wang et.al. 2402.09200 null
2024-02-14 Measuring Exploration in Reinforcement Learning via Optimal Transport in Policy Space Reabetswe M. Nkhumise et.al. 2402.09113 null
2024-02-14 Exploiting Estimation Bias in Deep Double Q-Learning for Actor-Critic Methods Alberto Sinigaglia et.al. 2402.09078 null
2024-02-13 Mixtures of Experts Unlock Parameter Scaling for Deep RL Johan Obando-Ceron et.al. 2402.08609 link
2024-02-13 A Distributional Analogue to the Successor Representation Harley Wiltzer et.al. 2402.08530 link
2024-02-13 Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea Hanna Krasowski et.al. 2402.08502 null
2024-02-13 Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming Andrzej Mizera et.al. 2402.08491 null
2024-02-13 Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning for Digital Twins Eslam Eldeeb et.al. 2402.08421 null
2024-02-13 Transition Constrained Bayesian Optimization via Markov Decision Processes Jose Pablo Folch et.al. 2402.08406 null
2024-02-13 MAVRL: Learn to Fly in Cluttered Environments with Varying Speed Hang Yu et.al. 2402.08381 null
2024-02-13 Reinforcement Learning for Docking Maneuvers with Prescribed Performance Simon Gottschalk et.al. 2402.08306 null
2024-02-13 Off-Policy Evaluation in Markov Decision Processes under Weak Distributional Overlap Mohammad Mehrabi et.al. 2402.08201 null
2024-02-13 Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation Ayesha Siddika Nipu et.al. 2402.08184 null
2024-02-12 MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning Ayesha Siddika Nipu et.al. 2402.07890 null
2024-02-12 Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States Noam Razin et.al. 2402.07875 link
2024-02-12 IR-Aware ECO Timing Optimization Using Reinforcement Learning Vidya A. Chhabria et.al. 2402.07781 null
2024-02-12 Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model Mark Rowland et.al. 2402.07598 null
2024-02-12 Rethinking Scaling Laws for Learning in Strategic Environments Tinashe Handina et.al. 2402.07588 null
2024-02-12 A Reinforcement Learning Approach to the Design of Quantum Chains for Optimal Energy Transfer S. Sgroi et.al. 2402.07561 null
2024-02-12 Reinforcement learning based demand charge minimization using energy storage Lucas Weber et.al. 2402.07525 null
2024-02-12 Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial Wenpin Tang et.al. 2402.07487 null
2024-02-12 Auxiliary Reward Generation with Transition Distance Representation Learning Siyuan Li et.al. 2402.07412 null
2024-02-12 Measurement Scheduling for ICU Patients with Offline Reinforcement Learning Zongliang Ji et.al. 2402.07344 null
2024-02-09 Predictive representations: building blocks of intelligence Wilka Carvalho et.al. 2402.06590 null
2024-02-09 Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks Michael Y. Fatemi et.al. 2402.06552 link
2024-02-09 ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies Jasmina Gajcin et.al. 2402.06503 null
2024-02-09 Hierarchical Transformers are Efficient Meta-Reinforcement Learners Gresa Shala et.al. 2402.06402 null
2024-02-09 High-Precision Geosteering via Reinforcement Learning and Particle Filters Ressi Bonti Muhammad et.al. 2402.06377 null
2024-02-09 Dynamic Q-planning for Online UAV Path Planning in Unknown and Complex Environments Lidia Gianne Souza da Rocha et.al. 2402.06297 null
2024-02-09 Value function interference and greedy action selection in value-based multi-objective reinforcement learning Peter Vamplew et.al. 2402.06266 null
2024-02-09 Reinforcement Learning for Blind Stair Climbing with Legged and Wheeled-Legged Robots Simon Chamorro et.al. 2402.06143 null
2024-02-08 Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning Mohak Bhardwaj et.al. 2402.06102 null
2024-02-08 Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making Scotty Black et.al. 2402.06075 null
2024-02-08 Risk-Sensitive Multi-Agent Reinforcement Learning in Network Aggregative Markov Games Hafez Ghaemi et.al. 2402.05906 link
2024-02-08 Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices Jiin Woo et.al. 2402.05876 null
2024-02-08 Discovering Temporally-Aware Reinforcement Learning Algorithms Matthew Thomas Jackson et.al. 2402.05828 link
2024-02-08 Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning Zhiheng Xi et.al. 2402.05808 link
2024-02-08 Analysing the Sample Complexity of Opponent Shaping Kitty Fung et.al. 2402.05782 null
2024-02-08 When is Mean-Field Reinforcement Learning Tractable and Relevant? Batuhan Yardim et.al. 2402.05757 null
2024-02-08 Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL Jiawei Huang et.al. 2402.05724 link
2024-02-08 Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming Giorgio Angelotti et.al. 2402.05703 null
2024-02-08 Improving Token-Based World Models with Parallel Observation Prediction Lior Cohen et.al. 2402.05643 link
2024-02-08 Optimizing Delegation in Collaborative Human-AI Hybrid Teams Andrew Fuchs et.al. 2402.05605 null
2024-02-07 Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation Dennis Hoftijzer et.al. 2402.05090 link
2024-02-07 Non-Markovian Quantum Control via Model Maximum Likelihood Estimation and Reinforcement Learning Tanmay Neema et.al. 2402.05084 null
2024-02-07 Extending the Reach of First-Order Algorithms for Nonconvex Min-Max Problems with Cohypomonotonicity Ahmet Alacaoglu et.al. 2402.05071 null
2024-02-07 Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning Shathushan Sivashangaran et.al. 2402.05066 null
2024-02-07 Towards Generalizability of Multi-Agent Reinforcement Learning in Graphs with Recurrent Message Passing Jannis Weil et.al. 2402.05027 link
2024-02-07 Pedagogical Alignment of Large Language Models Shashank Sonkar et.al. 2402.05000 null
2024-02-07 A Bayesian Approach to Online Learning for Contextual Restless Bandits with Applications to Public Health Biyonka Liang et.al. 2402.04933 link
2024-02-07 Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning Apoorva Vashisth et.al. 2402.04894 link
2024-02-07 Leveraging knowledge-as-a-service (KaaS) for QoS-aware resource management in multi-user video transcoding Luis Costero et.al. 2402.04891 null
2024-02-07 Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy Ruichu Cai et.al. 2402.04869 null
2024-02-06 MusicRL: Aligning Music Generation to Human Preferences Geoffrey Cideron et.al. 2402.04229 null
2024-02-06 Reinforcement Learning with Ensemble Model Predictive Safety Certification Sven Gronauer et.al. 2402.04182 null
2024-02-06 Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions Daniel Bogdoll et.al. 2402.04168 link
2024-02-06 Harnessing the Plug-and-Play Controller by Prompting Hao Wang et.al. 2402.04160 null
2024-02-06 Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Ruoqi Zhang et.al. 2402.04080 link
2024-02-06 Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks Yang Cao et.al. 2402.04056 null
2024-02-06 REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR Liang-Hsuan Tseng et.al. 2402.03988 link
2024-02-06 Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning Maxime Toquebiau et.al. 2402.03972 link
2024-02-06 In-context learning agents are asymmetric belief updaters Johannes A. Schubert et.al. 2402.03969 null
2024-02-06 Reinforcement Learning for Collision-free Flight Exploiting Deep Collision Encoding Mihir Kulkarni et.al. 2402.03947 null
2024-02-05 Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models Anthony Sicilia et.al. 2402.03284 null
2024-02-05 A Framework for Partially Observed Reward-States in RLHF Chinmaya Kausik et.al. 2402.03282 null
2024-02-05 MobilityGPT: Enhanced Human Mobility Modeling with a GPT model Ammar Haydari et.al. 2402.03264 null
2024-02-05 Multi-agent Reinforcement Learning for Energy Saving in Multi-Cell Massive MIMO Systems Tianzhang Cai et.al. 2402.03204 null
2024-02-05 A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning Abdelhakim Benechehab et.al. 2402.03146 null
2024-02-05 Boosting Long-Delayed Reinforcement Learning with Auxiliary Short-Delayed Task Qingyuan Wu et.al. 2402.03141 link
2024-02-05 Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations Stefan Sylvius Wagner et.al. 2402.03138 null
2024-02-05 Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning Carlos A. Velazquez-Vargas et.al. 2402.03072 null
2024-02-05 Probabilistic Actor-Critic: Learning to Explore with PAC-Bayes Uncertainty Bahareh Tasdighi et.al. 2402.03055 null
2024-02-05 Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning Shengyi Huang et.al. 2402.03046 null
2024-02-02 Position Paper: Generalized grammar rules and structure-based generalization beyond classical equivariance for lexical tasks and transduction Mircea Petrache et.al. 2402.01629 null
2024-02-02 DRL-Based Dynamic Channel Access and SCLAR Maximization for Networks Under Jamming Abdul Basit et.al. 2402.01574 null
2024-02-02 A Hybrid Strategy for Chat Transcript Summarization Pratik K. Biswas et.al. 2402.01510 null
2024-02-02 Brain-Like Replay Naturally Emerges in Reinforcement Learning Agents Jiyi Wang et.al. 2402.01467 null
2024-02-02 A Reinforcement Learning-Boosted Motion Planning Framework: Comprehensive Generalization Performance in Autonomous Driving Rainer Trauth et.al. 2402.01465 link
2024-02-02 Learning the Market: Sentiment-Based Ensemble Trading Agents Andrew Ye et.al. 2402.01441 null
2024-02-02 StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback Shihan Dou et.al. 2402.01391 link
2024-02-02 To the Max: Reinventing Reward in Reinforcement Learning Grigorii Veviurko et.al. 2402.01361 null
2024-02-02 Parametric-Task MAP-Elites Timothée Anne et.al. 2402.01275 null
2024-02-02 Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems Neharika Jali et.al. 2402.01147 null
2024-02-01 Towards Efficient and Exact Optimization of Language Model Alignment Haozhe Ji et.al. 2402.00856 link
2024-02-01 SLIM: Skill Learning with Multiple Critics David Emukpere et.al. 2402.00823 null
2024-02-01 Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments Alexander W. Goodall et.al. 2402.00816 null
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807 null
2024-02-01 Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning Benjamin Patrick Evans et.al. 2402.00787 null
2024-02-01 Dense Reward for Free in Reinforcement Learning from Human Feedback Alex J. Chan et.al. 2402.00782 link
2024-02-01 Control-Theoretic Techniques for Online Adaptation of Deep Neural Networks in Dynamical Systems Jacob G. Elkins et.al. 2402.00761 null
2024-02-01 FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game Guangzheng Hu et.al. 2402.00738 null
2024-02-01 Neural Policy Style Transfer Raul Fernandez-Fernandez et.al. 2402.00677 null
2024-02-01 Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching Raul Fernandez-Fernandez et.al. 2402.00676 null
2024-01-31 Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability Navin Kamuni et.al. 2401.18040 null
2024-01-31 Causal Coordinated Concurrent Reinforcement Learning Tim Tse et.al. 2401.18012 null
2024-01-31 Circuit Partitioning for Multi-Core Quantum Architectures with Deep Reinforcement Learning Arnau Pastor et.al. 2401.17976 null
2024-01-31 Attention Graph for Multi-Robot Social Navigation with Deep Reinforcement Learning Erwan Escudie et.al. 2401.17914 null
2024-01-31 On Tractability, Complexity, and Mixed-Integer Convex Programming Representability of Distributionally Favorable Optimization Nan Jiang et.al. 2401.17899 null
2024-01-31 Graph Attention-based Reinforcement Learning for Trajectory Design and Resource Assignment in Multi-UAV Assisted Communication Zikai Feng et.al. 2401.17880 null
2024-01-31 Safe Reinforcement Learning-Based Eco-Driving Control for Mixed Traffic Flows With Disturbances Ke Lu et.al. 2401.17837 null
2024-01-31 A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees Toshinori Kitamura et.al. 2401.17780 link
2024-01-31 SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models Xiao Shao et.al. 2401.17749 link
2024-01-31 Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming Haotian Ling et.al. 2401.17527 null
2024-01-30 Improving robustness of quantum feedback control with reinforcement learning Manuel Guatto et.al. 2401.17190 link
2024-01-30 Zero-Shot Reinforcement Learning via Function Encoders Tyler Ingebrand et.al. 2401.17173 link
2024-01-30 Learning Approximation Sets for Exploratory Queries Susan B. Davidson et.al. 2401.17059 null
2024-01-30 M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation Fotios Lygerakis et.al. 2401.17032 link
2024-01-30 Re3val: Reinforced and Reranked Generative Retrieval EuiYul Song et.al. 2401.16979 null
2024-01-30 CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning Andreas W. M. Sauter et.al. 2401.16974 link
2024-01-30 Deep Contextual Bandit and Reinforcement Learning for IRS-Assisted MU-MIMO Systems Dariel Pereira-Ruisánchez et.al. 2401.16901 null
2024-01-30 Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control Zhongyu Li et.al. 2401.16889 null
2024-01-30 Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator Ryoma Furuyama et.al. 2401.16772 null
2024-01-30 Gradient-Based Language Model Red Teaming Nevan Wichers et.al. 2401.16656 link
2024-01-29 Curriculum-Based Reinforcement Learning for Quadrupedal Jumping: A Reference-free Design Vassil Atanassov et.al. 2401.16337 link
2024-01-29 Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF Banghua Zhu et.al. 2401.16335 null
2024-01-29 Optimal Control of Renewable Energy Communities subject to Network Peak Fees with Model Predictive Control and Reinforcement Learning Algorithms Samy Aittahar et.al. 2401.16321 null
2024-01-29 Prepare Non-classical Collective Spin State by Reinforcement Learning X. L. Zhao et.al. 2401.16320 null
2024-01-29 Effective Communication with Dynamic Feature Compression Pietro Talli et.al. 2401.16236 link
2024-01-29 Scalable Reinforcement Learning for Linear-Quadratic Control of Networks Johan Olsson et.al. 2401.16183 null
2024-01-29 Future Impact Decomposition in Request-level Recommendations Xiaobei Wang et.al. 2401.16108 link
2024-01-29 Emergence of cooperation under punishment: A reinforcement learning perspective Chenyang Zhao et.al. 2401.16073 null
2024-01-29 SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning Jianlan Luo et.al. 2401.16013 null
2024-01-29 A Deep Q-Network Based on Radial Basis Functions for Multi-Echelon Inventory Management Liqiang Cheng et.al. 2401.15872 null
2024-01-26 Fully Independent Communication in Multi-Agent Reinforcement Learning Rafael Pina et.al. 2401.15059 link
2024-01-26 Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning Md Mushfiqur Rahman et.al. 2401.15043 null
2024-01-26 Reinforcement Learning-based Relay Selection for Cooperative WSNs in the Presence of Bursty Impulsive Noise Hazem Barka et.al. 2401.15008 null
2024-01-26 Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks Eura Nofshin et.al. 2401.14923 null
2024-01-26 RESPRECT: Speeding-up Multi-fingered Grasping with Residual Reinforcement Learning Federico Ceola et.al. 2401.14858 link
2024-01-26 A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols in Mobile Networks Peter J. Gu et.al. 2401.14823 link
2024-01-26 On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks Joar Skalse et.al. 2401.14811 null
2024-01-26 Off-Policy Primal-Dual Safe Reinforcement Learning Zifan Wu et.al. 2401.14758 link
2024-01-26 FairSample: Training Fair and Accurate Graph Convolutional Neural Networks Efficiently Zicun Cong et.al. 2401.14702 null
2024-01-25 GCBF+: A Neural Graph Control Barrier Function Framework for Distributed Safe Multi-Agent Control Songyuan Zhang et.al. 2401.14554 null
2024-01-25 Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks Shuai Han et.al. 2401.14226 null
2024-01-25 True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning Weihao Tan et.al. 2401.14151 link
2024-01-25 Concept: Dynamic Risk Assessment for AI-Controlled Robotic Systems Philipp Grimmeisen et.al. 2401.14147 null
2024-01-25 Towards a Systems Theory of Algorithms Florian Dörfler et.al. 2401.14029 null
2024-01-25 Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration Alireza Mohammadshahi et.al. 2401.13979 link
2024-01-25 Networked Multiagent Reinforcement Learning for Peer-to-Peer Energy Trading Chen Feng et.al. 2401.13947 null
2024-01-25 Learning-based sensing and computing decision for data freshness in edge computing-enabled networks Sinwoong Yun et.al. 2401.13936 null
2024-01-25 Reinforcement Learning with Hidden Markov Models for Discovering Decision-Making Dynamics Xingche Guo et.al. 2401.13929 null
2024-01-25 Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation Yixuan Zhang et.al. 2401.13884 null
2024-01-24 Machine learning for industrial sensing and control: A survey and practical perspective Nathan P. Lawrence et.al. 2401.13836 null
2024-01-24 The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations Matthias Lehmann et.al. 2401.13662 link
2024-01-24 Emergence of anti-coordinated patterns in snowdrift game by reinforcement learning Zhen-Wei Ding et.al. 2401.13497 null
2024-01-24 Multi-Agent Diagnostics for Robustness via Illuminated Diversity Mikayel Samvelyan et.al. 2401.13460 null
2024-01-24 Symbolic Equation Solving via Reinforcement Learning Lennart Dabelow et.al. 2401.13447 null
2024-01-24 TraKDis: A Transformer-based Knowledge Distillation Approach for Visual Reinforcement Learning with Application to Cloth Manipulation Wei Chen et.al. 2401.13362 null
2024-01-24 SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning Guoxin Chen et.al. 2401.13246 link
2024-01-24 DittoGym: Learning to Control Soft Shape-Shifting Robots Suning Huang et.al. 2401.13231 link
2024-01-23 NLBAC: A Neural Ordinary Differential Equations-based Framework for Stable and Safe Reinforcement Learning Liqun Zhao et.al. 2401.13148 link
2024-01-23 The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts Lingfeng Shen et.al. 2401.13136 null
2024-01-23 Generalization of Heterogeneous Multi-Robot Policies via Awareness and Communication of Capabilities Pierce Howell et.al. 2401.13127 null
2024-01-23 HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments Qinhong Zhou et.al. 2401.12975 link
2024-01-23 Reward-Relevance-Filtered Linear Offline Reinforcement Learning Angela Zhou et.al. 2401.12934 null
2024-01-23 Active Inference as a Model of Agency Lancelot Da Costa et.al. 2401.12917 null
2024-01-23 Emergent Communication Protocol Learning for Task Offloading in Industrial Internet of Things Salwa Mostafa et.al. 2401.12914 null
2024-01-23 Model-Free $δ$-Policy Iteration Based on Damped Newton Method for Nonlinear Continuous-Time H$\infty$ Tracking Control Qi Wang et.al. 2401.12882 null
2024-01-23 Learning safety critics via a non-contractive binary bellman operator Agustin Castellano et.al. 2401.12849 null
2024-01-23 Digital Twin-Based Network Management for Better QoE in Multicast Short Video Streaming Xinyu Huang et.al. 2401.12826 null
2024-01-23 Deep Learning Based Simulators for the Phosphorus Removal Process Control in Wastewater Treatment via Deep Reinforcement Learning Algorithms Esmaeel Mohammadi et.al. 2401.12822 null
2024-01-23 Dynamic Layer Tying for Parameter-Efficient Transformers Tamir David Hay et.al. 2401.12819 null
2024-01-23 Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach Christian Fabian et.al. 2401.12686 null
2024-01-22 Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning Philip Amortila et.al. 2401.12216 null
2024-01-22 Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization Animesh Basak Chowdhury et.al. 2401.12205 null
2024-01-22 WARM: On the Benefits of Weight Averaged Reward Models Alexandre Ramé et.al. 2401.12187 null
2024-01-22 West-of-N: Synthetic Preference Generation for Improved Reward Modeling Alizée Pace et.al. 2401.12086 null
2024-01-22 Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking Yujiao Zhu et.al. 2401.12079 null
2024-01-22 HomeRobot Open Vocabulary Mobile Manipulation Challenge 2023 Participant Report (Team KuzHum) Volodymyr Kuzma et.al. 2401.12048 null
2024-01-22 Adaptive Motion Planning for Multi-fingered Functional Grasp via Force Feedback Dongying Tian et.al. 2401.11977 null
2024-01-22 Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey Pengyi Li et.al. 2401.11963 link
2024-01-22 Self-Labeling the Job Shop Scheduling Problem Andrea Corsini et.al. 2401.11849 link
2024-01-22 Safe and Generalized end-to-end Autonomous Driving System with Reinforcement Learning and Demonstrations Zuojin Tang et.al. 2401.11792 null
2024-01-19 Reinforcement learning for question answering in programming domain using public community scoring as a human feedback Alexey Gorbatovski et.al. 2401.10882 null
2024-01-19 Deep Reinforcement Learning Empowered Activity-Aware Dynamic Health Monitoring Systems Ziqiaing Ye et.al. 2401.10794 null
2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng et.al. 2401.10700 link
2024-01-19 Quality-Diversity Algorithms Can Provably Be Helpful for Optimization Chao Qian et.al. 2401.10539 null
2024-01-19 Episodic Reinforcement Learning with Expanded State-reward Space Dayang Liang et.al. 2401.10516 null
2024-01-18 HRL-TSCH: A Hierarchical Reinforcement Learning-based TSCH Scheduler for IIoT F. Fernando Jurado-Lasso et.al. 2401.10368 null
2024-01-18 LangProp: A code optimization framework using Language Models applied to driving Shu Ishida et.al. 2401.10314 link
2024-01-18 Model-Assisted Learning for Adaptive Cooperative Perception of Connected Autonomous Vehicles Kaige Qu et.al. 2401.10156 null
2024-01-18 Multi-Agent Reinforcement Learning for Maritime Operational Technology Cyber Security Alec Wilson et.al. 2401.10149 null
2024-01-18 Deep Back-Filling: a Split Window Technique for Deep Online Cluster Job Scheduling Lingfei Wang et.al. 2401.09910 null
2024-01-18 Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation Network Qiong Wu et.al. 2401.09886 link
2024-01-18 Reconciling Spatial and Temporal Abstractions for Goal Representation Mehdi Zadem et.al. 2401.09870 link
2024-01-18 FREED++: Improving RL Agents for Fragment-Based Molecule Generation by Thorough Reproduction Alexander Telepov et.al. 2401.09840 link
2024-01-18 Optimizing Visible Light Communication Efficiency Through Reinforcement Learning-Based NOMA-CSK Integration Serkan Vela et.al. 2401.09780 null
2024-01-18 Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning Hao Chen et.al. 2401.09772 null
2024-01-18 Exploration and Anti-Exploration with Distributional Random Network Distillation Kai Yang et.al. 2401.09750 link
2024-01-18 A HPC Co-Scheduler with Reinforcement Learning Abel Souza et.al. 2401.09706 null
2024-01-17 Central Limit Theorem for Two-Timescale Stochastic Approximation with Markovian Noise: Theory and Applications Jie Hu et.al. 2401.09339 null
2024-01-17 Vision-driven Autonomous Flight of UAV Along River Using Deep Reinforcement Learning with Dynamic Expert Guidance Zihan Wang et.al. 2401.09332 link
2024-01-17 Deployable Reinforcement Learning with Variable Control Rate Dong Wang et.al. 2401.09286 link
2024-01-17 An Efficient Generalizable Framework for Visuomotor Policies via Control-aware Augmentation and Privilege-guided Distillation Yinuo Zhao et.al. 2401.09258 null
2024-01-17 LLMs for Relational Reasoning: How Far are We? Zhiming Li et.al. 2401.09042 null
2024-01-17 UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems Changshuo Zhang et.al. 2401.09034 link
2024-01-17 Continuous Time Continuous Space Homeostatic Reinforcement Learning (CTCS-HRRL) : Towards Biological Self-Autonomous Agent Hugo Laurencon et.al. 2401.08999 null
2024-01-17 ReFT: Reasoning with Reinforced Fine-Tuning Trung Quoc Luong et.al. 2401.08967 link
2024-01-17 Cascading Reinforcement Learning Yihan Du et.al. 2401.08961 null
2024-01-17 Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback Teng Xiao et.al. 2401.08959 null
2024-01-16 On Quantum Natural Policy Gradients André Sequeira et.al. 2401.08307 link
2024-01-16 Sum Throughput Maximization in Multi-BD Symbiotic Radio NOMA Network Assisted by Active-STAR-RIS Rahman Saadat Yeganeh et.al. 2401.08301 null
2024-01-16 PRewrite: Prompt Rewriting with Reinforcement Learning Weize Kong et.al. 2401.08189 null
2024-01-16 IoTWarden: A Deep Reinforcement Learning Based Real-time Defense System to Mitigate Trigger-action IoT Attacks Md Morshed Alam et.al. 2401.08141 null
2024-01-16 CycLight: learning traffic signal cooperation with a cycle-level strategy Gengyue Han et.al. 2401.08121 null
2024-01-15 Survey of Learning Approaches for Robotic In-Hand Manipulation Abraham Itzhak Weinberg et.al. 2401.07915 null
2024-01-15 Learned Best-Effort LLM Serving Siddharth Jha et.al. 2401.07886 null
2024-01-15 The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise Shuze Liu et.al. 2401.07844 null
2024-01-15 Inferring Preferences from Demonstrations in Multi-Objective Residential Energy Management Junlin Lu et.al. 2401.07722 null
2024-01-15 Go-Explore for Residential Energy Management Junlin Lu et.al. 2401.07710 null
2024-01-12 NetMind: Adaptive RAN Baseband Function Placement by GCN Encoding and Maze-solving DRL Haiyuan Li et.al. 2401.06722 link
2024-01-12 Identifying Policy Gradient Subspaces Jan Schneider et.al. 2401.06604 null
2024-01-12 Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Case Study Shangding Gu et.al. 2401.06603 null
2024-01-12 Maximum Causal Entropy Inverse Reinforcement Learning for Mean-Field Games Berkay Anahtarci et.al. 2401.06566 null
2024-01-12 Personalized Reinforcement Learning with a Budget of Policies Dmitry Ivanov et.al. 2401.06514 link
2024-01-12 AI-enabled Priority and Auction-Based Spectrum Management for 6G Mina Khadem et.al. 2401.06484 null
2024-01-12 UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution Gengrui Zhang et.al. 2401.06470 null
2024-01-12 Striking a Balance in Fairness for Dynamic Systems Through Reinforcement Learning Yaowei Hu et.al. 2401.06318 link
2024-01-12 A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications Hamidreza Mazandarani et.al. 2401.06308 null
2024-01-11 Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care Elham Estiri et.al. 2401.06299 null
2024-01-11 Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint Zhipeng Chen et.al. 2401.06081 link
2024-01-11 Secrets of RLHF in Large Language Models Part II: Reward Modeling Binghai Wang et.al. 2401.06080 link
2024-01-11 Spatial-Aware Deep Reinforcement Learning for the Traveling Officer Problem Niklas Strauß et.al. 2401.05969 null
2024-01-11 Machine Learning Insides OptVerse AI Solver: Design Principles and Applications Xijun Li et.al. 2401.05960 null
2024-01-11 Optimistic Model Rollouts for Pessimistic Offline Policy Optimization Yuanzhao Zhai et.al. 2401.05899 null
2024-01-11 Safe reinforcement learning in uncertain contexts Dominik Baumann et.al. 2401.05876 link
2024-01-11 Confidence-Based Curriculum Learning for Multi-Agent Path Finding Thomy Phan et.al. 2401.05860 link
2024-01-11 Interactions between dynamic team composition and coordination: An agent-based modeling approach Darío Blanco-Fernández et.al. 2401.05832 null
2024-01-11 Towards Goal-Oriented Agents for Evolving Problems Observed via Conversation Michael Free et.al. 2401.05822 null
2024-01-11 Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents Quentin Delfosse et.al. 2401.05821 link
2024-01-10 ReACT: Reinforcement Learning for Controller Parametrization using B-Spline Geometries Thomas Rudolf et.al. 2401.05251 null
2024-01-10 Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces Yaqi Duan et.al. 2401.05233 null
2024-01-10 Modelling, Positioning, and Deep Reinforcement Learning Path Tracking Control of Scaled Robotic Vehicles: Design and Experimental Validation Carmine Caponio et.al. 2401.05194 null
2024-01-11 DRL-based Latency-Aware Network Slicing in O-RAN with Time-Varying SLAs Raoul Raftopoulos et.al. 2401.05042 null
2024-01-10 Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk Dennis Ulmer et.al. 2401.05033 null
2024-01-10 An Information Theoretic Approach to Interaction-Grounded Learning Xiaoyan Hu et.al. 2401.05015 null
2024-01-10 Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval Rumsha Fatima et.al. 2401.04938 null
2024-01-10 Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey Jiechuan Jiang et.al. 2401.04934 null
2024-01-09 Graph Learning-based Fleet Scheduling for Urban Air Mobility under Operational Constraints, Varying Demand & Uncertainties Steve Paul et.al. 2401.04851 null
2024-01-09 Deep Reinforcement Multi-agent Learning framework for Information Gathering with Local Gaussian Processes for Water Monitoring Samuel Yanes Luis et.al. 2401.04631 null
2024-01-09 Scalable Policies for the Dynamic Traveling Multi-Maintainer Problem with Alerts Peter Verleijsdonk et.al. 2401.04574 null
2024-01-09 i-Rebalance: Personalized Vehicle Repositioning for Supply Demand Balance Haoyang Chen et.al. 2401.04429 null
2024-01-09 StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments Sean Kulinski et.al. 2401.04290 null
2024-01-08 Curiosity & Entropy Driven Unsupervised RL in Multiple Environments Shaurya Dewan et.al. 2401.04198 null
2024-01-08 A Minimaximalist Approach to Reinforcement Learning from Human Feedback Gokul Swamy et.al. 2401.04056 null
2024-01-08 Behavioural Cloning in VizDoom Ryan Spick et.al. 2401.03993 null
2024-01-08 Guiding drones by information gain Alouette van Hove et.al. 2401.03947 null
2024-01-08 Using reinforcement learning to improve drone-based inference of greenhouse gas fluxes Alouette van Hove et.al. 2401.03932 link
2024-01-08 A Tensor Network Implementation of Multi Agent Reinforcement Learning Sunny Howard et.al. 2401.03896 null
2024-01-08 Inverse Reinforcement Learning with Sub-optimal Experts Riccardo Poiani et.al. 2401.03857 null
2024-01-08 Long-term Safe Reinforcement Learning with Binary Feedback Akifumi Wachi et.al. 2401.03786 null
2024-01-07 NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds Shivam Goel et.al. 2401.03546 null
2024-01-07 ClusterComm: Discrete Communication in Decentralized MARL using Internal Representation Clustering Robert Müller et.al. 2401.03504 null
2024-01-07 Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence Philip Jordan et.al. 2401.03489 link
2024-01-05 A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty Parvin Malekzadeh et.al. 2401.02914 null
2024-01-05 Deep Reinforcement Learning for Local Path Following of an Autonomous Formula SAE Vehicle Harvey Merton et.al. 2401.02903 null
2024-01-05 Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning Hong-Gi Shin et.al. 2401.02710 null
2024-01-05 Adaptive Discounting of Training Time Attacks Ridhima Bector et.al. 2401.02652 null
2024-01-05 Improving sample efficiency of high dimensional Bayesian optimization with MCMC Zeji Yi et.al. 2401.02650 null
2024-01-05 Simple Hierarchical Planning with Diffusion Chang Chen et.al. 2401.02644 null
2024-01-04 Structured Matrix Learning under Arbitrary Entrywise Dependence and Estimation of Markov Transition Kernel Jinhang Chai et.al. 2401.02520 null
2024-01-04 Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach Sungwook Yang et.al. 2401.02508 null
2024-01-04 A Survey Analyzing Generalization in Deep Reinforcement Learning Ezgi Korkmaz et.al. 2401.02349 null
2024-01-04 A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning Parvin Malekzadeh et.al. 2401.02325 link
2024-01-04 Policy-regularized Offline Multi-objective Reinforcement Learning Qian Lin et.al. 2401.02244 link
2024-01-04 Trajectory-Oriented Policy Optimization with Sparse Rewards Guojian Wang et.al. 2401.02225 null
2024-01-04 OFDM-Based Digital Semantic Communication with Importance Awareness Chuanhong Liu et.al. 2401.02178 null
2024-01-04 Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning Ke Li et.al. 2401.02160 null
2024-01-04 ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers Chen Zheng et.al. 2401.02072 null
2024-01-03 NODEC: Neural ODE For Optimal Control of Unknown Dynamical Systems Cheng Chi et.al. 2401.01836 link
2024-01-03 Optimizing UAV-UGV Coalition Operations: A Hybrid Clustering and Multi-Agent Reinforcement Learning Approach for Path Planning in Obstructed Environment Shamyo Brotee et.al. 2401.01481 null
2024-01-02 Learning-based agricultural management in partially observable environments subject to climate variability Zhaoan Wang et.al. 2401.01273 null
2024-01-02 Mirror Descent for Stochastic Control Problems with Measure-valued Controls Bekzhan Kerimkulov et.al. 2401.01198 null
2024-01-02 Deep Learning Driven Buffer-Aided Cooperative Networks for B5G/6G: Challenges, Solutions, and Future Opportunities Peng Xu et.al. 2401.01195 null
2024-01-02 Reinforcement Learning for SAR View Angle Inversion with Differentiable SAR Renderer Yanni Wang et.al. 2401.01165 null
2024-01-02 Enhancing Communication Efficiency of Semantic Transmission via Joint Processing Technique Xumin Pu et.al. 2401.01143 null
2024-01-02 Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach Chong Huang et.al. 2401.01140 null
2024-01-02 Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction Jie Feng et.al. 2401.01084 null
2024-01-01 Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning Mohamad Abed El Rahman Hammoud et.al. 2401.00916 null
2024-01-01 Polynomial-time Approximation Scheme for Equilibriums of Games Hongbo Sun et.al. 2401.00747 link
2024-01-01 Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach Sangjun Bae et.al. 2401.00661 null
2023-12-29 Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios Xinyuan Wu et.al. 2312.17606 link
2023-12-29 Exploring Deep Reinforcement Learning for Robust Target Tracking using Micro Aerial Vehicles Alberto Dionigi et.al. 2312.17552 link
2023-12-29 Design Space Exploration of Approximate Computing Techniques with a Reinforcement Learning Approach Sepide Saeedi et.al. 2312.17525 null
2023-12-29 Actuator-Constrained Reinforcement Learning for High-Speed Quadrupedal Locomotion Young-Ha Shin et.al. 2312.17507 null
2023-12-29 HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning Hao Wang et.al. 2312.17503 null
2023-12-29 Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning Nigini Oliveira et.al. 2312.17479 null
2023-12-29 Once Burned, Twice Shy? The Effect of Stock Market Bubbles on Traders that Learn by Experience Haibei Zhu et.al. 2312.17472 null
2023-12-28 Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e Chenwei Xu et.al. 2312.17372 null
2023-12-28 Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng et.al. 2312.17248 null
2023-12-28 Resilient Constrained Reinforcement Learning Dongsheng Ding et.al. 2312.17194 null
2023-12-28 Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning? Gunshi Gupta et.al. 2312.17168 link
2023-12-28 Generalizable Visual Reinforcement Learning with Segment Anything Model Ziyu Wang et.al. 2312.17116 link
2023-12-28 When Metaverses Meet Vehicle Road Cooperation: Multi-Agent DRL-Based Stackelberg Game for Vehicular Twins Migration Jiawen Kang et.al. 2312.17081 null
2023-12-28 Model-aware reinforcement learning for high-performance Bayesian experimental design in quantum metrology Federico Belliardo et.al. 2312.16985 link
2023-12-28 Reinforcement-based Display-size Selection for Frugal Satellite Image Change Detection Hichem Sahbi et.al. 2312.16965 null
2023-12-28 RLPlanner: Reinforcement Learning based Floorplanning for Chiplets with Fast Thermal Analysis Yuanyuan Duan et.al. 2312.16895 null
2023-12-28 Tail-Learning: Adaptive Learning Method for Mitigating Tail Latency in Autonomous Edge Systems Cheng Zhang et.al. 2312.16883 null
2023-12-28 Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies Bing Yuan et.al. 2312.16815 null
2023-12-26 A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration Fahri Wisnu Murti et.al. 2312.16142 null
2023-12-26 Large Language Models as Traffic Signal Control Agents: Capacity and Opportunity Siqi Lai et.al. 2312.16044 link
2023-12-26 Aligning Large Language Models with Human Preferences through Representation Engineering Wenhao Liu et.al. 2312.15997 link
2023-12-26 Adaptive Kalman-based hybrid car following strategy using TD3 and CACC Yuqi Zheng et.al. 2312.15993 null
2023-12-26 Optimistic and Pessimistic Actor in RL:Decoupling Exploration and Utilization Jingpu Yang et.al. 2312.15965 link
2023-12-26 Reinforcement Unlearning Dayong Ye et.al. 2312.15910 null
2023-12-26 Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations Renzhe Zhou et.al. 2312.15909 link
2023-12-26 PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning Hangyu Mao et.al. 2312.15863 link
2023-12-26 Learning Online Policies for Person Tracking in Multi-View Environments Keivan Nalaie et.al. 2312.15858 null
2023-12-25 A Closed-Loop Multi-perspective Visual Servoing Approach with Reinforcement Learning Lei Zhang et.al. 2312.15809 null
2023-12-22 A Survey of Reinforcement Learning from Human Feedback Timo Kaufmann et.al. 2312.14925 null
2023-12-22 Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning Filippos Christianos et.al. 2312.14878 null
2023-12-22 YAYI 2: Multilingual Open-Source Large Language Models Yin Luo et.al. 2312.14862 null
2023-12-22 An investigation of belief-free DRL and MCTS for inspection and maintenance planning Daniel Koutas et.al. 2312.14824 null
2023-12-22 Hierarchical Multi-Agent Reinforcement Learning for Assessing False-Data Injection Attacks on Transportation Networks Taha Eghtesad et.al. 2312.14625 null
2023-12-22 Machine learning for structure-guided materials and process design Lukas Morand et.al. 2312.14552 null
2023-12-22 DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge Jiaming Lu et.al. 2312.14532 link
2023-12-22 Not All Tasks Are Equally Difficult: Multi-Task Reinforcement Learning with Dynamic Depth Routing Jinmin He et.al. 2312.14472 null
2023-12-22 Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration Honghao Wei et.al. 2312.14470 null
2023-12-22 Dynamic Programming-based Approximate Optimal Control for Model-Based Reinforcement Learning Prakash Mallick et.al. 2312.14463 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134 null
2023-12-21 CVA Hedging by Risk-Averse Stochastic-Horizon Reinforcement Learning Roberto Daluiso et.al. 2312.14044 null
2023-12-21 Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing Hany Abdulsamad et.al. 2312.14000 link
2023-12-21 Modular Neural Network Policies for Learning In-Flight Object Catching with a Robot Hand-Arm System Wenbin Hu et.al. 2312.13987 null
2023-12-21 Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles Ruoqi Wen et.al. 2312.13910 null
2023-12-21 Variational Quantum Circuit Design for Quantum Reinforcement Learning on Continuous Environments Georg Kruse et.al. 2312.13798 null
2023-12-21 Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator Zichun Xu et.al. 2312.13788 link
2023-12-21 Critic-Guided Decision Transformer for Offline Reinforcement Learning Yuanfu Wang et.al. 2312.13716 null
2023-12-21 Automatic Curriculum Learning with Gradient Reward Signals Ryan Campbell et.al. 2312.13565 null
2023-12-20 Entropy-Regularized Mean-Variance Portfolio Optimization with Jumps Christian Bender et.al. 2312.13409 null
2023-12-20 First-principle-like reinforcement learning of nonlinear numerical schemes for conservation laws Hao-Chen Wang et.al. 2312.13260 null
2023-12-20 Learning Best Response Policies in Dynamic Auctions via Deep Reinforcement Learning Vinzenz Thoma et.al. 2312.13232 null
2023-12-20 Task-oriented Semantics-aware Communications for Robotic Waypoint Transmission: the Value and Age of Information Approach Wenchao Wu et.al. 2312.13182 null
2023-12-20 Collaborative Optimization of the Age of Information under Partial Observability Anam Tahir et.al. 2312.12977 null
2023-12-20 Sparse Mean Field Load Balancing in Large Localized Queueing Systems Anam Tahir et.al. 2312.12973 null
2023-12-20 PGN: A perturbation generation network against deep reinforcement learning Xiangjuan Li et.al. 2312.12904 null
2023-12-20 Parameterized Projected Bellman Operator Théo Vincent et.al. 2312.12869 link
2023-12-20 Towards Machines that Trust: AI Agents Learn to Trust in the Trust Game Ardavan S. Nobandegani et.al. 2312.12868 null
2023-12-20 Dynamic Fairness-Aware Spectrum Auction for Enhanced Licensed Shared Access in 6G Networks Mina Khadem et.al. 2312.12867 null
2023-12-20 Safe Multi-Agent Reinforcement Learning for Formation Control without Individual Reference Targets Murad Dawood et.al. 2312.12861 null
2023-12-19 Emergence of In-Context Reinforcement Learning from Noise Distillation Ilya Zisman et.al. 2312.12275 link
2023-12-19 TaskFlex Solver for Multi-Agent Pursuit via Automatic Curriculum Learning Jiayu Chen et.al. 2312.12255 null
2023-12-19 CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning Chenyu Sun et.al. 2312.12191 null
2023-12-19 OVD-Explorer:Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments Jinyi Liu et.al. 2312.12145 null
2023-12-19 Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning Yanwen Ba et.al. 2312.12095 link
2023-12-19 Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property Ioannis Anagnostides et.al. 2312.12067 null
2023-12-19 XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX Alexander Nikulin et.al. 2312.12044 link
2023-12-19 LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments Federico Ceola et.al. 2312.12036 link
2023-12-19 Parameterized Decision-making with Multi-modal Perception for Autonomous Driving Yuyang Xia et.al. 2312.11935 null
2023-12-19 Stable Relay Learning Optimization Approach for Fast Power System Production Cost Minimization Simulation Zishan Guo et.al. 2312.11896 null
2023-12-18 Contextual Reinforcement Learning for Offshore Wind Farm Bidding David Cole et.al. 2312.10884 null
2023-12-17 Learning to Act without Actions Dominik Schmidt et.al. 2312.10812 link
2023-12-17 Deep-Dispatch: A Deep Reinforcement Learning-Based Vehicle Dispatch Algorithm for Advanced Air Mobility Elaheh Sabziyan Varnousfaderani et.al. 2312.10809 null
2023-12-17 Language-conditioned Learning for Robotic Manipulation: A Survey Hongkuan Zhou et.al. 2312.10807 link
2023-12-17 CACTO-SL: Using Sobolev Learning to improve Continuous Actor-Critic with Trajectory Optimization Elisa Alboni et.al. 2312.10666 link
2023-12-17 Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward Haoxin Lin et.al. 2312.10642 link
2023-12-17 Risk-Constrained Reinforcement Learning for Inverter-Dominated Power System Controls Kyung-bin Kwon et.al. 2312.10635 null
2023-12-16 Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning Rohan Banerjee et.al. 2312.10557 link
2023-12-16 Advancing RAN Slicing with Offline Reinforcement Learning Kun Yang et.al. 2312.10547 null
2023-12-16 Spatial Deep Learning for Site-Specific Movement Optimization of Aerial Base Stations Jiangbin Lyu et.al. 2312.10490 null
2023-12-15 ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent Renat Aksitov et.al. 2312.10003 null
2023-12-15 Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping Lauren H. Cooke et.al. 2312.09983 null
2023-12-15 Deep Reinforcement Learning for Joint Cruise Control and Intelligent Data Acquisition in UAVs-Assisted Sensor Networks Yousef Emami et.al. 2312.09953 null
2023-12-15 Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations Cedric Derstroff et.al. 2312.09950 link
2023-12-15 Assume-Guarantee Reinforcement Learning Milad Kazemi et.al. 2312.09938 null
2023-12-15 LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer Yuxin Cao et.al. 2312.09935 link
2023-12-15 Sample-Efficient Learning to Solve a Real-World Labyrinth Game Using Data-Augmented Model-Based Reinforcement Learning Thomas Bi et.al. 2312.09906 null
2023-12-15 Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation Girolamo Macaluso et.al. 2312.09844 null
2023-12-15 Benchmarking the Full-Order Model Optimization Based Imitation in the Humanoid Robot Reinforcement Learning Walk Ekaterina Chaikovskaya et.al. 2312.09757 null
2023-12-15 GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with Relative Entropy Tianhao Peng et.al. 2312.09708 null
2023-12-14 Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking Jacob Eisenstein et.al. 2312.09244 null
2023-12-14 Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft Hao Li et.al. 2312.09238 null
2023-12-14 Vision-Language Models as a Source of Rewards Kate Baumli et.al. 2312.09187 null
2023-12-14 MRL-PoS: A Multi-agent Reinforcement Learning based Proof of Stake Consensus Algorithm for Blockchain Tariqul Islam et.al. 2312.09123 null
2023-12-14 Less is more -- the Dispatcher/ Executor principle for multi-task Reinforcement Learning Martin Riedmiller et.al. 2312.09120 null
2023-12-14 DeepSurveySim: Simulation Software and Benchmark Challenges for Astronomical Observation Scheduling Maggie Voetberg et.al. 2312.09092 link
2023-12-14 ReCoRe: Regularized Contrastive Representation Learning of World Model Rudra P. K. Poudel et.al. 2312.09056 null
2023-12-14 Using Surprise Index for Competency Assessment in Autonomous Decision-Making Akash Ratheesh et.al. 2312.09033 null
2023-12-14 Adaptive parameter sharing for multi-agent reinforcement learning Dapeng Li et.al. 2312.09009 null
2023-12-14 LiFT: Unsupervised Reinforcement Learning with Foundation Models as Teachers Taewook Nam et.al. 2312.08958 null
2023-12-13 The Effective Horizon Explains Deep RL Performance in Stochastic Environments Cassidy Laidlaw et.al. 2312.08369 link
2023-12-13 An Invitation to Deep Reinforcement Learning Bernhard Jaeger et.al. 2312.08365 null
2023-12-13 Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF Anand Siththaranjan et.al. 2312.08358 link
2023-12-13 Model-Free Verification for Neural Network Controlled Systems Han Wang et.al. 2312.08293 null
2023-12-13 Leveraging User Simulation to Develop and Evaluate Conversational Information Access Agents Nolwenn Bernard et.al. 2312.08041 null
2023-12-13 Secure Deep Reinforcement Learning for Dynamic Resource Allocation in Wireless MEC Networks Xin Hao et.al. 2312.08016 null
2023-12-14 Enhancing Robotic Navigation: An Evaluation of Single and Multi-Objective Reinforcement Learning Strategies Vicki Young et.al. 2312.07953 null
2023-12-13 On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning Ze Yu Zhao et.al. 2312.07917 null
2023-12-13 Artificial Intelligence Studies in Cartography: A Review and Synthesis of Methods, Applications, and Ethics Yuhao Kang et.al. 2312.07901 null
2023-12-13 RAT: Reinforcement-Learning-Driven and Adaptive Testing for Vulnerability Discovery in Web Application Firewalls Mohammadhossein Amouei et.al. 2312.07885 link
2023-12-12 On Diverse Preferences for Large Language Model Alignment Dun Zeng et.al. 2312.07401 link
2023-12-12 ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning Xiangyu Yin et.al. 2312.07392 link
2023-12-12 Sequential Planning in Large Partially Observable Environments guided by LLMs Swarna Kamal Paul et.al. 2312.07368 link
2023-12-12 Intelligible Protocol Learning for Resource Allocation in 6G O-RAN Slicing Farhad Rezazadeh et.al. 2312.07362 null
2023-12-12 Learning from Interaction: User Interface Adaptation using Reinforcement Learning Daniel Gaspar-Figueiredo et.al. 2312.07216 null
2023-12-12 Beyond Expected Return: Accounting for Policy Reproducibility when Evaluating Reinforcement Learning Algorithms Manon Flageat et.al. 2312.07178 null
2023-12-12 Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning Wei Geng et.al. 2312.07025 null
2023-12-11 A Novel Differentiable Loss Function for Unsupervised Graph Neural Networks in Graph Partitioning Vivek Chaudhary et.al. 2312.06877 null
2023-12-11 Scalable Decentralized Cooperative Platoon using Multi-Agent Deep Reinforcement Learning Ahmed Abdelrahman et.al. 2312.06858 null
2023-12-11 Data-Driven Modeling and Verification of Perception-Based Autonomous Systems Thomas Waite et.al. 2312.06848 null
2023-12-11 Convergence of Multi-Scale Reinforcement Q-Learning Algorithms for Mean Field Game and Control Problems Andrea Angiuli et.al. 2312.06659 null
2023-12-11 Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models Theodore Wolf et.al. 2312.06527 null
2023-12-11 Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills Hongcai He et.al. 2312.06518 link
2023-12-11 Reward Certification for Policy Smoothed Reinforcement Learning Ronghui Mu et.al. 2312.06436 null
2023-12-11 Partial End-to-end Reinforcement Learning for Robustness Against Modelling Error in Autonomous Racing Andrew Murdoch et.al. 2312.06406 null
2023-12-11 FOSS: A Self-Learned Doctor for Query Optimizer Kai Zhong et.al. 2312.06357 null
2023-12-11 DiffAIL: Diffusion Adversarial Imitation Learning Bingzheng Wang et.al. 2312.06348 link
2023-12-11 Dropout is all you need: robust two-qubit gate with reinforcement learning Tian-Niu Xu et.al. 2312.06335 null
2023-12-11 Mobile Edge Computing and AI Enabled Web3 Metaverse over 6G Wireless Communications: A Deep Reinforcement Learning Approach Wenhan Yu et.al. 2312.06293 null
2023-12-11 No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning Dianyu Zhong et.al. 2312.06258 link
2023-12-08 TaskMet: Task-Driven Metric Learning for Model Learning Dishank Bansal et.al. 2312.05250 null
2023-12-08 Modeling Risk in Reinforcement Learning: A Literature Mapping Leonardo Villalobos-Arias et.al. 2312.05231 null
2023-12-08 DARLEI: Deep Accelerated Reinforcement Learning with Evolutionary Intelligence Saeejith Nair et.al. 2312.05171 null
2023-12-08 Onflow: an online portfolio allocation algorithm Gabriel Turinici et.al. 2312.05169 null
2023-12-08 Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator Samuel Mallick et.al. 2312.05166 link
2023-12-08 A Review of Cooperation in Multi-agent Learning Yali Du et.al. 2312.05162 null
2023-12-08 Learning to Fly Omnidirectional Micro Aerial Vehicles with an End-To-End Control Network Eugenio Cuniato et.al. 2312.05125 null
2023-12-08 An Autonomous Driving model with BEV-V2X Perception, Trajectory Prediction and Driving Planning in Complex Traffic Intersections Fukang Li et.al. 2312.05104 null
2023-12-08 UniTSA: A Universal Reinforcement Learning Framework for V2X Traffic Signal Control Maonan Wang et.al. 2312.05090 link
2023-12-08 Robotic Control of the Deformation of Soft Linear Objects Using Deep Reinforcement Learning Mélodie Hani Daniel Zakaria et.al. 2312.05056 link
2023-12-07 Data-Driven Robust Reinforcement Learning Control of Uncertain Nonlinear Systems: Towards a Fully-Automated, Insulin-Based Artificial Pancreas Alexandros Tanzanakis et.al. 2312.04503 null
2023-12-07 Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation Jiayi Huang et.al. 2312.04464 null
2023-12-07 Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization Carlos E. Luis et.al. 2312.04386 null
2023-12-07 HARQ-IR Aided Short Packet Communications: BLER Analysis and Throughput Maximization Fuchao He et.al. 2312.04377 null
2023-12-07 A Scalable Network-Aware Multi-Agent Reinforcement Learning Framework for Decentralized Inverter-based Voltage Control Han Xu et.al. 2312.04371 null
2023-12-07 Learning to sample in Cartesian MRI Thomas Sanchez et.al. 2312.04327 null
2023-12-07 iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design Ruyi Gan et.al. 2312.04326 null
2023-12-07 Multi Actor-Critic DDPG for Robot Action Space Decomposition: A Framework to Control Large 3D Deformation of Soft Linear Objects Mélodie Daniel et.al. 2312.04308 link
2023-12-07 Dynamic Data-Driven Digital Twins for Blockchain Systems Georgios Diamantopoulos et.al. 2312.04226 null
2023-12-07 CODEX: A Cluster-Based Method for Explainable Reinforcement Learning Timothy K. Mathes et.al. 2312.04216 link
2023-12-06 On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer Elie Aljalbout et.al. 2312.03673 null
2023-12-06 MICRACLE: Inverse Reinforcement and Curriculum Learning Model for Human-inspired Mobile Robot Navigation Nihal Gunukula et.al. 2312.03651 null
2023-12-06 MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment Ziyan Wang et.al. 2312.03644 null
2023-12-06 MOCHa: Multi-Objective Reinforcement Mitigating Caption Hallucinations Assaf Ben-Kish et.al. 2312.03631 link
2023-12-06 Evaluation of Active Feature Acquisition Methods for Static Feature Settings Henrik von Kleist et.al. 2312.03619 null
2023-12-06 Physical Symbolic Optimization Wassim Tenachi et.al. 2312.03612 link
2023-12-06 Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning Sangwoong Yoon et.al. 2312.03397 null
2023-12-06 Diffused Task-Agnostic Milestone Planner Mineui Hong et.al. 2312.03395 null
2023-12-06 Demand response for residential building heating: Effective Monte Carlo Tree Search control based on physics-informed neural networks Fabio Pavirani et.al. 2312.03365 null
2023-12-06 Masking Behaviors in Epidemiological Networks with Cognitively-plausible Reinforcement Learning Konstantinos Mitsopoulos et.al. 2312.03301 null
2023-12-05 Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World Kiana Ehsani et.al. 2312.02976 null
2023-12-05 Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications Rajeeva L. Karandikar et.al. 2312.02828 null
2023-12-05 Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems Céline Comte et.al. 2312.02804 null
2023-12-05 LExCI: A Framework for Reinforcement Learning with Embedded Systems Kevin Badalian et.al. 2312.02739 link
2023-12-05 Hierarchical Visual Policy Learning for Long-Horizon Robot Manipulation in Densely Cluttered Scenes Hecheng Wang et.al. 2312.02697 null
2023-12-05 Contact Energy Based Hindsight Experience Prioritization Erdi Sayar et.al. 2312.02677 null
2023-12-05 A Q-learning approach to the continuous control problem of robot inverted pendulum balancing Mohammad Safeea et.al. 2312.02649 null
2023-12-05 DanZero+: Dominating the GuanDan Game through Reinforcement Learning Youpeng Zhao et.al. 2312.02561 link
2023-12-05 PolyFit: A Peg-in-hole Assembly Framework for Unseen Polygon Shapes via Sim-to-real Adaptation Geonhyup Lee et.al. 2312.02531 null
2023-12-05 MASP: Scalable GNN-based Planning for Multi-Agent Navigation Xinyi Yang et.al. 2312.02522 null
2023-12-04 Optimizing Camera Configurations for Multi-View Pedestrian Detection Yunzhong Hou et.al. 2312.02144 null
2023-12-04 Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models Xingyuan Zhang et.al. 2312.02019 link
2023-12-04 CaRL: Cascade Reinforcement Learning with State Space Splitting for O-RAN based Traffic Steering Chuanneng Sun et.al. 2312.01970 null
2023-12-04 Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities Markus Wulfmeier et.al. 2312.01939 null
2023-12-04 A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization Xiaobo Hu et.al. 2312.01915 null
2023-12-04 Modular Control Architecture for Safe Marine Navigation: Reinforcement Learning and Predictive Safety Filters Aksel Vaaler et.al. 2312.01855 null
2023-12-04 Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing Ying Yuan et.al. 2312.01853 null
2023-12-04 Integrated Drill Boom Hole-Seeking Control via Reinforcement Learning Haoqi Yan et.al. 2312.01836 null
2023-12-04 Learning Machine Morality through Experience and Interaction Elizaveta Tennant et.al. 2312.01818 null
2023-12-04 Class Symbolic Regression: Gotta Fit 'Em All Wassim Tenachi et.al. 2312.01816 link
2023-12-01 Safe Reinforcement Learning in Tensor Reproducing Kernel Hilbert Space Xiaoyuan Cheng et.al. 2312.00727 null
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592 link
2023-12-01 Explainable Fraud Detection with Deep Symbolic Classification Samantha Visbeek et.al. 2312.00586 link
2023-12-01 Interior Point Constrained Reinforcement Learning with Global Convergence Guarantees Tingting Ni et.al. 2312.00561 null
2023-12-01 GFN-SR: Symbolic Regression with Generative Flow Networks Sida Li et.al. 2312.00396 link
2023-12-01 TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning Dohyeong Kim et.al. 2312.00344 null
2023-12-01 Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk Dohyeong Kim et.al. 2312.00342 null
2023-12-01 UAV-Aided Lifelong Learning for AoI and Energy Optimization in Non-Stationary IoT Networks Zhenzhen Gong et.al. 2312.00334 null
2023-12-01 Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach Xingqiu He et.al. 2312.00279 link
2023-12-01 Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration Viraj Mehta et.al. 2312.00267 null
2023-11-30 Language Model Agents Suffer from Compositional Generalization in Web Automation Hiroki Furuta et.al. 2311.18751 link
2023-11-30 Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms Xiangyuan Zhang et.al. 2311.18736 link
2023-11-30 Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization Daniel Jarne Ornia et.al. 2311.18703 link
2023-11-30 Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning Jared Markowitz et.al. 2311.18684 null
2023-11-30 Generalisable Agents for Neural Network Optimisation Kale-ab Tessera et.al. 2311.18598 null
2023-11-30 Optimizing ZX-Diagrams with Deep Reinforcement Learning Maximilian Nägele et.al. 2311.18588 link
2023-11-30 Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control Bernd Frauenknecht et.al. 2311.18393 null
2023-11-30 URLLC-Awared Resource Allocation for Heterogeneous Vehicular Edge Computing Qiong Wu et.al. 2311.18352 null
2023-11-30 Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent Bianca Marin Moreno et.al. 2311.18346 null
2023-11-30 Deep Reinforcement Learning Based Optimal Energy Management of Multi-energy Microgrids with Uncertainties Yang Cui et.al. 2311.18327 null
2023-11-29 Maximum Entropy Model Correction in Reinforcement Learning Amin Rakhsha et.al. 2311.17855 null
2023-11-29 Identifying Dynamic Regulation with Adversarial Surrogates Ron Teichner et.al. 2311.17783 null
2023-11-29 Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks Xianlun Peng et.al. 2311.17631 null
2023-11-29 LanGWM: Language Grounded World Model Rudra P. K. Poudel et.al. 2311.17593 null
2023-11-29 Deep Reinforcement Learning Graphs: Feedback Motion Planning via Neural Lyapunov Verification Armin Ghanbarzadeh et.al. 2311.17587 null
2023-11-29 Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning Lisheng Wu et.al. 2311.17565 null
2023-11-29 Reinforcement Learning with thermal fluctuations at the nano-scale Francesco Boccardo et.al. 2311.17519 null
2023-11-29 Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning Swaroop Nath et.al. 2311.17514 link
2023-11-29 Unveiling the Implicit Toxicity in Large Language Models Jiaxin Wen et.al. 2311.17391 link
2023-11-29 Data-driven Bandwidth Adaptation for Radio Access Network Slices Panagiotis Nikolaidis et.al. 2311.17347 null
2023-11-28 Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications Jun Wang et.al. 2311.17059 null
2023-11-28 An Investigation of Time Reversal Symmetry in Reinforcement Learning Brett Barkley et.al. 2311.17008 null
2023-11-28 Goal-conditioned Offline Planning from Curious Exploration Marco Bagatella et.al. 2311.16996 null
2023-11-28 ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up? Hailin Chen et.al. 2311.16989 link
2023-11-28 Bidirectional Reactive Programming for Machine Learning Dumitru Potop Butucaru et.al. 2311.16977 null
2023-11-28 End-to-end Reinforcement Learning for Time-Optimal Quadcopter Flight Robin Ferede et.al. 2311.16948 null
2023-11-28 Optimization Theory Based Deep Reinforcement Learning for Resource Allocation in Ultra-Reliable Wireless Networked Control Systems Hamida Qumber Ali et.al. 2311.16895 null
2023-11-28 Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing Zhengming Zhang et.al. 2311.16876 null
2023-11-28 Edge AI for Internet of Energy: Challenges and Perspectives Yassine Himeur et.al. 2311.16851 null
2023-11-28 Two-step dynamic obstacle avoidance Fabian Hart et.al. 2311.16841 null
2023-11-27 Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation Jiachen Li et.al. 2311.16091 null
2023-11-27 Evaluating the Impact of Personalized Value Alignment in Human-Robot Interaction: Insights into Trust and Team Performance Outcomes Shreyas Bhat et.al. 2311.16051 null
2023-11-27 Value-Based Reinforcement Learning for Digital Twins in Cloud Computing Van-Phuc Bui et.al. 2311.15985 null
2023-11-27 Adaptive Agents and Data Quality in Agent-Based Financial Markets Colin M. Van Oort et.al. 2311.15974 null
2023-11-27 Addressing Long-Horizon Tasks by Integrating Program Synthesis and State Machines Yu-An Lin et.al. 2311.15960 null
2023-11-27 Replay across Experiments: A Natural Extension of Off-Policy RL Dhruva Tirumala et.al. 2311.15951 null
2023-11-27 Reinforcement Learning for Wildfire Mitigation in Simulated Disaster Environments Alexander Tapley et.al. 2311.15925 link
2023-11-27 A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning Jianxiong Li et.al. 2311.15920 null
2023-11-27 Distributed Attacks over Federated Reinforcement Learning-enabled Cell Sleep Control Han Zhang et.al. 2311.15894 null
2023-11-27 Multi-Agent Reinforcement Learning for Power Control in Wireless Networks via Adaptive Graphs Lorenzo Mario Amorosa et.al. 2311.15858 null
2023-11-24 Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language Di Jin et.al. 2311.14543 null
2023-11-24 Digital Twin-Native AI-Driven Service Architecture for Industrial Networks Kubra Duran et.al. 2311.14532 null
2023-11-24 How to ensure a safe control strategy? Towards a SRL for urban transit autonomous operation Zicong Zhao et.al. 2311.14457 null
2023-11-24 Universal Jailbreak Backdoors from Poisoned Human Feedback Javier Rando et.al. 2311.14455 link
2023-11-24 Approximation of Convex Envelope Using Reinforcement Learning Vivek S. Borkar et.al. 2311.14421 null
2023-11-24 Directly Attention Loss Adjusted Prioritized Experience Replay Zhuoying Chen et.al. 2311.14390 null
2023-11-24 AI-based Attack Graph Generation Sangbeom Park et.al. 2311.14342 null
2023-11-24 Offline Skill Generalization via Task and Motion Planning Shin Watanabe et.al. 2311.14328 null
2023-11-24 On optimal tracking portfolio in incomplete markets: The classical control and the reinforcement learning approaches Lijun Bo et.al. 2311.14318 null
2023-11-24 Multi-modal Instance Refinement for Cross-domain Action Recognition Yuan Qing et.al. 2311.14281 null
2023-11-22 Risk-sensitive Markov Decision Process and Learning under General Utility Functions Zhengqi Wu et.al. 2311.13589 null
2023-11-22 Guided Flows for Generative Modeling and Decision Making Qinqing Zheng et.al. 2311.13443 null
2023-11-22 From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex? Yannik Keller et.al. 2311.13414 link
2023-11-22 Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents Zihao Zhou et.al. 2311.13373 link
2023-11-22 Probabilistic Inference in Reinforcement Learning Done Right Jean Tarbouriech et.al. 2311.13294 null
2023-11-22 Intention and Context Elicitation with Large Language Models in the Legal Aid Intake Process Nick Goodson et.al. 2311.13281 null
2023-11-22 Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Kai Yang et.al. 2311.13231 link
2023-11-22 AdaptiveFL: Adaptive Heterogeneous Federated Learning for Resource-Constrained AIoT Systems Chentao Jia et.al. 2311.13166 null
2023-11-22 Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications Ha-Thanh Nguyen et.al. 2311.13095 null
2023-11-22 Learning to Fly in Seconds Jonas Eschmann et.al. 2311.13081 link
2023-11-21 Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion Keshav P. Keval et.al. 2311.12613 null
2023-11-21 Reinforcement Learning for the Near-Optimal Design of Zero-Delay Codes for Markov Sources Liam Cregg et.al. 2311.12609 null
2023-11-21 Scheduling Distributed Flexible Assembly Lines using Safe Reinforcement Learning with Soft Shielding Lele Li et.al. 2311.12572 null
2023-11-21 Multi-Session Budget Optimization for Forward Auction-based Federated Learning Xiaoli Tang et.al. 2311.12548 null
2023-11-21 Towards Faster Reinforcement Learning of Quantum Circuit Optimization: Exponential Reward Functions Ioana Moflic et.al. 2311.12509 null
2023-11-21 Cost Explosion for Efficient Reinforcement Learning Optimisation of Quantum Circuits Ioana Moflic et.al. 2311.12498 null
2023-11-21 Multi-Objective Reinforcement Learning based on Decomposition: A taxonomy and framework Florian Felten et.al. 2311.12495 link
2023-11-21 Reinforcement Learning for Stochastic LQ Control of Discrete-Time Systems with Multiplicative Noises Hongdan Li et.al. 2311.12322 null
2023-11-21 Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations Sayak Mukherjee et.al. 2311.12264 null
2023-11-21 Beyond Simulated Drivers: Evaluating the Impact of Real-World Car-Following in Mixed Traffic Control Bibek Poudel et.al. 2311.12261 link
2023-11-20 Provably Efficient CVaR RL in Low-rank MDPs Yulai Zhao et.al. 2311.11965 null
2023-11-20 Continual Learning: Applications and the Road Forward Eli Verwimp et.al. 2311.11908 null
2023-11-20 Few-shot Multispectral Segmentation with Representations Generated by Reinforcement Learning Dilith Jayakody et.al. 2311.11827 null
2023-11-20 AIaaS for ORAN-based 6G Networks: Multi-time scale slice resource management with DRL Suvidha Mhatre et.al. 2311.11668 null
2023-11-20 Replay-enhanced Continual Reinforcement Learning Tiantian Zhang et.al. 2311.11557 link
2023-11-20 ADAPTER-RL: Adaptation of Any Agent using Reinforcement Learning Yizhao Jin et.al. 2311.11537 null
2023-11-19 Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets Kun Yang et.al. 2311.11423 null
2023-11-19 Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts Ahmed Hendawy et.al. 2311.11385 link
2023-11-19 Dynamic System Stability Verification Using Numerical Simulator Jongrae Kim et.al. 2311.11372 null
2023-11-19 Tactile Active Inference Reinforcement Learning for Efficient Robotic Manipulation Skill Acquisition Zihao Liu et.al. 2311.11287 null
2023-11-17 EduGym: An Environment Suite for Reinforcement Learning Education Thomas M. Moerland et.al. 2311.10590 link
2023-11-17 Learning Agile Locomotion on Risky Terrains Chong Zhang et.al. 2311.10484 null
2023-11-17 Decentralized Energy Marketplace via NFTs and AI-based Agents Rasoul Nikbakht et.al. 2311.10406 link
2023-11-17 Joint Sensing and Communication Optimization in Target-Mounted STARS-Assisted Vehicular Networks: A MADRL Approach Haocheng Zhang et.al. 2311.10352 null
2023-11-17 Imagination-augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments Sang-Hyun Lee et.al. 2311.10309 null
2023-11-17 From "Thumbs Up" to "10 out of 10": Reconsidering Scalar Feedback in Interactive Reinforcement Learning Hang Yu et.al. 2311.10284 null
2023-11-16 Data-Driven LQR using Reinforcement Learning and Quadratic Neural Networks Soroush Asri et.al. 2311.10235 null
2023-11-17 JaxMARL: Multi-Agent RL Environments in JAX Alexander Rutherford et.al. 2311.10090 link
2023-11-16 DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback Yangyi Chen et.al. 2311.10081 null
2023-11-16 Interpretable Reinforcement Learning for Robotics and Continuous Control Rohan Paleja et.al. 2311.10041 link
2023-11-16 Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning Francesco De Lellis et.al. 2311.10026 link
2023-11-16 Online Optimization for Network Resource Allocation and Comparison with Reinforcement Learning Techniques Ahmed Sid-Ali et.al. 2311.10023 null
2023-11-16 Safety Aware Autonomous Path Planning Using Model Predictive Reinforcement Learning for Inland Waterways Astrid Vanneste et.al. 2311.09878 null
2023-11-16 Short vs. Long-term Coordination of Drones: When Distributed Optimization Meets Deep Reinforcement Learning Chuhao Qin et.al. 2311.09852 null
2023-11-16 Runtime Verification of Learning Properties for Reinforcement Learning Algorithms Tommaso Mannucci et.al. 2311.09811 null
2023-11-16 Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown Genglin Liu et.al. 2311.09731 link
2023-11-16 Augmenting Unsupervised Reinforcement Learning with Self-Reference Andrew Zhao et.al. 2311.09692 null
2023-11-15 Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge Sang-Hyun Lee et.al. 2311.09195 null
2023-11-15 Grounding or Guesswork? Large Language Models are Presumptive Grounders Omar Shaikh et.al. 2311.09144 null
2023-11-15 Aligning Neural Machine Translation Models: Human Feedback in Training and Inference Miguel Moura Ramos et.al. 2311.09132 null
2023-11-15 Assessing the Robustness of Intelligence-Driven Reinforcement Learning Lorenzo Nodari et.al. 2311.09027 null
2023-11-15 On the Foundation of Distributionally Robust Reinforcement Learning Shengbo Wang et.al. 2311.09018 null
2023-11-15 Adversarial Attacks to Reward Machine-based Reinforcement Learning Lorenzo Nodari et.al. 2311.09014 null
2023-11-15 Supported Trust Region Optimization for Offline Reinforcement Learning Yixiu Mao et.al. 2311.08935 null
2023-11-15 Efficiently Escaping Saddle Points for Non-Convex Policy Optimization Sadegh Khorasani et.al. 2311.08914 null
2023-11-15 An MRL-Based Design Solution for RIS-Assisted MU-MIMO Wireless System under Time-Varying Channels Meng-Qian Alexander Wu et.al. 2311.08840 null
2023-11-15 A Deep Reinforcement Learning Approach to Efficient Distributed Optimization Daokuan Zhu et.al. 2311.08827 null
2023-11-14 MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation Ehsan Asali et.al. 2311.08393 null
2023-11-14 Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding Guangyu Yang et.al. 2311.08380 link
2023-11-14 Workflow-Guided Response Generation for Task-Oriented Dialogue Do June Min et.al. 2311.08300 null
2023-11-14 On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling Nicholas E. Corrado et.al. 2311.08290 null
2023-11-14 Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework Weiqin Zu et.al. 2311.08244 null
2023-11-14 When Mining Electric Locomotives Meet Reinforcement Learning Ying Li et.al. 2311.08153 null
2023-11-14 Probable Object Location (POLo) Score Estimation for Efficient Object Goal Navigation Jiaming Wang et.al. 2311.07992 null
2023-11-14 AutoML for Large Capacity Modeling of Meta Ranking Systems Hang Yin et.al. 2311.07870 null
2023-11-14 A Neuro-Inspired Hierarchical Reinforcement Learning for Motor Control Pei Zhang et.al. 2311.07822 null
2023-11-13 Reinforcement Learning for Solving Stochastic Vehicle Routing Problem Zangir Iklassov et.al. 2311.07708 link
2023-11-13 Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning Arjun Bhardwaj et.al. 2311.07558 null
2023-11-13 Investigating Robustness in Cyber-Physical Systems: Specification-Centric Analysis in the face of System Deviations Changjian Zhang et.al. 2311.07462 null
2023-11-13 Goal-oriented Estimation of Multiple Markov Sources in Resource-constrained Systems Jiping Luo et.al. 2311.07346 null
2023-11-13 An introduction to reinforcement learning for neuroscience Kristopher T. Jensen et.al. 2311.07315 null
2023-11-13 C-Procgen: Empowering Procgen with Controllable Contexts Zhenxiong Tan et.al. 2311.07312 null
2023-11-13 TIAGo RL: Simulated Reinforcement Learning Environments with Tactile Data for Mobile Robots Luca Lach et.al. 2311.07260 null
2023-11-13 Towards Transferring Tactile-based Continuous Force Control Policies from Simulation to Robot Luca Lach et.al. 2311.07245 null
2023-11-13 STEER: Unified Style Transfer with Expert Reinforcement Skyler Hallinan et.al. 2311.07167 link
2023-11-13 Untargeted Black-box Attacks for Social Recommendations Wenqi Fan et.al. 2311.07127 null
2023-11-12 FLASH-RL: Federated Learning Addressing System and Static Heterogeneity using Reinforcement Learning Sofiane Bouaziz et.al. 2311.06917 link
2023-11-10 Multi-Agent Reinforcement Learning for the Low-Level Control of a Quadrotor UAV Beomyeol Yu et.al. 2311.06144 link
2023-11-10 Intersection-free Robot Manipulation with Soft-Rigid Coupled Incremental Potential Contact Wenxin Du et.al. 2311.05945 null
2023-11-10 Learning-Augmented Scheduling for Solar-Powered Electric Vehicle Charging Tongxin Li et.al. 2311.05941 null
2023-11-10 Genetic Algorithm enhanced by Deep Reinforcement Learning in parent selection mechanism and mutation : Minimizing makespan in permutation flow shop scheduling problems Maissa Irmouli et.al. 2311.05937 null
2023-11-10 Clipped-Objective Policy Gradients for Pessimistic Policy Optimization Jared Markowitz et.al. 2311.05846 null
2023-11-10 Let's Reinforce Step by Step Sarah Pan et.al. 2311.05821 null
2023-11-09 Real-time Control of Electric Autonomous Mobility-on-Demand Systems via Graph Reinforcement Learning Aaryan Singhal et.al. 2311.05780 link
2023-11-09 Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models Gang Hu et.al. 2311.05743 null
2023-11-09 LLM Augmented Hierarchical Agents Bharat Prakash et.al. 2311.05596 null
2023-11-09 Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations Joey Hong et.al. 2311.05584 null
2023-11-09 Joint SDN Synchronization and Controller Placement in Wireless Networks using Deep Reinforcement Learning Akrit Mudvari et.al. 2311.05582 null
2023-11-09 Removing RLHF Protections in GPT-4 via Fine-Tuning Qiusi Zhan et.al. 2311.05553 null
2023-11-09 Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization Michael Kölle et.al. 2311.05546 null
2023-11-09 Anytime-Constrained Reinforcement Learning Jeremy McMahan et.al. 2311.05511 link
2023-11-09 From "What" to "When" -- a Spiking Neural Network Predicting Rare Events and Time to their Occurrence Mikhail Kiselev et.al. 2311.05210 null
2023-11-09 Counter-Empirical Attacking based on Adversarial Reinforcement Learning for Time-Relevant Scoring System Xiangguo Sun et.al. 2311.05144 link
2023-11-09 Accelerating Exploration with Unlabeled Prior Data Qiyang Li et.al. 2311.05067 link
2023-11-08 Reinforcement Learning Generalization for Nonlinear Systems Through Dual-Scale Homogeneity Transformations Abdel Gafoor Haddad et.al. 2311.05013 null
2023-11-08 Real-Time Recurrent Reinforcement Learning Julian Lemmel et.al. 2311.04830 null
2023-11-08 Simultaneous Discovery of Quantum Error Correction Codes and Encoders with a Noise-Aware Reinforcement Learning Agent Jan Olle et.al. 2311.04750 link
2023-11-08 Enhancing Multi-Agent Coordination through Common Operating Picture Integration Peihong Yu et.al. 2311.04740 null
2023-11-08 Social Motion Prediction with Cognitive Hierarchies Wentao Zhu et.al. 2311.04726 null
2023-11-08 RDGCN: Reinforced Dependency Graph Convolutional Network for Aspect-based Sentiment Analysis Xusheng Zhao et.al. 2311.04467 link
2023-11-07 Force-Constrained Visual Policy: Safe Robot-Assisted Dressing via Multi-Modal Sensing Zhanyi Sun et.al. 2311.04390 null
2023-11-07 Adaptive Stochastic Nonlinear Model Predictive Control with Look-ahead Deep Reinforcement Learning for Autonomous Vehicle Motion Control Baha Zarrouki et.al. 2311.04303 null
2023-11-07 Compilation of product-formula Hamiltonian simulation via reinforcement learning Lea M. Trenkwalder et.al. 2311.04285 link
2023-11-07 Interactive Semantic Map Representation for Skill-based Visual Object Navigation Tatiana Zemskova et.al. 2311.04107 null
2023-11-07 Time-Efficient Reinforcement Learning with Stochastic Stateful Policies Firas Al-Hafez et.al. 2311.04082 null
2023-11-07 Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment Geyang Guo et.al. 2311.04072 link
2023-11-07 Estimator-Coupled Reinforcement Learning for Robust Purely Tactile In-Hand Manipulation Lennart Röstel et.al. 2311.04060 null
2023-11-07 Reinforcement Learning Fine-tuning of Language Models is Biased Towards More Extractable Features Diogo Cruz et.al. 2311.04046 link
2023-11-07 A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems Cheng Yin et.al. 2311.04014 null
2023-11-07 Learning-Based Latency-Constrained Fronthaul Compression Optimization in C-RAN Axel Grönland et.al. 2311.03899 null
2023-11-07 On Deep Reinforcement Learning for Traffic Steering Intelligent ORAN Fatemeh Kavehmadavani et.al. 2311.03853 null
2023-11-07 Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning Yao Zhang et.al. 2311.03756 null
2023-11-07 Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning Joseph Suárez et.al. 2311.03736 null
2023-11-06 Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization Kun Lei et.al. 2311.03351 link
2023-11-06 A Brain-inspired Theory of Collective Mind Model for Efficient Social Cooperation Zhuoya Zhao et.al. 2311.03150 null
2023-11-06 Reinforcement Learning for Inverse Linear-quadratic Dynamic Non-cooperative Games Emin Martirosyan et.al. 2311.03044 null
2023-11-06 Virtual Action Actor-Critic Framework for Exploration (Student Abstract) Bumgeun Park et.al. 2311.02916 null
2023-11-06 Reinforcement Learning for Safety Testing: Lessons from A Mobile Robot Case Study Tom P. Huck et.al. 2311.02907 null
2023-11-06 Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs Wenke Xia et.al. 2311.02847 link
2023-11-05 ChaTA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs Yann Hicke et.al. 2311.02775 null
2023-11-05 Causal Question Answering with Reinforcement Learning Lukas Blübaum et.al. 2311.02760 link
2023-11-05 Staged Reinforcement Learning for Complex Tasks through Decomposed Environments Rafael Pina et.al. 2311.02746 null
2023-11-05 Learning Independently from Causality in Multi-Agent Environments Rafael Pina et.al. 2311.02741 null
2023-11-03 DeliverAI: Reinforcement Learning Based Distributed Path-Sharing Network for Food Deliveries Ashman Mehra et.al. 2311.02017 null
2023-11-03 Score Models for Offline Goal-Conditioned Reinforcement Learning Harshit Sikchi et.al. 2311.02013 null
2023-11-03 Conditions on Preference Relations that Guarantee the Existence of Optimal Policies Jonathan Colaco Carr et.al. 2311.01990 null
2023-11-03 Emergence of odd elasticity in a microswimmer using deep reinforcement learning Li-Shing Lin et.al. 2311.01973 null
2023-11-03 Domain Randomization via Entropy Maximization Gabriele Tiboni et.al. 2311.01885 null
2023-11-03 RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization Siqi Shen et.al. 2311.01753 link
2023-11-03 Epidemic Decision-making System Based Federated Reinforcement Learning Yangxi Zhou et.al. 2311.01749 null
2023-11-03 Energy Efficiency Optimization for Subterranean LoRaWAN Using A Reinforcement Learning Approach: A Direct-to-Satellite Scenario Kaiqiang Lin et.al. 2311.01743 null
2023-11-03 RDE: A Hybrid Policy Framework for Multi-Agent Path Finding Problem Jianqi Gao et.al. 2311.01728 null
2023-11-03 Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula Aryaman Reddi et.al. 2311.01642 null
2023-11-02 Conformal Policy Learning for Sensorimotor Control Under Distribution Shifts Huang Huang et.al. 2311.01457 null
2023-11-02 RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Yufei Wang et.al. 2311.01455 null
2023-11-02 DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing Vint Lee et.al. 2311.01450 null
2023-11-02 Analysis of Information Propagation in Ethereum Network Using Combined Graph Attention Network and Reinforcement Learning to Optimize Network Efficiency and Scalability Stefan Kambiz Behfar et.al. 2311.01406 null
2023-11-02 Learning Realistic Traffic Agents in Closed-loop Chris Zhang et.al. 2311.01394 null
2023-11-02 Formal Methods for Autonomous Systems Tichakorn Wongpiromsarn et.al. 2311.01258 null
2023-11-02 EISim: A Platform for Simulating Intelligent Edge Orchestration Solutions Henna Kokkonen et.al. 2311.01224 link
2023-11-02 Diffusion Models for Reinforcement Learning: A Survey Zhengbang Zhu et.al. 2311.01223 link
2023-11-02 Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning Siming Lan et.al. 2311.01075 link
2023-11-02 Dynamic Fair Federated Learning Based on Reinforcement Learning Weikang Chen et.al. 2311.00959 null
2023-11-02 Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning Richard Bornemann et.al. 2311.00651 null
2023-11-01 Learning impartial policies for sequential counterfactual explanations using Deep Reinforcement Learning E. Panagiotou et.al. 2311.00523 null
2023-11-01 Enhanced Generalization through Prioritization and Diversity in Self-Imitation Reinforcement Learning over Procedural Environments with Sparse Rewards Alain Andres et.al. 2311.00426 null
2023-11-01 Towards Automatic Sampling of User Behaviors for Sequential Recommender Systems Hao Zhang et.al. 2311.00388 null
2023-11-01 QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning Rizhong Wang et.al. 2311.00356 null
2023-11-02 A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents Olivier Sigaud et.al. 2311.00344 null
2023-11-01 Rethinking Decision Transformer via Hierarchical Reinforcement Learning Yi Ma et.al. 2311.00267 null
2023-11-01 Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents Yang Deng et.al. 2311.00262 link
2023-11-01 Active Neural Topological Mapping for Multi-Agent Exploration Xinyi Yang et.al. 2311.00252 null
2023-11-01 Federated Natural Policy Gradient Methods for Multi-task Reinforcement Learning Tong Yang et.al. 2311.00201 null
2023-10-31 Offline RL with Observation Histories: Analyzing and Improving Sample Complexity Joey Hong et.al. 2310.20663 null
2023-10-31 "Pick-and-Pass" as a Hat-Trick Class for First-Principle Memory, Generalizability, and Interpretability Benchmarks Jason Wang et.al. 2310.20654 null
2023-10-31 LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B Simon Lermen et.al. 2310.20624 null
2023-10-31 Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback Max Balsells et.al. 2310.20608 null
2023-10-31 Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning Ruizhe Shi et.al. 2310.20587 link
2023-10-31 Amoeba: Circumventing ML-supported Network Censorship via Adversarial Reinforcement Learning Haoyu Liu et.al. 2310.20469 link
2023-11-01 Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods Zhengpeng Xie et.al. 2310.20380 null
2023-10-31 Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents Woojun Kim et.al. 2310.20287 null
2023-10-31 Beyond Average Return in Markov Decision Processes Alexandre Marthe et.al. 2310.20266 null
2023-10-31 Handover Protocol Learning for LEO Satellite Networks: Access Delay and Collision Minimization Ju-Hyung Lee et.al. 2310.20215 null
2023-10-30 Optimal Status Updates for Minimizing Age of Correlated Information in IoT Networks with Energy Harvesting Sensors Chao Xu et.al. 2310.19216 link
2023-10-29 Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households Gargya Gokhale et.al. 2310.19155 null
2023-10-29 MAG-GNN: Reinforcement Learning Boosted Graph Neural Network Lecheng Kong et.al. 2310.19142 null
2023-10-29 Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning Suraj Singireddy et.al. 2310.19137 null
2023-10-29 Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery Katie Z Luo et.al. 2310.19080 null
2023-10-29 Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback Jingliang Duan et.al. 2310.19022 null
2023-10-31 Behavior Alignment via Reward Function Optimization Dhawal Gupta et.al. 2310.19007 null
2023-10-29 Spacecraft Autonomous Decision-Planning for Collision Avoidance: a Reinforcement Learning Approach Nicolas Bourriez et.al. 2310.18966 null
2023-10-29 Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game Zelai Xu et.al. 2310.18940 null
2023-10-29 Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation Nikki Lijing Kuang et.al. 2310.18919 null
2023-10-27 FP8-LM: Training FP8 Large Language Models Houwen Peng et.al. 2310.18313 link
2023-10-27 Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models Pushkal Katara et.al. 2310.18308 null
2023-10-27 Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt Yining Ma et.al. 2310.18264 link
2023-10-27 Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning Nicholas E. Corrado et.al. 2310.18247 null
2023-10-27 DESiRED -- Dynamic, Enhanced, and Smart iRED: A P4-AQM with Deep Reinforcement Learning and In-band Network Telemetry Leandro C. de Almeida et.al. 2310.18159 null
2023-10-27 Improving Intrinsic Exploration by Creating Stationary Objectives Roger Creus Castanyer et.al. 2310.18144 null
2023-10-27 Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models Xue Yan et.al. 2310.18127 null
2023-10-27 Text2Bundle: Towards Personalized Query-based Bundle Generation Shixuan Zhu et.al. 2310.18004 null
2023-10-27 Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning Shenzhi Wang et.al. 2310.17966 link
2023-10-27 Chain-of-Choice Hierarchical Policy Learning for Conversational Recommendation Wei Fan et.al. 2310.17922 link
2023-10-26 Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion Laura Smith et.al. 2310.17634 null
2023-10-26 Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity Jaedong Hwang et.al. 2310.17537 link
2023-10-26 Learning Regularized Graphon Mean-Field Games with Unknown Graphons Fengzhuo Zhang et.al. 2310.17531 null
2023-10-27 Adaptive Resource Management for Edge Network Slicing using Incremental Multi-Agent Deep Reinforcement Learning Haiyuan Li et.al. 2310.17523 null
2023-10-26 Orchestration of Emulator Assisted Mobile Edge Tuning for AI Foundation Models: A Multi-Agent Deep Reinforcement Learning Approach Wenhan Yu et.al. 2310.17492 null
2023-10-26 FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing Terence Jie Chua et.al. 2310.17491 null
2023-10-26 Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach Stephen Mak et.al. 2310.17485 null
2023-10-26 Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing Stephen Mak et.al. 2310.17458 null
2023-10-26 Goals are Enough: Inducing AdHoc cooperation among unseen Multi-Agent systems in IMFs Kaushik Dey et.al. 2310.17416 null
2023-10-26 CQM: Curriculum Reinforcement Learning with a Quantized World Model Seungjae Lee et.al. 2310.17330 null
2023-10-25 TD-MPC2: Scalable, Robust World Models for Continuous Control Nicklas Hansen et.al. 2310.16828 null
2023-10-25 AI Agent as Urban Planner: Steering Stakeholder Dynamics in Urban Planning via Consensus-based Multi-Agent Reinforcement Learning Kejiang Qian et.al. 2310.16772 null
2023-10-25 SuperHF: Supervised Iterative Learning from Human Feedback Gabriel Mukobi et.al. 2310.16763 link
2023-10-25 MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning Dong-Ki Kim et.al. 2310.16730 null
2023-10-25 Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies Michael Beukman et.al. 2310.16686 link
2023-10-25 BabyStories: Can Reinforcement Learning Teach Baby Language Models to Write Better Stories? Xingmeng Zhao et.al. 2310.16681 link
2023-10-25 UAV Pathfinding in Dynamic Obstacle Avoidance with Multi-agent Reinforcement Learning Qizhen Wu et.al. 2310.16659 null
2023-10-25 Towards Control-Centric Representations in Reinforcement Learning from Images Chen Liu et.al. 2310.16655 null
2023-10-25 Model predictive control-based value estimation for efficient reinforcement learning Qizhen Wu et.al. 2310.16646 link
2023-10-25 Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation Chengpeng Li et.al. 2310.16566 null
2023-10-24 AI Alignment and Social Choice: Fundamental Limitations and Policy Implications Abhilash Mishra et.al. 2310.16048 null
2023-10-25 WebWISE: Web Interface Control and Sequential Exploration with Large Language Models Heyi Tao et.al. 2310.16042 null
2023-10-24 Finetuning Offline World Models in the Real World Yunhai Feng et.al. 2310.16029 null
2023-10-24 Data-driven Traffic Simulation: A Comprehensive Review Di Chen et.al. 2310.15975 null
2023-10-24 State Sequences Prediction via Fourier Transform for Representation Learning Mingxuan Ye et.al. 2310.15888 link
2023-10-24 Control problems on infinite horizon subject to time-dependent pure state constraints Vincenzo Basco et.al. 2310.15771 null
2023-10-24 Recurrent Linear Transformers Subhojeet Pramanik et.al. 2310.15719 link
2023-10-24 Solving large flexible job shop scheduling instances by generating a diverse set of scheduling policies with deep reinforcement learning Imanol Echeverria et.al. 2310.15706 null
2023-10-24 DACOOP-A: Decentralized Adaptive Cooperative Pursuit via Attention Zheng Zhang et.al. 2310.15699 link
2023-10-25 COPF: Continual Learning Human Preference through Optimal Policy Fitting Han Zhang et.al. 2310.15694 null
2023-10-23 Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning Jingyun Yang et.al. 2310.15145 null
2023-10-23 The primacy bias in Model-based RL Zhongjian Qiao et.al. 2310.15017 null
2023-10-23 Reinforcement learning in large, structured action spaces: A simulation study of decision support for spinal cord injury rehabilitation Nathan Phelps et.al. 2310.14976 null
2023-10-23 Comparison of path following in ships using modern and traditional controllers Sanjeev Kumar Ramkumar Sudha et.al. 2310.14940 null
2023-10-23 AI on the Water: Applying DRL to Autonomous Vessel Navigation Md Shadab Alam et.al. 2310.14938 null
2023-10-23 Navigating the Ocean with DRL: Path following for marine vessels Joel Jose et.al. 2310.14932 null
2023-10-23 Budgeted Embedding Table For Recommender Systems Yunke Qu et.al. 2310.14884 null
2023-10-23 Diverse Priors for Deep Reinforcement Learning Chenfan Weng et.al. 2310.14864 null
2023-10-23 Policy Gradient with Kernel Quadrature Satoshi Hayakawa et.al. 2310.14768 null
2023-10-23 Multi-Agent Learning in Contextual Games under Unknown Constraints Anna M. Maddux et.al. 2310.14685 null
2023-10-20 Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis Philip John Gorinski et.al. 2310.13669 link
2023-10-20 EXPLORA: AI/ML EXPLainability for the Open RAN Claudio Fiandrino et.al. 2310.13667 link
2023-10-20 Contrastive Prefence Learning: Learning from Human Feedback without RL Joey Hejna et.al. 2310.13639 link
2023-10-20 Entangled Preferences: The History and Risks of Reinforcement Learning and Human Feedback Nathan Lambert et.al. 2310.13595 null
2023-10-20 Simultaneous Machine Translation with Tailored Reference Shoutao Guo et.al. 2310.13588 null
2023-10-20 Cooperative Multi-Agent Deep Reinforcement Learning for Adaptive Decentralized Emergency Voltage Control Ying Zhang et.al. 2310.13577 null
2023-10-20 Tree Search in DAG Space with Model-based Reinforcement Learning for Causal Discovery Victor-Alexandru Darvariu et.al. 2310.13576 null
2023-10-20 Reward Shaping for Happier Autonomous Cyber Security Agents Elizabeth Bates et.al. 2310.13565 null
2023-10-20 Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes Ruiquan Huang et.al. 2310.13550 null
2023-10-20 Towards Understanding Sycophancy in Language Models Mrinank Sharma et.al. 2310.13548 link
2023-10-19 Towards Robust Offline Reinforcement Learning under Diverse Data Corruption Rui Yang et.al. 2310.12955 link
2023-10-19 End-to-End Delay Minimization based on Joint Optimization of DNN Partitioning and Resource Allocation for Cooperative Edge Inference Xinrui Ye et.al. 2310.12937 null
2023-10-19 Generative Flow Networks as Entropy-Regularized RL Daniil Tiapkin et.al. 2310.12934 link
2023-10-19 Eureka: Human-Level Reward Design via Coding Large Language Models Yecheng Jason Ma et.al. 2310.12931 link
2023-10-19 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning Juan Rocamonde et.al. 2310.12921 link
2023-10-19 Collaborative Adaptation: Learning to Recover from Unforeseen Malfunctions in Multi-Robot Teams Yasin Findik et.al. 2310.12909 null
2023-10-19 Safe RLHF: Safe Reinforcement Learning from Human Feedback Josef Dai et.al. 2310.12773 link
2023-10-19 Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark Jiaming Ji et.al. 2310.12567 null
2023-10-19 Privacy Preserving Large Language Models: ChatGPT Case Study Based Vision and Framework Imdad Ullah et.al. 2310.12523 null
2023-10-19 SDGym: Low-Code Reinforcement Learning Environments using System Dynamics Models Emmanuel Klu et.al. 2310.12494 link
2023-10-18 Quality Diversity through Human Feedback Li Ding et.al. 2310.12103 link
2023-10-18 Understanding Reward Ambiguity Through Optimal Transport Theory in Inverse Reinforcement Learning Ali Baheri et.al. 2310.12055 null
2023-10-18 A General Theoretical Paradigm to Understand Learning from Human Preferences Mohammad Gheshlaghi Azar et.al. 2310.12036 null
2023-10-19 Improving Generalization of Alignment with Human Preferences through Group Invariant Learning Rui Zheng et.al. 2310.11971 null
2023-10-18 Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning Yen-Ju Chen et.al. 2310.11897 link
2023-10-18 Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning Yufei Kuang et.al. 2310.11845 null
2023-10-18 On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning Rohan Subramani et.al. 2310.11840 null
2023-10-18 IntentDial: An Intent Graph based Multi-Turn Dialogue System with Reasoning Path Visualization Zengguang Hao et.al. 2310.11818 null
2023-10-18 Dynamic Resource Management in Integrated NOMA Terrestrial-Satellite Networks using Multi-Agent Reinforcement Learning Ali Nauman et.al. 2310.11814 null
2023-10-18 NeuroCUT: A Neural Approach for Robust Graph Partitioning Rishi Shah et.al. 2310.11787 link
2023-10-17 GreenNFV: Energy-Efficient Network Function Virtualization with Service Level Agreement Constraints MD S Q Zulkar Nine et.al. 2310.11406 null
2023-10-17 Real-time data assimilation for the thermodynamic modeling of cryogenic storage tanks Pedro Afonso Marques et.al. 2310.11399 null
2023-10-17 Non-ergodicity in reinforcement learning: robustness via ergodicity transformations Dominik Baumann et.al. 2310.11335 link
2023-10-17 Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control Chao Li et.al. 2310.11138 null
2023-10-17 Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance Thomas Chaffre et.al. 2310.11075 null
2023-10-17 Cooperative Dispatch of Microgrids Community Using Risk-Sensitive Reinforcement Learning with Monotonously Improved Performance Ziqing Zhu et.al. 2310.10997 null
2023-10-17 Combat Urban Congestion via Collaboration: Heterogeneous GNN-based MARL for Coordinated Platooning and Traffic Signal Control Xianyue Peng et.al. 2310.10948 null
2023-10-18 Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning Yunlong Song et.al. 2310.10943 null
2023-10-17 Enhanced Transformer Architecture for Natural Language Processing Woohyeon Moon et.al. 2310.10930 null
2023-10-16 Eco-Driving Control of Connected and Automated Vehicles using Neural Network based Rollout Jacob Paugh et.al. 2310.10878 null
2023-10-16 Generating Summaries with Controllable Readability Levels Leonardo F. R. Ribeiro et.al. 2310.10623 link
2023-10-16 Quantifying Assistive Robustness Via the Natural-Adversarial Frontier Jerry Zhi-Yang He et.al. 2310.10610 null
2023-10-16 Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks Zihao Li et.al. 2310.10556 null
2023-10-16 Applications of Distributed Machine Learning for the Internet-of-Things: A Comprehensive Survey Mai Le et.al. 2310.10549 null
2023-10-16 Learning optimal integration of spatial and temporal information in noisy chemotaxis Albert Alonso et.al. 2310.10531 link
2023-10-16 Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning Xiang Zhang et.al. 2310.10509 null
2023-10-16 ReMax: A Simple, Effective, and Efficient Method for Aligning Large Language Models Ziniu Li et.al. 2310.10505 link
2023-10-16 Machine learning in physics: a short guide Francisco A. Rodrigues et.al. 2310.10368 link
2023-10-16 Unlocking Metasurface Practicality for B5G Networks: AI-assisted RIS Planning Guillermo Encinas-Lago et.al. 2310.10330 null
2023-10-16 End-to-end Offline Reinforcement Learning for Glycemia Control Tristan Beolet et.al. 2310.10312 null
2023-10-13 Goodhart's Law in Reinforcement Learning Jacek Karwowski et.al. 2310.09144 null
2023-10-13 Automatic Music Playlist Generation via Simulation-based Reinforcement Learning Federico Tomasi et.al. 2310.09123 null
2023-10-13 Online Relocating and Matching of Ride-Hailing Services: A Model-Based Modular Approach Chang Gao et.al. 2310.09071 null
2023-10-13 DATT: Deep Adaptive Trajectory Tracking for Quadrotor Control Kevin Huang et.al. 2310.09053 link
2023-10-13 Optimal Scheduling of Electric Vehicle Charging with Deep Reinforcement Learning considering End Users Flexibility Christoforos Menos-Aikateriniadis et.al. 2310.09040 null
2023-10-13 μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning Technique for Service Offloading in Fog computing Environments Mohammad Goudarzi et.al. 2310.09003 null
2023-10-13 Multi-Purpose NLP Chatbot : Design, Methodology & Conclusion Shivom Aggarwal et.al. 2310.08977 null
2023-10-13 PAGE: Equilibrate Personalization and Generalization in Federated Learning Qian Chen et.al. 2310.08961 null
2023-10-13 LLaMA Rider: Spurring Large Language Models to Explore the Open World Yicheng Feng et.al. 2310.08922 null
2023-10-13 Community Membership Hiding as Counterfactual Graph Search via Deep Reinforcement Learning Andrea Bernini et.al. 2310.08909 null
2023-10-12 Octopus: Embodied Vision-Language Programmer from Environmental Feedback Jingkang Yang et.al. 2310.08588 link
2023-10-12 Discovering Fatigued Movements for Virtual Character Animation Noshaba Cheema et.al. 2310.08583 null
2023-10-12 Universal Visual Decomposer: Long-Horizon Manipulation Made Easy Zichen Zhang et.al. 2310.08581 null
2023-10-12 A Lightweight Calibrated Simulation Enabling Efficient Offline Learning for Optimal Control of Real Buildings Judah Goldfeder et.al. 2310.08569 null
2023-10-12 Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining Licong Lin et.al. 2310.08566 link
2023-10-12 Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias Max Sobol Mark et.al. 2310.08558 link
2023-10-12 Cross-Episodic Curriculum for Transformer Agents Lucy Xiaoyang Shi et.al. 2310.08549 null
2023-10-12 MeanAP-Guided Reinforced Active Learning for Object Detection Zhixuan Liang et.al. 2310.08387 null
2023-10-12 Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment Boyang Xue et.al. 2310.08372 link
2023-10-12 Impact of multi-armed bandit strategies on deep recurrent reinforcement learning Valentina Zangirolami et.al. 2310.08331 link
2023-10-11 Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking Gustav Nikopensius et.al. 2310.07613 null
2023-10-11 Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning Mirco Mutti et.al. 2310.07518 null
2023-10-11 Sample-Driven Federated Learning for Energy-Efficient and Real-Time IoT Sensing Minh Ngoc Luu et.al. 2310.07497 link
2023-10-11 KwaiYiiMath: Technical Report Jiayi Fu et.al. 2310.07488 null
2023-10-11 GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive Testing Hangyu Wang et.al. 2310.07477 link
2023-10-12 Imitation Learning from Observation with Automatic Discount Scheduling Yuyang Liu et.al. 2310.07433 null
2023-10-11 Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages Guozheng Ma et.al. 2310.07418 link
2023-10-11 RANS: Highly-Parallelised Simulator for Reinforcement Learning based Autonomous Navigating Spacecrafts Matteo El-Hariry et.al. 2310.07393 link
2023-10-11 Learning a Reward Function for User-Preferred Appliance Scheduling Nikolina Čović et.al. 2310.07389 link
2023-10-12 RLaGA: A Reinforcement Learning Augmented Genetic Algorithm For Searching Real and Diverse Marker-Based Landing Violations Linfeng Liang et.al. 2310.07378 null
2023-10-10 Scalable Semantic Non-Markovian Simulation Proxy for Reinforcement Learning Kaustuv Mukherji et.al. 2310.06835 null
2023-10-10 $f$-Policy Gradients: A General Framework for Goal Conditioned RL using $f$ -Divergences Siddhant Agarwal et.al. 2310.06794 null
2023-10-10 Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning Stefan Stojanovic et.al. 2310.06793 null
2023-10-10 Information Content Exploration Jacob Chmura et.al. 2310.06777 null
2023-10-10 EARL: Eye-on-Hand Reinforcement Learner for Dynamic Grasping with Active Pose Estimation Baichuan Huang et.al. 2310.06751 null
2023-10-10 Near-Optimality of Finite-Memory Codes and Reinforcement Learning for Zero-Delay Coding of Markov Sources Liam Cregg et.al. 2310.06742 null
2023-10-10 Solving Inverse Problems with REINFORCE Chen Xu et.al. 2310.06711 null
2023-10-10 Diversity from Human Feedback Ren-Jian Wang et.al. 2310.06648 null
2023-10-10 BridgeHand2Vec Bridge Hand Representation Anna Sztyber-Betley et.al. 2310.06624 link
2023-10-10 SYNLOCO: Synthesizing Central Pattern Generator and Reinforcement Learning for Quadruped Locomotion Xinyu Zhang et.al. 2310.06606 null
2023-10-09 SALMON: Self-Alignment with Principle-Following Reward Models Zhiqing Sun et.al. 2310.05910 link
2023-10-09 DSAC-T: Distributional Soft Actor-Critic with Three Refinements Jingliang Duan et.al. 2310.05858 link
2023-10-09 A Simple Open-Loop Baseline for Reinforcement Learning Locomotion Tasks Antonin Raffin et.al. 2310.05808 null
2023-10-09 Aligning Language Models with Human Preferences via a Bayesian Approach Jiashuo Wang et.al. 2310.05782 link
2023-10-09 RateRL: A Framework for Developing RL-Based Rate Adaptation Algorithms in ns-3 Ruben Queiros et.al. 2310.05772 null
2023-10-09 Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning Trevor McInroe et.al. 2310.05723 null
2023-10-09 DecAP: Decaying Action Priors for Accelerated Learning of Torque-Based Legged Locomotion Policies Shivam Sood et.al. 2310.05714 null
2023-10-09 Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments Xiong-Hui Chen et.al. 2310.05712 null
2023-10-09 Hierarchical Reinforcement Learning for Temporal Pattern Prediction Faith Johnson et.al. 2310.05695 null
2023-10-09 Multi-timestep models for Model-based Reinforcement Learning Abdelhakim Benechehab et.al. 2310.05672 null
2023-10-06 Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets Zhang-Wei Hong et.al. 2310.04413 link
2023-10-06 Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Andy Zhou et.al. 2310.04406 link
2023-10-06 Confronting Reward Model Overoptimization with Constrained RLHF Ted Moskovitz et.al. 2310.04373 link
2023-10-06 Amortizing intractable inference in large language models Edward J. Hu et.al. 2310.04363 link
2023-10-06 Applying Reinforcement Learning to Option Pricing and Hedging Zoran Stoiljkovic et.al. 2310.04336 null
2023-10-06 Adjustable Robust Reinforcement Learning for Online 3D Bin Packing Yuxin Pan et.al. 2310.04323 null
2023-10-06 Searching for Optimal Runtime Assurance via Reachability and Reinforcement Learning Kristina Miller et.al. 2310.04288 null
2023-10-06 DRIFT: Deep Reinforcement Learning for Intelligent Floating Platforms Trajectories Matteo El-Hariry et.al. 2310.04266 link
2023-10-06 Comparing Auxiliary Tasks for Learning Representations for Reinforcement Learning Moritz Lange et.al. 2310.04241 null
2023-10-06 Lending Interaction Wings to Recommender Systems with Conversational Agents Jiarui Jin et.al. 2310.04230 null
2023-10-05 Aligning Text-to-Image Diffusion Models with Reward Backpropagation Mihir Prabhudesai et.al. 2310.03739 link
2023-10-05 Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning Yihang Yao et.al. 2310.03718 null
2023-10-05 A Long Way to Go: Investigating Length Correlations in RLHF Prasann Singhal et.al. 2310.03716 link
2023-10-05 Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization Zhanhui Zhou et.al. 2310.03708 link
2023-10-05 Enhancing Exfiltration Path Analysis Using Reinforcement Learning Riddam Rishu et.al. 2310.03667 null
2023-10-05 Solving a Class of Non-Convex Minimax Optimization in Federated Learning Xidong Wu et.al. 2310.03613 link
2023-10-05 Output Feedback Reinforcement Learning with Parameter Optimisation for Temperature Control in a Material Extrusion Additive Manufacturing system Eleni Zavrakli et.al. 2310.03599 link
2023-10-05 Resilient Legged Local Navigation: Learning to Traverse with Compromised Perception End-to-End Jin Jin et.al. 2310.03581 null
2023-10-05 Reinforcement learning for traversing chemical structure space: Optimizing transition states and minimum energy paths of molecules Rhyan Barrett et.al. 2310.03511 link
2023-10-05 RL-based Stateful Neural Adaptive Sampling and Denoising for Real-Time Path Tracing Antoine Scardigli et.al. 2310.03507 link
2023-10-04 Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Jeonghye Kim et.al. 2310.03022 null
2023-10-04 Proximal Policy Optimization-Based Reinforcement Learning Approach for DC-DC Boost Converter Control: A Comparative Evaluation Against Traditional Control Techniques Utsab Saha et.al. 2310.02945 null
2023-10-04 Searching for High-Value Molecules Using Reinforcement Learning and Transformers Raj Ghugare et.al. 2310.02902 null
2023-10-04 Learning to Scale Logits for Temperature-Conditional GFlowNets Minsu Kim et.al. 2310.02823 link
2023-10-04 Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design Matthew Thomas Jackson et.al. 2310.02782 link
2023-10-04 Reward Model Ensembles Help Mitigate Overoptimization Thomas Coste et.al. 2310.02743 link
2023-10-04 Foundation Reinforcement Learning: towards Embodied Generalist Agents with Foundation Prior Assistance Weirui Ye et.al. 2310.02635 null
2023-10-04 RLTrace: Synthesizing High-Quality System Call Traces for OS Fuzz Testing Wei Chen et.al. 2310.02609 null
2023-10-04 Multi-Agent Reinforcement Learning for Power Grid Topology Optimization Erica van der Sar et.al. 2310.02605 null
2023-10-04 Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning Weidong Liu et.al. 2310.02581 null
2023-10-03 What do we learn from a large-scale study of pre-trained visual representations in sim and real environments? Sneha Silwal et.al. 2310.02219 null
2023-10-03 Towards a Unified Framework for Sequential Decision Making Carlos Núñez-Molina et.al. 2310.02167 null
2023-10-03 Navigating Uncertainty in ESG Investing Jiayue Zhang et.al. 2310.02163 null
2023-10-03 AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Zibin Dong et.al. 2310.02054 null
2023-10-03 Probabilistic Reach-Avoid for Bayesian Neural Networks Matthew Wicker et.al. 2310.01951 link
2023-10-03 Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency Francisco Roldan Sanchez et.al. 2310.01827 link
2023-10-03 Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI Emily Jin et.al. 2310.01824 link
2023-10-03 Differentially Encoded Observation Spaces for Perceptive Reinforcement Learning Lev Grossman et.al. 2310.01767 link
2023-10-04 Blending Imitation and Reinforcement Learning for Robust Policy Improvement Xuefeng Liu et.al. 2310.01737 null
2023-10-03 On Representation Complexity of Model-based and Model-free Reinforcement Learning Hanlin Zhu et.al. 2310.01706 null

(back to top)

SLAM

Publish Date Title Authors PDF Code
2024-07-01 Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation Lianjie Guo et.al. 2407.01292 link
2024-07-07 Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy Chen Wang et.al. 2406.16087 null
2024-06-16 Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry Boris Chidlovskii et.al. 2406.11019 null
2024-06-12 From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers Swaminathan Gurumurthy et.al. 2406.07785 link
2024-06-03 The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry Paolo Cudrano et.al. 2406.01797 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-06-20 Advancements in Translation Accuracy for Stereo Visual-Inertial Initialization Han Song et.al. 2405.15082 null
2024-06-08 EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving Boyi Liu et.al. 2405.12120 null
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241 null
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290 null
2024-05-07 IMU-Aided Event-based Stereo Visual Odometry Junkai Niu et.al. 2405.04071 link
2024-04-27 An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation Olivier Brochu Dufour et.al. 2404.17745 null
2024-04-26 Camera Motion Estimation from RGB-D-Inertial Scene Flow Samuel Cerezo et.al. 2404.17251 null
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263 link
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 VBR: A Vision Benchmark in Rome Leonardo Brizi et.al. 2404.11322 link
2024-04-14 Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration Yanhao Zhang et.al. 2404.09169 link
2024-04-06 Salient Sparse Visual Odometry With Pose-Only Supervision Siyu Chen et.al. 2404.04677 null
2024-03-25 A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments Gianluca D'Amico et.al. 2403.17084 null
2024-03-19 On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine Jagatpreet Singh Nir et.al. 2403.13170 null
2024-03-18 The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions Margaret Hansen et.al. 2403.12194 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-16 Efficient Domain Adaptation for Endoscopic Visual Odometry Junyang Wu et.al. 2403.10860 null
2024-03-14 Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) Matthew Lisondra et.al. 2403.09882 null
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-22 Secure Navigation using Landmark-based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2402.14280 null
2024-02-19 Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment Ganesh Sapkota et.al. 2402.12551 null
2024-02-07 Online and Certifiably Correct Visual Odometry and Mapping Devansh R Agrawal et.al. 2402.05254 null
2024-02-06 YOLOPoint Joint Keypoint and Object Detection Anton Backhaus et.al. 2402.03989 link
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857 null
2024-01-17 Event-Based Visual Odometry on Non-Holonomic Ground Vehicles Wanting Xu et.al. 2401.09331 link
2024-01-11 On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering Feng Zhu et.al. 2401.05836 null
2023-12-19 Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry Olaya Álvarez-Tuñón et.al. 2401.05396 link
2024-01-07 Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people Ali Samadzadeh et.al. 2401.03604 link
2024-01-03 LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry Weirong Chen et.al. 2401.01887 null
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800 link
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-22 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332 null
2023-12-20 Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach Habib Boloorchi Tabrizi et.al. 2312.13162 link
2023-12-20 Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera Abdulkadhem A. Abdulkadhem et.al. 2312.12680 null
2023-12-15 Deep Event Visual Odometry Simon Klenk et.al. 2312.09800 link
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141 null
2023-11-30 Event-based Visual Inertial Velometer Xiuyuan Lu et.al. 2311.18189 null
2023-11-21 CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems Young-Hee Lee et.al. 2311.12580 null
2023-11-10 Dense Visual Odometry Using Genetic Algorithm Slimane Djema et.al. 2311.06149 null
2023-11-07 Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM Seongwook Yoon et.al. 2311.03722 null
2023-10-23 Converting Depth Images and Point Clouds for Feature-based Pose Estimation Robert Lösch et.al. 2310.14924 link
2023-10-17 Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms Yanyan Li et.al. 2310.10931 link
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 l-dyno: framework to learn consistent visual features using robot's motion Kartikeya Singh et.al. 2310.06249 link
2023-10-08 XVO: Generalized Visual Odometry via Cross-Modal Self-Training Lei Lai et.al. 2309.16772 null
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-23 Tag-based Visual Odometry Estimation for Indoor UAVs Localization Massimiliano Bertoni et.al. 2309.13311 null
2023-09-22 Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms Olivier Gamache et.al. 2309.13139 link
2023-09-20 Conformalized Multimodal Uncertainty Regression and Reasoning Domenico Parente et.al. 2309.11018 null
2023-09-20 OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving Heng Li et.al. 2309.11011 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-21 Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration Hongbo Zhao et.al. 2309.10314 null
2023-09-18 End-to-End Learned Event- and Image-based Visual Odometry Roberto Pellerito et.al. 2309.09947 null
2023-09-14 An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments Yehao Liu et.al. 2309.07408 null

(back to top)

NeRF

Publish Date Title Authors PDF Code
2024-07-15 AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems Alexey Kotcov et.al. 2407.10865 null
2024-07-15 Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis Antoine Legrand et.al. 2407.10762 null
2024-07-15 IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild Shuaixian Wang et.al. 2407.10695 null
2024-07-15 NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis Yubin Hu et.al. 2407.10482 null
2024-07-15 Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering Francesco Di Sario et.al. 2407.10389 null
2024-07-14 RS-NeRF: Neural Radiance Fields from Rolling Shutter Images Muyao Niu et.al. 2407.10267 link
2024-07-14 SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion Jiyuan Zhang et.al. 2407.10062 null
2024-07-12 Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction Yiming Wang et.al. 2407.09679 null
2024-07-12 Radiance Fields from Photons Sacha Jungerman et.al. 2407.09386 null
2024-07-12 HPC: Hierarchical Progressive Coding Framework for Volumetric Video Zihan Zheng et.al. 2407.09026 null
2024-07-11 Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction Shariq Nadeem Malik et.al. 2407.08795 null
2024-07-11 WildGaussians: 3D Gaussian Splatting in the Wild Jonas Kulhanek et.al. 2407.08447 null
2024-07-11 MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos Yushuo Chen et.al. 2407.08414 link
2024-07-11 Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression Yuke Xing et.al. 2407.08165 null
2024-07-11 Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields Haojie Lian et.al. 2407.08154 null
2024-07-11 Survey on Fundamental Deep Learning 3D Reconstruction Techniques Yonge Bai et.al. 2407.08137 null
2024-07-10 Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model Qi Song et.al. 2407.07735 null
2024-07-10 Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field Ganlin Yang et.al. 2407.07461 null
2024-07-09 Reference-based Controllable Scene Stylization with Gaussian Splatting Yiqun Mei et.al. 2407.07220 null
2024-07-09 Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View Dogyoon Lee et.al. 2407.06613 null
2024-07-08 RRM: Relightable assets using Radiance guided Material extraction Diego Gomez et.al. 2407.06397 null
2024-07-08 PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes Mohammad Reza Karimi Dastjerdi et.al. 2407.06150 null
2024-07-08 Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views Jiawei Guo et.al. 2407.05666 null
2024-07-08 GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields Weiyi Xue et.al. 2407.05597 null
2024-07-08 Dynamic Neural Radiance Field From Defocused Monocular Video Xianrui Luo et.al. 2407.05586 null
2024-07-07 GaussReg: Fast 3D Registration with Gaussian Splatting Jiahao Chang et.al. 2407.05254 null
2024-07-06 SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction Weixing Xie et.al. 2407.05023 null
2024-07-04 CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images Junghe Lee et.al. 2407.03923 null
2024-07-02 MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering Ahmad AlMughrabi et.al. 2407.02668 null
2024-07-03 BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream Wenpu Li et.al. 2407.02174 link
2024-07-01 Active Human Pose Estimation via an Autonomous UAV Agent Jingxi Chen et.al. 2407.01811 null
2024-07-01 DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction Yujin Ham et.al. 2407.01761 null
2024-07-01 Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation Zihan Gao et.al. 2407.01220 null
2024-06-29 Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing Alireza Moazeni et.al. 2407.00500 null
2024-06-28 ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction Ding-Jiun Huang et.al. 2406.20066 null
2024-06-28 EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting Daiwei Zhang et.al. 2406.19811 null
2024-06-27 Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views Zongyu Li et.al. 2406.18840 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 link
2024-06-25 NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods Jonas Kulhanek et.al. 2406.17345 null
2024-06-24 From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking Xiaohao Xu et.al. 2406.16850 link
2024-06-24 Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis Jianning Deng et.al. 2406.16623 null
2024-06-24 Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction Tong Qin et.al. 2406.16289 null
2024-06-23 Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study Zhe Wang et.al. 2406.16068 null
2024-06-23 Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction Yangdi Lu et.al. 2406.15982 null
2024-06-22 psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery Tongtong Zhang et.al. 2406.15707 null
2024-06-21 A3D: Does Diffusion Dream about 3D Alignment? Savva Ignatyev et.al. 2406.15020 null
2024-06-21 E2GS: Event Enhanced Gaussian Splatting Hiroyuki Deguchi et.al. 2406.14978 link
2024-06-21 Relighting Scenes with Object Insertions in Neural Radiance Fields Xuening Zhu et.al. 2406.14806 null
2024-06-20 Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment Yunshan Qi et.al. 2406.14360 null
2024-06-19 NeRF-Feat: 6D Object Pose Estimation using Feature Rendering Shishir Reddy Vutukur et.al. 2406.13796 null
2024-06-19 Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images Haruo Fujiwara et.al. 2406.13393 null
2024-06-19 Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields Youngin Park et.al. 2406.13251 link
2024-06-18 Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models Paul Henderson et.al. 2406.13099 null
2024-06-18 Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings Ruijie Tang et.al. 2406.13048 null
2024-06-18 Fast Global Localization on Neural Radiance Field Mangyu Kong et.al. 2406.12202 null
2024-06-20 TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations Bo Sun et.al. 2406.12121 null
2024-06-17 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-17 Uncertainty modeling for fine-tuned implicit functions Anna Susmelj et.al. 2406.12082 null
2024-06-17 LLaNA: Large Language and NeRF Assistant Andrea Amaduzzi et.al. 2406.11840 null
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766 null
2024-06-17 InterNeRF: Scaling Radiance Fields via Parameter Interpolation Clinton Wang et.al. 2406.11737 null
2024-06-17 NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation Niu Guanchen et.al. 2406.11259 null
2024-06-15 NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows Zhenggang Tang et.al. 2406.10543 link
2024-06-15 Federated Neural Radiance Field for Distributed Intelligence Yintian Zhang et.al. 2406.10474 null
2024-06-14 Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections Jiacong Xu et.al. 2406.10373 null
2024-06-14 PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting Alex Hanson et.al. 2406.10219 null
2024-06-14 GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors Xiqian Yu et.al. 2406.10111 null
2024-06-14 OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control Yuzhong Huang et.al. 2406.10000 null
2024-06-14 dGrasp: NeRF-Informed Implicit Grasp Policies with Supervised Optimization Slopes Gergely Sóti et.al. 2406.09939 null
2024-06-14 RaNeuS: Ray-adaptive Neural Surface Reconstruction Yida Wang et.al. 2406.09801 link
2024-06-13 Rethinking Score Distillation as a Bridge Between Image Distributions David McAllister et.al. 2406.09417 null
2024-06-13 Preserving Identity with Variational Score for General-purpose 3D Editing Duong H. Le et.al. 2406.08953 null
2024-06-13 Neural NeRF Compression Tuan Pham et.al. 2406.08943 null
2024-06-14 AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis Swapnil Bhosale et.al. 2406.08920 null
2024-06-13 NeRF Director: Revisiting View Selection in Neural Volume Rendering Wenhui Xiao et.al. 2406.08839 null
2024-06-12 ICE-G: Image Conditional Editing of 3D Gaussian Splats Vishnu Jaganathan et.al. 2406.08488 null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 link
2024-06-12 Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering Yuru Xiao et.al. 2406.07828 link
2024-06-11 C3DAG: Controlled 3D Animal Generation using 3D pose guidance Sandeep Mishra et.al. 2406.07742 null
2024-06-11 M-LRM: Multi-view Large Reconstruction Model Mengfei Li et.al. 2406.07648 null
2024-06-11 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Christopher D. Hsu et.al. 2406.07431 null
2024-06-11 Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion Xin Yuan et.al. 2406.06972 null
2024-06-11 Neural Visibility Field for Uncertainty-Driven Active Mapping Shangjie Xue et.al. 2406.06948 null
2024-06-10 IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao et.al. 2406.06527 null
2024-06-10 GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation Haozhe Xie et.al. 2406.06526 null
2024-06-10 PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction Danpeng Chen et.al. 2406.06521 null
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216 link
2024-06-10 ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models Meng-Li Shih et.al. 2406.06133 null
2024-06-09 GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement Peiye Zhuang et.al. 2406.05649 null
2024-06-07 Multiplane Prior Guided Few-Shot Aerial Scene Rendering Zihan Gao et.al. 2406.04961 null
2024-06-07 Multi-style Neural Radiance Field with AdaIN Yu-Wen Pao et.al. 2406.04960 link
2024-06-06 Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization Takuhiro Kaneko et.al. 2406.04155 null
2024-06-06 How Far Can We Compress Instant-NGP-Based NeRF? Yihang Chen et.al. 2406.04101 link
2024-06-06 Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling Xinhang Liu et.al. 2406.03723 null
2024-06-06 Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction Diwen Wan et.al. 2406.03697 null
2024-06-04 3D-HGS: 3D Half-Gaussian Splatting Haolin Li et.al. 2406.02720 link
2024-06-06 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning Jiaxu Wang et.al. 2406.02370 null
2024-06-03 Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting Shaojie Ma et.al. 2406.01593 null
2024-06-03 Tetrahedron Splatting for 3D Generation Chun Gu et.al. 2406.01579 link
2024-06-03 Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting Fang Li et.al. 2406.01042 link
2024-06-02 PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency Yeonsung Jung et.al. 2406.00798 null
2024-06-02 Representing Animatable Avatar via Factorized Neural Fields Chunjin Song et.al. 2406.00637 null
2024-06-04 SuperGaussian: Repurposing Video Models for 3D Super Resolution Yuan Shen et.al. 2406.00609 null
2024-06-02 Efficient Neural Light Fields (ENeLF) for Mobile Devices Austin Peng et.al. 2406.00598 null
2024-06-01 Bilateral Guided Radiance Field Processing Yuehao Wang et.al. 2406.00448 null
2024-05-31 R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction Ruyi Zha et.al. 2405.20693 link
2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation Haiyu Zhang et.al. 2405.20674 null
2024-05-30 $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving Nan Huang et.al. 2405.20323 link
2024-05-30 TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes Minghao Guo et.al. 2405.20283 null
2024-05-31 NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation Pedro Martin et.al. 2405.20078 null
2024-05-30 IReNe: Instant Recoloring in Neural Radiance Fields Alessio Mazzucchelli et.al. 2405.19876 null
2024-05-30 HINT: Learning Complete Human Neural Representations from Limited Viewpoints Alessandro Sanvito et.al. 2405.19712 null
2024-05-30 View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields Haodi He et.al. 2405.19678 link
2024-05-29 Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy Zijie Jiang et.al. 2405.18863 null
2024-06-02 NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild Weining Ren et.al. 2405.18715 link
2024-05-28 Self-supervised Pre-training for Transferable Multi-modal Perception Xiaohao Xu et.al. 2405.17942 null
2024-05-28 A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction Bin Zhang et.al. 2405.17891 null
2024-05-29 HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction Haoyu Zhao et.al. 2405.17872 null
2024-05-28 Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh Xiangjun Gao et.al. 2405.17811 null
2024-05-28 F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting Xiangyu Sun et.al. 2405.17083 null
2024-05-29 PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting Zipeng Wang et.al. 2405.16829 null
2024-05-26 Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors Soumava Paul et.al. 2405.16517 null
2024-05-24 Neural Elevation Models for Terrain Mapping and Path Planning Adam Dai et.al. 2405.15227 link
2024-05-27 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting Yuanhao Cai et.al. 2405.15125 link
2024-05-24 GS-Hider: Hiding Messages into 3D Gaussian Splatting Xuanyu Zhang et.al. 2405.15118 null
2024-05-23 NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections Dor Verbin et.al. 2405.14871 null
2024-05-23 Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling Liwen Wu et.al. 2405.14847 null
2024-05-23 Camera Relocalization in Shadow-free Neural Radiance Fields Shiyao Xu et.al. 2405.14824 link
2024-05-23 LDM: Large Tensorial SDF Model for Textured Mesh Generation Rengan Xie et.al. 2405.14580 null
2024-05-23 JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression Zihan Zheng et.al. 2405.14452 null
2024-05-22 DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus Yu Chen et.al. 2405.13943 null
2024-05-22 Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances Licheng Shen et.al. 2405.13694 null
2024-05-21 MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video Hongsheng Wang et.al. 2405.12806 null
2024-05-21 Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations Antoine Legrand et.al. 2405.12728 null
2024-05-20 Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo Tianqi Liu et.al. 2405.12218 null
2024-05-20 Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents Guanlin Wu et.al. 2405.12155 null
2024-05-20 NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo Fotios Logothetis et.al. 2405.12057 null
2024-05-19 Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems Shengxiang Sun et.al. 2405.11629 null
2024-05-19 R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments Huiying Yang et.al. 2405.11541 null
2024-05-18 MotionGS : Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-16 When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models Xianzheng Ma et.al. 2405.10255 link
2024-05-15 From NeRFs to Gaussian Splats, and Back Siming He et.al. 2405.09717 link
2024-05-14 Dynamic NeRF: A Review Jinwei Lin et.al. 2405.08609 null
2024-05-13 Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs Mingyu Kim et.al. 2405.07857 link
2024-05-12 Point Resampling and Ray Transformation Aid to Editable NeRF Models Zhenyang Li et.al. 2405.07306 null
2024-05-12 Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction Ekansh Agrawal et.al. 2405.07178 null
2024-05-11 TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization Zhen Tan et.al. 2405.07027 link
2024-05-10 LIVE: LaTex Interactive Visual Editing Jinwei Lin et.al. 2405.06762 null
2024-05-14 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461 null
2024-05-10 Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering Xiaohan Zhang et.al. 2405.06214 null
2024-05-10 Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation Bardienus P. Duisterhof et.al. 2405.06181 null
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800 null
2024-05-10 NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior Gihoon Kim et.al. 2405.05749 null
2024-05-09 RPBG: Towards Robust Neural Point-based Graphics in the Wild Qingtian Zhu et.al. 2405.05663 link
2024-05-09 Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview Yuhang Ming et.al. 2405.05526 null
2024-05-08 ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields Ning Wang et.al. 2405.05010 null
2024-05-08 DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid Sidun Liu et.al. 2405.04416 null
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345 null
2024-05-05 Blending Distributed NeRFs with Tri-stage Robust Pose Optimization Baijun Ye et.al. 2405.02880 null
2024-05-05 MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior Honghua Chen et.al. 2405.02859 null
2024-05-04 TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes Christopher Maxey et.al. 2405.02762 null
2024-05-04 ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty Hyunseo Kim et.al. 2405.02568 null
2024-05-03 Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning Dhruva Tirumala et.al. 2405.02425 null
2024-05-03 Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids Junchen Liu et.al. 2405.02386 link
2024-05-03 WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights Youngdong Jang et.al. 2405.02066 null
2024-05-02 NeRF in Robotics: A Survey Guangming Wang et.al. 2405.01333 null
2024-05-04 LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes Shanlin Sun et.al. 2405.00900 null
2024-05-01 Depth Priors in Removal Neural Radiance Fields Zhihao Guo et.al. 2405.00630 null
2024-05-01 NeRF-Guided Unsupervised Learning of RGB-D Registration Zhinan Yu et.al. 2405.00507 null
2024-05-01 RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting Zhexi Peng et.al. 2404.19706 null
2024-04-30 NeRF-Insert: 3D Local Editing with Multimodal Control Signals Benet Oriol Sabat et.al. 2404.19204 null
2024-04-29 SAGS: Structure-Aware 3D Gaussian Splatting Evangelos Ververas et.al. 2404.19149 null
2024-04-29 GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting Bo Chen et.al. 2404.19040 null
2024-04-29 Embedded Representation Learning Network for Animating Styled Video Portrait Tianyong Wang et.al. 2404.19038 null
2024-04-29 Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions Nagabhushan Somraj et.al. 2404.19015 null
2024-04-28 S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM Zhiyao Zhang et.al. 2404.18284 null
2024-04-27 DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction Chenhe Du et.al. 2404.17890 null
2024-04-26 Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields Tianqi Liu et.al. 2404.17528 link
2024-04-25 Depth Supervised Neural Surface Reconstruction from Airborne Imagery Vincent Hackstein et.al. 2404.16429 null
2024-04-24 NeRF-XL: Scaling NeRFs with Multiple GPUs Ruilong Li et.al. 2404.16221 null
2024-04-24 ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images Jinseo Jeong et.al. 2404.15707 null
2024-04-23 DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft Sam Earle et.al. 2404.15538 null
2024-04-28 GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting Hongyun Yu et.al. 2404.14037 null
2024-04-22 NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation Chi Huang et.al. 2404.13921 null
2024-04-23 CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory Yunlong Ran et.al. 2404.13896 null
2024-04-26 Neural Radiance Field in Autonomous Driving: A Survey Lei He et.al. 2404.13816 null
2024-04-26 ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis Zichen Tang et.al. 2404.13711 link
2024-04-21 Generalizable Novel-View Synthesis using a Stereo Camera Haechan Lee et.al. 2404.13541 null
2024-04-20 High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces Baoru Huang et.al. 2404.13437 null
2024-04-20 EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment Guanghao Li et.al. 2404.13346 link
2024-04-19 FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction Maria Dronova et.al. 2404.12970 null
2024-04-22 Does Gaussian Splatting need SFM Initialization? Yalda Foroutan et.al. 2404.12547 null
2024-04-18 MeshLRM: Large Reconstruction Model for High-Quality Mesh Xinyue Wei et.al. 2404.12385 null
2024-04-18 AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering Jingfeng Guo et.al. 2404.11897 null
2024-04-18 Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations Yu Feng et.al. 2404.11852 null
2024-04-17 SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping Vincent Cartillier et.al. 2404.11419 null
2024-04-16 Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks Florian Barthel et.al. 2404.10625 null
2024-04-16 Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences Seungwook Kim et.al. 2404.10603 null
2024-04-16 1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction Hang Du et.al. 2404.10441 null
2024-04-16 SRGS: Super-Resolution 3D Gaussian Splatting Xiang Feng et.al. 2404.10318 null
2024-04-16 Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal Yoshio Kato et.al. 2404.10272 link
2024-04-15 Taming Latent Diffusion Model for Neural Radiance Field Inpainting Chieh Hubert Lin et.al. 2404.09995 null
2024-04-15 Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video Hongchi Xia et.al. 2404.09833 null
2024-04-15 DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading Tong Wu et.al. 2404.09412 null
2024-04-14 VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field Fei Xue et.al. 2404.09271 link
2024-04-15 OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering Jingrui Ye et.al. 2404.08449 null
2024-04-12 GPN: Generative Point-based NeRF Haipeng Wang et.al. 2404.08312 link
2024-04-12 MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance Yuqun Wu et.al. 2404.08252 null
2024-04-11 Connecting NeRFs, Images, and Text Francesco Ballerini et.al. 2404.07993 null
2024-04-11 Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation Keonhee Han et.al. 2404.07933 link
2024-04-12 NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving William Ljungbergh et.al. 2404.07762 link
2024-04-11 G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images Zixiong Huang et.al. 2404.07474 link
2024-04-10 SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection Mathis Kruse et.al. 2404.06832 link
2024-04-10 MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views Runfa Li et.al. 2404.06753 null
2024-04-10 Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields Sibeak Lee et.al. 2404.06727 link
2024-04-11 SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera Gaole Dai et.al. 2404.06710 null
2024-04-09 Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion Fan Yang et.al. 2404.06429 null
2024-04-09 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis Zhicheng Lu et.al. 2404.06270 null
2024-04-09 GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields Arnab Dey et.al. 2404.06246 null
2024-04-09 HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields Arnab Dey et.al. 2404.06152 null
2024-04-08 Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation Y. Wang et.al. 2404.05236 null
2024-04-08 StylizedGS: Controllable Stylization for 3D Gaussian Splatting Dingxi Zhang et.al. 2404.05220 null
2024-04-08 Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos Fengrui Tian et.al. 2404.05163 link
2024-04-07 CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis Gyeongjin Kang et.al. 2404.04913 null
2024-04-07 GauU-Scene V2: Expanse Lidar Image Dataset Shows Unreliable Geometric Reconstruction Using Gaussian Splatting and NeRF Butian Xiong et.al. 2404.04880 null
2024-04-07 NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization Peng Tu et.al. 2404.04875 null
2024-04-06 DATENeRF: Depth-Aware Text-based Editing of NeRFs Sara Rojas et.al. 2404.04526 null
2024-04-05 Robust Gaussian Splatting François Darmon et.al. 2404.04211 null
2024-04-04 SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer Zijie Wu et.al. 2404.03736 null
2024-04-07 RaFE: Generative Radiance Fields Restoration Zhongkai Wu et.al. 2404.03654 null
2024-04-04 OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views Francis Engelmann et.al. 2404.03650 null
2024-04-04 VF-NeRF: Viewshed Fields for Rigid NeRF Registration Leo Segre et.al. 2404.03349 null
2024-04-03 GenN2N: Generative NeRF2NeRF Translation Xiangyue Liu et.al. 2404.02788 null
2024-04-03 LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis Zehan Zheng et.al. 2404.02742 link
2024-04-03 Neural Radiance Fields with Torch Units Bingnan Ni et.al. 2404.02617 null
2024-04-03 Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition Yisheng He et.al. 2404.02514 null
2024-04-02 NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation Sicheng Li et.al. 2404.02185 null
2024-04-02 Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields Joshua Ahn et.al. 2404.02155 null
2024-04-02 Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions Saptarshi Dasgupta et.al. 2404.01812 null
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400 null
2024-04-01 NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields Muhammad Zubair Irshad et.al. 2404.01300 null
2024-04-01 MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space Armand Comas-Massagué et.al. 2404.01296 null
2024-04-02 StructLDM: Structured Latent Diffusion for 3D Human Generation Tao Hu et.al. 2404.01241 null
2024-04-01 Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting Jiarui Meng et.al. 2404.01168 null
2024-04-01 SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance Yuru Xiao et.al. 2404.00992 null
2024-04-01 FlexiDreamer: Single Image-to-3D Generation with FlexiCubes Ruowen Zhao et.al. 2404.00987 link
2024-04-01 Marrying NeRF with Feature Matching for One-step Pose Estimation Ronghan Chen et.al. 2404.00891 null
2024-03-29 HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes Ke Wu et.al. 2403.20159 null
2024-03-29 Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior Jaehoon Ko et.al. 2403.20153 link
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079 null
2024-03-29 NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising Tianchen Deng et.al. 2403.20034 link
2024-03-29 SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image Yunhao Li et.al. 2403.20018 link
2024-03-29 DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal Yunhao Li et.al. 2403.20013 link
2024-03-29 Stable Surface Regularization for Fast Few-Shot NeRF Byeongin Joung et.al. 2403.19985 null
2024-03-29 MI-NeRF: Learning a Single Face NeRF from Multiple Identities Aggelina Chatziagapi et.al. 2403.19920 null
2024-03-28 Mitigating Motion Blur in Neural Radiance Fields with Events and Frames Marco Cannici et.al. 2403.19780 link
2024-03-28 SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects Avinash Ummadisingu et.al. 2403.19607 null
2024-03-28 CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians Avinash Paliwal et.al. 2403.19495 null
2024-03-28 Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation Yujin Chen et.al. 2403.19319 null
2024-03-28 Sine Activated Low-Rank Matrices for Parameter Efficient Learning Yiping Ji et.al. 2403.19243 null
2024-03-29 Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Qiuhong Shen et.al. 2403.18795 link
2024-03-27 SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery Camille Billouard et.al. 2403.18711 link
2024-03-27 Modeling uncertainty for Gaussian Splatting Luca Savant et.al. 2403.18476 null
2024-03-26 Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians Kerui Ren et.al. 2403.17898 link
2024-03-26 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation Jiahao Chen et.al. 2403.17537 null
2024-03-25 VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation Yang Chen et.al. 2403.17001 null
2024-03-25 CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs Yingji Zhong et.al. 2403.16885 null
2024-03-25 Spike-NeRF: Neural Radiance Field Based On Spike Camera Yijia Guo et.al. 2403.16410 null
2024-03-24 Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields Haoyuan Wang et.al. 2403.16224 null
2024-03-24 Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes Takashi Otonari et.al. 2403.16141 null
2024-03-24 CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field Jiarui Hu et.al. 2403.16095 null
2024-03-24 Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap Carl Lindström et.al. 2403.16092 null
2024-03-26 PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling Xiaoyun Zheng et.al. 2403.16080 link
2024-03-24 Semantic Is Enough: Only Semantic Information For NeRF Reconstruction Ruibo Wang et.al. 2403.16043 null
2024-03-24 Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields unhong Zhao et.al. 2403.15981 null
2024-03-23 DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation Mu-Yi Shen et.al. 2403.15791 link
2024-03-23 UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation Yuliang Guo et.al. 2403.15705 null
2024-03-22 WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization Jialu Wang et.al. 2403.15272 null
2024-03-21 Hyperspectral Neural Radiance Fields Gerry Chen et.al. 2403.14839 null
2024-03-21 ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition Tianhao Wu et.al. 2403.14619 null
2024-03-21 CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis Matteo Bonotto et.al. 2403.14412 link
2024-03-21 InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity Jiabin Liang et.al. 2403.14376 null
2024-03-21 Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions Jiacong Xu et.al. 2403.14053 null
2024-03-20 MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination Weiying Wang et.al. 2403.13348 null
2024-03-19 Depth-guided NeRF Training via Earth Mover's Distance Anita Rau et.al. 2403.13206 null
2024-03-19 DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images Zaid Tasneem et.al. 2403.13199 null
2024-03-19 Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering Mingqi Shao et.al. 2403.12839 null
2024-03-19 Learning Neural Volumetric Pose Features for Camera Localization Jingyu Lin et.al. 2403.12800 null
2024-03-19 IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model Matteo Bortolon et.al. 2403.12682 null
2024-03-18 FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos Florian Philipp Stilz et.al. 2403.12198 null
2024-03-18 ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis Mariam Hassan et.al. 2403.12154 link
2024-03-18 RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF Sibi Catley-Chandar et.al. 2403.11909 null
2024-03-18 GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors LI Yang et.al. 2403.11899 null
2024-03-18 Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging Mert Özer et.al. 2403.11865 null
2024-03-19 BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting Lingzhe Zhao et.al. 2403.11831 link
2024-03-18 Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery Yuqi Zhang et.al. 2403.11812 link
2024-03-18 Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes Antoine Schnepf et.al. 2403.11678 null
2024-03-18 UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling Yujiao Jiang et.al. 2403.11589 null
2024-03-18 Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem Mincheol Chang et.al. 2403.11573 null
2024-03-17 Creating Seamless 3D Maps Using Radiance Fields Sai Tarun Sathyan et.al. 2403.11364 null
2024-03-17 SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream Lin Zhu et.al. 2403.11222 link
2024-03-17 Recent Advances in 3D Gaussian Splatting Tong Wu et.al. 2403.11134 null
2024-03-17 Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications Yonggan Fu et.al. 2403.11131 null
2024-03-16 Fast Sparse View Guided NeRF Update for Object Reconfigurations Ziqi Lu et.al. 2403.11024 null
2024-03-16 HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering Seunghyeon Seo et.al. 2403.10906 null
2024-03-16 MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field Dongyu Yan et.al. 2403.10840 null
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516 link
2024-03-15 Thermal-NeRF: Neural Radiance Fields from an Infrared Camera Tianxiang Ye et.al. 2403.10340 null
2024-03-15 Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression Huy-Hoang Bui et.al. 2403.10297 link
2024-03-15 GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time Hao Li et.al. 2403.10147 null
2024-03-15 URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields Bo Xu et.al. 2403.10119 null
2024-03-15 DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video Huiqiang Sun et.al. 2403.10103 null
2024-03-15 Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience Xiaohang Yu et.al. 2403.09973 null
2024-03-14 GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping Yuhang Zheng et.al. 2403.09637 link
2024-03-14 The NeRFect Match: Exploring NeRF Features for Visual Localization Qunjie Zhou et.al. 2403.09577 null
2024-03-14 VIRUS-NeRF -- Vision, InfraRed and UltraSonic based Neural Radiance Fields Nicolaj Schmid et.al. 2403.09477 link
2024-03-14 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation Frank Zhang et.al. 2403.09439 null
2024-03-14 RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes Thang-Anh-Quan Nguyen et.al. 2403.09419 null
2024-03-14 PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors Tianyuan Yuan et.al. 2403.09079 link
2024-03-13 Gaussian Splatting in Style Abhishek Saroha et.al. 2403.08498 null
2024-03-13 StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields Hongbin Xu et.al. 2403.08310 link
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 null
2024-03-12 Q-SLAM: Quadric Representations for Monocular SLAM Chensheng Peng et.al. 2403.08125 null
2024-03-12 SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields Jungho Lee et.al. 2403.07547 link
2024-03-11 SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection Yifu Tao et.al. 2403.06877 null
2024-03-11 Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis Chenhao Zhang et.al. 2403.06505 null
2024-03-13 FSViewFusion: Few-Shots View Generation of Novel Objects Rukhshanda Hussain et.al. 2403.06394 null
2024-03-10 Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? Hanxin Zhu et.al. 2403.06092 null
2024-03-09 Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving Junyi Cao et.al. 2403.05907 link
2024-03-09 Large Generative Model Assisted 3D Semantic Communication Feibo Jiang et.al. 2403.05783 null
2024-03-08 GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting Francesco Palandra et.al. 2403.05154 null
2024-03-08 Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces Evangelos Skartados et.al. 2403.04508 null
2024-03-07 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis Yuanhao Cai et.al. 2403.04116 link
2024-03-08 DNAct: Diffusion Guided Multi-Task 3D Policy Learning Ge Yan et.al. 2403.04115 null
2024-03-07 Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs Nikhil Mishra et.al. 2403.04114 link
2024-03-06 GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding Zi-Ting Chou et.al. 2403.03608 null
2024-03-05 A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction Haofan Lu et.al. 2403.03241 null
2024-03-05 Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps Timothy Chen et.al. 2403.02751 null
2024-03-04 DaReNeRF: Direction-aware Representation for Dynamic Scenes Ange Lou et.al. 2403.02265 null
2024-03-04 Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views Shuai Guo et.al. 2403.02063 null
2024-03-02 NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning Linsheng Chen et.al. 2403.01325 link
2024-03-02 Neural radiance fields-based holography [Invited] Minsung Kang et.al. 2403.01137 null
2024-03-02 Neural Field Classifiers via Target Encoding and Classification Loss Xindi Yang et.al. 2403.01058 null
2024-03-01 DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots Chunlin Li et.al. 2403.00228 null
2024-02-28 NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images Jingrui Yu et.al. 2402.18196 null
2024-02-26 Neural Radiance Fields in Medical Imaging: Challenges and Next Steps Xin Wang et.al. 2402.17797 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768 null
2024-02-27 VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction Jiaqi Lin et.al. 2402.17427 null
2024-02-27 Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis Zicheng Zhang et.al. 2402.17364 link
2024-02-27 DivAvatar: Diverse 3D Avatar Generation with a Single Prompt Weijing Tao et.al. 2402.17292 null
2024-02-27 CharNeRF: 3D Character Generation from Concept Art Eddy Chu et.al. 2402.17115 null
2024-02-26 Disentangled 3D Scene Generation with Layout Learning Dave Epstein et.al. 2402.16936 null
2024-02-26 CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency Hanxin Zhu et.al. 2402.16407 null
2024-02-26 SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field Zetian Song et.al. 2402.16366 null
2024-02-26 DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer Yizhe Wu et.al. 2402.16308 null
2024-02-22 Consolidating Attention Features for Multi-view Image Editing Or Patashnik et.al. 2402.14792 null
2024-02-26 FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis Yan Xing et.al. 2402.14586 null
2024-02-22 NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection Chenxi Huang et.al. 2402.14464 link
2024-02-22 TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization Renyi Mao et.al. 2402.14415 null
2024-02-22 Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields Seungtae Nam et.al. 2402.14196 null
2024-02-21 Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting Joongho Jo et.al. 2402.13827 null
2024-02-21 SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields Zhentao Huang et.al. 2402.13510 null
2024-02-20 How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey Fabio Tosi et.al. 2402.13255 link
2024-02-20 Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields Bo-Yu Cheng et.al. 2402.13252 link
2024-02-20 NeRF Solves Undersampled MRI Reconstruction Tae Jun Jang et.al. 2402.13226 null
2024-02-20 OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow Simon Boeder et.al. 2402.12792 null
2024-02-19 Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis Christian Reiser et.al. 2402.12377 null
2024-02-19 Colorizing Monochromatic Radiance Fields Yean Cheng et.al. 2402.12184 null
2024-02-17 Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review Thang-Anh-Quan Nguyen et.al. 2402.11141 link
2024-02-15 Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions Muhammad Arbab Arshad et.al. 2402.10344 null
2024-02-14 PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments Xiuzhong Hu et.al. 2402.09325 link
2024-02-13 Preconditioners for the Stochastic Training of Implicit Neural Representations Shin-Fang Chng et.al. 2402.08784 null
2024-02-13 NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs Michael Fischer et.al. 2402.08622 null
2024-02-13 H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields Minyoung Park et.al. 2402.08138 null
2024-02-12 DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation Chenchang Li et.al. 2402.07648 null
2024-02-11 BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis Leandro A. Passos et.al. 2402.07310 link
2024-02-11 3D Gaussian as a New Vision Era: A Survey Ben Fei et.al. 2402.07181 null
2024-02-09 ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting Georgii Stanishevskii et.al. 2402.06390 link
2024-02-07 NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering Jingwang Ling et.al. 2402.04829 null
2024-02-07 OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding Guibiao Liao et.al. 2402.04648 null
2024-02-11 BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery Huiqing Zhang et.al. 2402.04554 null
2024-02-06 Improved Generalization of Weight Space Networks via Augmentations Aviv Shamsian et.al. 2402.04081 null
2024-02-05 ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis Bernard Spiegl et.al. 2402.02906 link
2024-02-02 ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields Xingyu Miao et.al. 2402.01950 link
2024-02-02 Robust Inverse Graphics via Probabilistic Inference Tuan Anh Le et.al. 2402.01915 link
2024-02-02 HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation Paweł Batorski et.al. 2402.01524 link
2024-02-02 Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses Mahboubeh Asadi et.al. 2402.01485 null
2024-02-06 GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting Joanna Waczyńska et.al. 2402.01459 link
2024-02-02 Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization Zhiyu Zhang et.al. 2402.01380 null
2024-02-06 Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance Yaokun Li et.al. 2402.01217 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864 link
2024-02-01 Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering Pinxin Liu et.al. 2402.00827 null
2024-01-31 CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting Jiezhi Yang et.al. 2401.18075 null
2024-02-01 Segment Anything in 3D Gaussians Xu Hu et.al. 2401.17857 link
2024-01-30 Physical Priors Augmented Event-Based 3D Reconstruction Jiaxu Wang et.al. 2401.17121 link
2024-01-31 Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting Yiming Huang et.al. 2401.16416 link
2024-01-29 Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields Rongkai Ma et.al. 2401.16144 null
2024-01-26 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field Zhenyu Bao et.al. 2401.14726 link
2024-01-25 Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation Jiaxu Wang et.al. 2401.14354 null
2024-01-27 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257 null
2024-01-24 EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction Yangsen Chen et.al. 2401.13352 null
2024-01-23 NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis Chongke Bi et.al. 2401.12568 null
2024-01-23 Exploration and Improvement of Nerf-based 3D Scene Editing Techniques Shun Fang et.al. 2401.12456 null
2024-01-23 Methods and strategies for improving the novel view synthesis quality of neural radiation field Shun Fang et.al. 2401.12451 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175 null
2024-01-22 Scaling Face Interaction Graph Networks to Real World Scenes Tatiana Lopez-Guevara et.al. 2401.11985 null
2024-01-22 HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs Zelin Gao et.al. 2401.11711 null
2024-01-23 IPR-NeRF: Ownership Verification meets Neural Radiance Field Win Kent Ong et.al. 2401.09495 null
2024-01-17 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization Weiyao Wang et.al. 2401.08937 null
2024-01-18 ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process Kiyohiro Nakayama et.al. 2401.08140 null
2024-01-16 Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities Xu Yan et.al. 2401.08045 link
2024-01-15 6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs Gergely Sóti et.al. 2401.07935 null
2024-01-11 TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Rajaei Khatib et.al. 2401.06191 null
2024-01-11 Fast High Dynamic Range Radiance Fields for Dynamic Scenes Guanjun Wu et.al. 2401.06052 null
2024-01-11 CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians Bin Dou et.al. 2401.05925 null
2024-01-11 GO-NeRF: Generating Virtual Objects in Neural Radiance Fields Peng Dai et.al. 2401.05750 null
2024-01-10 Diffusion Priors for Dynamic View Synthesis from Monocular Videos Chaoyang Wang et.al. 2401.05583 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335 null
2024-01-10 CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video Xingyu Miao et.al. 2401.04861 link
2024-01-08 A Survey on 3D Gaussian Splatting Guikun Chen et.al. 2401.03890 null
2024-01-08 NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation Casimir Feldmann et.al. 2401.03771 null
2024-01-06 RustNeRF: Robust Neural Radiance Field with Low-Quality Images Mengfei Li et.al. 2401.03257 null
2024-01-06 Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping Tongyan Hua et.al. 2401.03203 null
2024-01-05 Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human Song Bai et.al. 2401.02620 null
2024-01-05 FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF Hao Zhang et.al. 2401.02616 link
2024-01-05 Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting Van Minh Nguyen et.al. 2401.02588 null
2024-01-03 SIGNeRF: Scene Integrated Generation for Neural Radiance Fields Jan-Niklas Dihlmann et.al. 2401.01647 null
2024-01-02 Street Gaussians for Modeling Dynamic Urban Scenes Yunzhi Yan et.al. 2401.01339 null
2024-01-02 Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise Qinglong Huang et.al. 2401.01216 null
2024-01-02 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands Xuan Huang et.al. 2401.00979 link
2024-01-01 Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior Byeonghyeon Lee et.al. 2401.00825 link
2024-01-02 GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields Xiao Pan et.al. 2401.00616 null
2023-12-30 Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models Han Jiang et.al. 2401.00208 null
2023-12-29 Informative Rays Selection for Few-Shot Neural Radiance Fields Marco Orsingher et.al. 2312.17561 null
2023-12-27 City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web Kaiwen Song et.al. 2312.16457 null
2023-12-26 DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision Lu Ling et.al. 2312.16256 null
2023-12-24 SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition Nikhil Behari et.al. 2312.16215 null
2023-12-23 INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields Andrew Hou et.al. 2312.16197 null
2023-12-26 LangSplat: 3D Language Gaussian Splatting Minghan Qin et.al. 2312.16084 link
2023-12-26 2D-Guided 3D Gaussian Segmentation Kun Lan et.al. 2312.16047 null
2023-12-26 Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images Zhan Lu et.al. 2312.15942 null
2023-12-23 Human101: Training 100+FPS Human Gaussians in 100s from 1 View Mingwei Li et.al. 2312.15258 link
2023-12-23 Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane Chen Yang et.al. 2312.15253 link
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242 null
2023-12-22 PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF Mohsen Gholami et.al. 2312.14915 link
2023-12-22 Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints Miriam Jäger et.al. 2312.14664 null
2023-12-21 PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar Tzofi Klinghoffer et.al. 2312.14239 null
2023-12-21 Virtual Pets: Animatable Animal Generation in 3D Scenes Yen-Chi Cheng et.al. 2312.14154 null
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980 null
2023-12-21 SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS Ahmet Haydar Ornek et.al. 2312.13832 null
2023-12-22 Gaussian Splatting with NeRF-based Color and Opacity Dawid Malarz et.al. 2312.13729 link
2023-12-21 DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video Minh-Quan Viet Bui et.al. 2312.13528 null
2023-12-21 Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects David Nakath et.al. 2312.13494 null
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-20 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332 null
2023-12-20 ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors Weijia Mao et.al. 2312.13324 null
2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang et.al. 2312.13285 null
2023-12-20 Reducing Shape-Radiance Ambiguity in Radiance Fields with a Closed-Form Color Estimation Method Qihang Fang et.al. 2312.12726 link
2023-12-19 ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields Xiang Feng et.al. 2312.12122 null
2023-12-20 LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments Federico Ceola et.al. 2312.12036 link
2023-12-20 MixRT: Mixed Neural Representations For Real-Time NeRF Rendering Chaojian Li et.al. 2312.11841 null
2023-12-19 Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation Yuze He et.al. 2312.11774 null
2023-12-15 FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline Chien-Yu Lin et.al. 2312.11537 null
2023-12-15 Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior Nan Huang et.al. 2312.11535 null
2023-12-18 GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning Ye Yuan et.al. 2312.11461 null
2023-12-18 AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis Dongze Li et.al. 2312.10921 null
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649 null
2023-12-19 Learning Dense Correspondence for NeRF-Based Face Reenactment Songlin Yang et.al. 2312.10422 null
2023-12-15 SlimmeRF: Slimmable Radiance Fields Shiran Yuan et.al. 2312.10034 link
2023-12-15 LAENeRF: Local Appearance Editing for Neural Radiance Fields Lukas Radl et.al. 2312.09913 null
2023-12-15 SLS4D: Sparse Latent Space for 4D Novel View Synthesis Qi-Yuan Feng et.al. 2312.09743 null
2023-12-15 Towards Transferable Targeted 3D Adversarial Attack in the Physical World Yao Huang et.al. 2312.09558 link
2023-12-14 LatentEditor: Text Driven Local Editing of 3D Scenes Umar Khalid et.al. 2312.09313 link
2023-12-14 Stable Score Distillation for High-Quality 3D Generation Boshi Tang et.al. 2312.09305 null
2023-12-14 ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining Ruoxi Shi et.al. 2312.09249 null
2023-12-15 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting Zhiyin Qian et.al. 2312.09228 null
2023-12-15 ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field Zhangkai Ni et.al. 2312.09095 link
2023-12-15 Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption Ziteng Cui et.al. 2312.09093 link
2023-12-14 iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching Yuan Sun et.al. 2312.09031 null
2023-12-14 Scene 3-D Reconstruction System in Scattering Medium Zhuoyifan Zhang et.al. 2312.09005 null
2023-12-14 CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning Qingsong Yan et.al. 2312.08760 null
2023-12-14 SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field Ru Li et.al. 2312.08692 link
2023-12-13 ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields Juan Luis Gonzalez Bello et.al. 2312.08136 null
2023-12-13 Neural Radiance Fields for Transparent Object Using Visual Hull Heechan Yoon et.al. 2312.08118 null
2023-12-13 uSF: Learning Neural Semantic Field with Uncertainty Vsevolod Skorokhodov et.al. 2312.08012 link
2023-12-12 COLMAP-Free 3D Gaussian Splatting Yang Fu et.al. 2312.07504 null
2023-12-12 Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs Sunghwan Hong et.al. 2312.07246 link
2023-12-12 WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction Jingchun Zhou et.al. 2312.06946 null
2023-12-10 TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video Minye Wu et.al. 2312.06713 null
2023-12-11 CorresNeRF: Image Correspondence Priors for Neural Radiance Fields Yixing Lao et.al. 2312.06642 link
2023-12-11 DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior Tianyu Huang et.al. 2312.06439 link
2023-12-10 NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences Minye Wu et.al. 2312.05855 null
2023-12-10 IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment Letian Zhang et.al. 2312.05748 null
2023-12-09 CoGS: Controllable Gaussian Splatting Heng Yu et.al. 2312.05664 null
2023-12-09 R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning Zhiling Ye et.al. 2312.05572 null
2023-12-08 Multi-view Inversion for 3D-aware Generative Adversarial Networks Florian Barthel et.al. 2312.05330 link
2023-12-08 TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis Heming Zhu et.al. 2312.05161 null
2023-12-08 Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting Xiaofeng Yang et.al. 2312.04820 null
2023-12-08 Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video Yuchen Rao et.al. 2312.04784 null
2023-12-07 MuRF: Multi-Baseline Radiance Fields Haofei Xu et.al. 2312.04565 link
2023-12-07 EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS Sharath Girish et.al. 2312.04564 link
2023-12-07 Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection Kohei Yamashita et.al. 2312.04527 null
2023-12-07 Multi-View Unsupervised Image Generation with Cross Attention Guidance Llukman Cerkezi et.al. 2312.04337 null
2023-12-07 Towards 4D Human Video Stylization Tiantian Wang et.al. 2312.04143 link
2023-12-07 Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction Jiayi Kong et.al. 2312.04106 null
2023-12-06 Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion Kira Prabhu et.al. 2312.03869 null
2023-12-06 Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle Youtian Lin et.al. 2312.03431 null
2023-12-06 Artist-Friendly Relightable and Animatable Neural Heads Yingyan Xu et.al. 2312.03420 null
2023-12-06 Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method Hongyu Huang et.al. 2312.03372 null
2023-12-06 RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids Doriand Petit et.al. 2312.03357 null
2023-12-06 SO-NeRF: Active View Planning for NeRF using Surrogate Objectives Keifer Lee et.al. 2312.03266 null
2023-12-06 Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields Shijie Zhou et.al. 2312.03203 null
2023-12-05 HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces Haithem Turki et.al. 2312.03160 null
2023-12-05 ReconFusion: 3D Reconstruction with Diffusion Priors Rundi Wu et.al. 2312.02981 null
2023-12-05 GauHuman: Articulated Gaussian Splatting from Monocular Human Videos Shoukang Hu et.al. 2312.02973 link
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970 null
2023-12-05 MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures Zhangyang Xiong et.al. 2312.02963 null
2023-12-05 C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF Rui Huang et.al. 2312.02751 link
2023-12-05 Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent Jianmeng Liu et.al. 2312.02568 null
2023-12-04 PointNeRF++: A multi-scale, point-based Neural Radiance Field Weiwei Sun et.al. 2312.02362 null
2023-12-04 Calibrated Uncertainties for Neural Radiance Fields Niki Amini-Naieni et.al. 2312.02350 null
2023-12-04 Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis Felix Tristram et.al. 2312.02255 null
2023-12-04 ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction Yufei Shi et.al. 2312.02015 null
2023-12-04 Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training Runze He et.al. 2312.01663 null
2023-12-03 SANeRF-HQ: Segment Anything for NeRF in High Quality Yichen Liu et.al. 2312.01531 null
2023-12-03 VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams Liao Wang et.al. 2312.01407 null
2023-12-02 Self-Evolving Neural Radiance Fields Jaewoo Jung et.al. 2312.01003 link
2023-12-01 Gaussian Grouping: Segment and Edit Anything in 3D Scenes Mingqiao Ye et.al. 2312.00732 link
2023-11-30 LucidDreaming: Controllable Object-Centric 3D Generation Zhaoning Wang et.al. 2312.00588 null
2023-12-01 FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting Zehao Zhu et.al. 2312.00451 null
2023-11-30 PyNeRF: Pyramidal Neural Radiance Fields Haithem Turki et.al. 2312.00252 link
2023-11-30 SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting Haolin Xiong et.al. 2312.00206 link
2023-11-30 Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing Hyelin Nam et.al. 2311.18608 null
2023-11-30 ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs Violeta Menéndez González et.al. 2311.18491 null
2023-11-30 Anisotropic Neural Representation Learning for High-Quality Neural Rendering Y. Wang et.al. 2311.18311 null
2023-11-30 CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt Haiyao Xiao et.al. 2311.18288 null
2023-11-30 Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization KL Navaneet et.al. 2311.18159 link
2023-11-29 GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces Yingwenqi Jiang et.al. 2311.17977 null
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917 null
2023-11-29 FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information Wen Jiang et.al. 2311.17874 link
2023-11-29 Cinematic Behavior Transfer via NeRF-based Differentiable Filming Xuekun Jiang et.al. 2311.17754 null
2023-11-29 SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis Ziqiao Peng et.al. 2311.17590 link
2023-11-29 NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields Xiaoliang Liu et.al. 2311.17332 null
2023-11-28 LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS Zhiwen Fan et.al. 2311.17245 link
2023-11-28 Continuous Pose for Monocular Cameras in Neural Implicit Representation Qi Ma et.al. 2311.17119 link
2023-11-28 UC-NeRF: Neural Radiance Field for Under-Calibrated multi-view cameras in autonomous driving Kai Cheng et.al. 2311.16945 null
2023-11-28 The Sky's the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility James A. D. Gardner et.al. 2311.16937 link
2023-11-28 SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation Jesus Zarzar et.al. 2311.16671 link
2023-11-28 DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes Zhuopeng Li et.al. 2311.16664 null
2023-11-28 SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction Yu Chen et.al. 2311.16657 null
2023-11-28 Rethinking Directional Integration in Neural Radiance Fields Congyue Deng et.al. 2311.16504 null
2023-11-27 Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images Shiu-hong Kao et.al. 2311.16499 link
2023-11-27 Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling Zhe Li et.al. 2311.16096 link
2023-11-27 SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields Quentin Herau et.al. 2311.15803 null
2023-11-27 CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering Haidong Zhu et.al. 2311.15510 link
2023-11-26 Efficient Encoding of Graphics Primitives with Simplex-based Structures Yibo Wen et.al. 2311.15439 null
2023-11-26 Obj-NeRF: Extract Object NeRFs from Multi-view Images Zhiyi Li et.al. 2311.15291 null
2023-11-26 NeuRAD: Neural Rendering for Autonomous Driving Adam Tonderski et.al. 2311.15260 link
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603 null
2023-11-24 GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Yiwen Chen et.al. 2311.14521 link
2023-11-23 ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization Soonbin Lee et.al. 2311.14208 null
2023-11-23 Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs Andrea Tagliabue et.al. 2311.14153 null
2023-11-23 Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder Xiaohao Xu et.al. 2311.13750 null
2023-11-22 Compact 3D Gaussian Representation for Radiance Field Joo Chan Lee et.al. 2311.13681 link
2023-11-22 Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning Kai Yu et.al. 2311.13617 null
2023-11-22 Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions Keyang Ye et.al. 2311.13404 null
2023-11-22 Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images Jaeyoung Chung et.al. 2311.13398 null
2023-11-22 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization Jianwei Feng et.al. 2311.13168 null
2023-11-22 PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF Yutao Feng et.al. 2311.13099 null
2023-11-21 SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering Antoine Guédon et.al. 2311.12775 link
2023-11-21 Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields Yifan Wang et.al. 2311.12490 null
2023-11-18 Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields Xingyu Zhu et.al. 2311.12059 null
2023-11-20 GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding Hao Li et.al. 2311.11863 null
2023-11-20 Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields Zhiyuan Min et.al. 2311.11845 link
2023-11-19 GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise Xinhai Li et.al. 2311.11221 null
2023-11-18 SNI-SLAM: Semantic Neural Implicit SLAM Siting Zhu et.al. 2311.11016 link
2023-11-18 Structure-Aware Sparse-View X-ray 3D Reconstruction Yuanhao Cai et.al. 2311.10959 link
2023-11-17 Removing Adverse Volumetric Effects From Trained Neural Radiance Fields Andreas L. Teigen et.al. 2311.10523 null
2023-11-18 EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices Jingnan Gao et.al. 2311.09806 null
2023-11-16 Reconstructing Continuous Light Field From Single Coded Image Yuya Ishikawa et.al. 2311.09646 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217 null
2023-11-15 Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation Zhanfeng Liao et.al. 2311.09077 link
2023-11-13 $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF Liangchen Li et.al. 2311.07044 null
2023-11-11 Aria-NeRF: Multimodal Egocentric View Synthesis Jiankai Sun et.al. 2311.06455 null
2023-11-10 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model Jiahao Li et.al. 2311.06214 null
2023-11-10 A Neural Height-Map Approach for the Binocular Photometric Stereo Problem Fotios Logothetis et.al. 2311.05958 null
2023-11-09 BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis Hao-Bin Duan et.al. 2311.05521 link
2023-11-09 Control3D: Towards Controllable Text-to-3D Generation Yang Chen et.al. 2311.05461 null
2023-11-08 LRM: Large Reconstruction Model for Single Image to 3D Yicong Hong et.al. 2311.04400 null
2023-11-07 ADFactory: Automated Data Factory for Optical Flow Tasks Han Ling et.al. 2311.04246 null
2023-11-07 High-fidelity 3D Reconstruction of Plants using Neural Radiance Field Kewei Hu et.al. 2311.04154 null
2023-11-07 Fast Sun-aligned Outdoor Scene Relighting based on TensoRF Yeonjin Chang et.al. 2311.03965 null
2023-11-08 UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields Injae Kim et.al. 2311.03784 link
2023-11-06 Osprey: Multi-Session Autonomous Aerial Mapping with LiDAR-based SLAM and Next Best View Planning Rowan Border et.al. 2311.03484 null
2023-11-06 Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances Paul Knoll et.al. 2311.03140 null
2023-11-06 InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image Jianhui Li et.al. 2311.02826 link
2023-11-03 Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields Jianxiong Shen et.al. 2311.01815 null
2023-11-03 PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation Yuhan Ding et.al. 2311.01773 null
2023-11-03 Efficient Cloud Pipelines for Neural Radiance Fields Derek Jacoby et.al. 2311.01659 null
2023-11-02 Novel View Synthesis from a Single RGBD Image for Indoor Scenes Congrui Hetang et.al. 2311.01065 null
2023-10-31 FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees Saskia Rabich et.al. 2310.20710 null
2023-10-31 NeRF Revisited: Fixing Quadrature Instability in Volume Rendering Mikaela Angelina Uy et.al. 2310.20685 null
2023-10-30 Generative Neural Fields by Mixtures of Neural Implicit Functions Tackgeun You et.al. 2310.19464 null
2023-11-04 TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields Chengyao Duan et.al. 2310.18917 null
2023-10-28 INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings Amirhossein Kazerouni et.al. 2310.18846 link
2023-10-27 ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Kyle Sargent et.al. 2310.17994 null
2023-10-27 Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations Tristan Aumentado-Armstrong et.al. 2310.17880 null
2023-10-27 HyperFields: Towards Zero-Shot Generation of NeRFs from Text Sudarshan Babu et.al. 2310.17075 null
2023-10-25 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation Dadong Jiang et.al. 2310.16858 null
2023-10-26 LightSpeed: Light and Fast Neural Light Fields on Mobile Devices Aarush Gupta et.al. 2310.16832 link
2023-10-28 PERF: Panoramic Neural Radiance Field from a Single Panorama Guangcong Wang et.al. 2310.16831 link
2023-10-25 Open-NeRF: Towards Open Vocabulary NeRF Decomposition Hao Zhang et.al. 2310.16383 null
2023-10-25 UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception Christopher Maxey et.al. 2310.16255 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504 null
2023-10-23 CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features Omnia Mahmoud et.al. 2310.14695 null
2023-10-23 VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations Yiying Yang et.al. 2310.14487 null
2023-10-20 ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields Daiju Kanaoka et.al. 2310.13670 null
2023-10-20 Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos Seoha Kim et.al. 2310.13356 link
2023-10-20 UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene Jiaming Gu et.al. 2310.13263 null
2023-10-18 VQ-NeRF: Neural Reflectance Decomposition and Editing with Vector Quantization Hongliang Zhong et.al. 2310.11864 null
2023-10-18 Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs Khoa Tuan Nguyen et.al. 2310.11645 null
2023-10-16 TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields Leif Van Holland et.al. 2310.10650 link
2023-10-16 DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Jia-Wei Liu et.al. 2310.10624 null
2023-10-16 Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model Junpeng Tan et.al. 2310.10209 null
2023-10-15 ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context Binglun Wang et.al. 2310.09965 null
2023-10-15 Active Perception using Neural Radiance Fields Siming He et.al. 2310.09892 link
2023-10-15 CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses Hongyu Fu et.al. 2310.09776 null
2023-10-11 Dynamic Appearance Particle Neural Radiance Field Ancheng Lin et.al. 2310.07916 null
2023-10-12 PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction Jia-Wang Bian et.al. 2310.07449 link
2023-10-11 rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera Tongtong Zhang et.al. 2310.07179 null
2023-10-10 Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization Le Chen et.al. 2310.06984 null
2023-10-10 High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field Minghan Qin et.al. 2310.06275 null
2023-10-09 A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields Keyang Ye et.al. 2310.05837 null
2023-10-09 Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation Ruiyang Liu et.al. 2310.05391 null
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-10-08 Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation Dominik Hollidt et.al. 2310.05133 null
2023-10-06 Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation Hye Bin Yoo et.al. 2310.04152 null
2023-10-05 Drag View: Generalizable Novel View Synthesis with Unposed Imagery Zhiwen Fan et.al. 2310.03704 link
2023-10-05 Targeted Adversarial Attacks on Generalizable Neural Radiance Fields Andras Horvath et.al. 2310.03578 null
2023-10-05 BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields Ágoston István Csehi et.al. 2310.03563 null
2023-10-04 Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation Yihan Wu et.al. 2310.03125 null
2023-10-04 T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation Yuze He et.al. 2310.02977 link
2023-10-04 ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF Jangho Park et.al. 2310.02712 null
2023-10-05 USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields Moyang Li et.al. 2310.02687 link
2023-10-03 EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields Anish Bhattacharya et.al. 2310.02437 link
2023-10-03 Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering Tong Wang et.al. 2310.01881 null
2023-10-03 MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields Takuhiro Kaneko et.al. 2310.01821 null
2023-10-02 PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments Xiuzhong Hu et.al. 2310.00874 link
2023-10-01 How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF? Sicong Pan et.al. 2310.00684 link
2023-10-01 Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images -- A Multi-tiling Approaching and the Geometry Assessment of NeRF Ningli Xu et.al. 2310.00530 null
2023-09-30 MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending Yuze He et.al. 2310.00249 null
2023-09-29 Multi-task View Synthesis with Neural Radiance Fields Shuhong Zheng et.al. 2309.17450 link
2023-09-29 Forward Flow for Novel View Synthesis of Dynamic Scenes Xiang Guo et.al. 2309.17390 null
2023-09-29 HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field Xiaochen Zhao et.al. 2309.17128 null
2023-09-28 Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis Marcel C. Bühler et.al. 2309.16859 null

(back to top)

rl-arxiv-daily's People

Contributors

vincentqyw avatar light5551 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.