学习 分类论文(5,491)
清除筛选 ✕反馈世界模型实现扩散策略的精确引导
Tuo An, Jindou Jia, Gen Li 等 11 位作者
2026
PCASim: Promptable Closed-loop Adversarial Simulation for Urban Traffic Environment
Chuancheng Zhang, Zhenhao Wang, Kaizheng Li 等 6 位作者
2026
ParallelCBF:面向张量并行强化学习的可组合安全过滤与审计框架
Yijun Lu, Zilei Yang, Yuyin Ma
2026
随机延迟下机器人遥操作的残差强化学习
Kaize Deng, Zewen Yang
2026
EgoExo-WM:利用外部视频解锁自我世界模型
Danny Tran, Roberto Martín-Martín, Kristen Grauman
2026
Time-Varying Deep State Space Models for Sequences with Switching Dynamics
Sanja Karilanova, Subhrakanti Dey, Ayça Özçelikkale
2026
PhysBrain 1.0技术报告
Shijie Lian, Bin Yu, Xiaopeng Lin 等 13 位作者
2026
Articraft:一种用于可扩展铰接3D资产生成的智能体系统
Matt Zhou, Ruining Li, Xiaoyang Lyu 等 9 位作者
2026
Slot-MPC:基于目标中心表示的目标条件模型预测控制
Jonathan Spieler, Angel Villar-Corrales, Sven Behnke
2026
Chrono-Gymnasium: An Open-Source, Gymnasium-Compatible Distributed Simulation Framework
Bocheng Zou, Harry Zhang, Khailanii Slaton 等 8 位作者
2026
CaMeRL: Collision-Aware and Memory-Enhanced Reinforcement Learning for UAV Navigation in Multi-Scale Obstacle Environments
Hong Hong, Feiyu Liao, Yongheng Liang 等 6 位作者
2026
Addressing Terminal Constraints in Data-Driven Demand Response Scheduling
Maximilian Bloor, Martha White, Ehecatl Antonio del Rio Chanona 等 4 位作者
2026
SR-Platform: An Agentic Pipeline for Natural Language-Driven Robot Simulation Environment Synthesis
Ben Wei Lim, Minh Duc Le, Thang Truong 等 4 位作者
2026
Fully Dynamic Rebalancing in Dockless Bike-Sharing Systems via Deep Reinforcement Learning
Edoardo Scarpel, Alberto Pettena, Matteo Cederle 等 6 位作者
2026
基于混合梯度的离散-连续混合动作空间策略优化
Matias Alvo, Daniel Russo, Yash Kanoria
2026
Action-Conditioned Risk Gating for Safety-Critical Control under Partial Observability
Yushen Liu, Yin-Jen Chen, Ziyi Chen 等 7 位作者
2026
PreFT: Prefill-only finetuning for efficient inference
Andrew Lanpouthakoun, Aryaman Arora, Zhengxuan Wu 等 7 位作者
2026
MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving
Rajeev Yasarla, Deepti Hegde, Hsin-Pai Cheng 等 12 位作者
2026
Optimal design of solar-battery hybrid resources considering multi-market participation under weather and price uncertainty
Hikaru Hoshino, Taiyo Mantani, Eiko Furutani
2026
R2R2:通过自预测学习中的冗余减少实现密集经验复用的鲁棒表示
Sanghyeob Song, Donghyeok Lee, Jinsik Kim 等 4 位作者
2026
Ergodic Imitation for Adaptive Exploration around Demonstrations
Ziyi Xu, Cem Bilaloglu, Yiming Li 等 4 位作者
2026
利用语言模型先验从观测中学习POMDP世界模型
Valentin Six, Frederik Panse, Mathis Fajeau 等 10 位作者
2026
Twincher:面向连续系统鲁棒逆映射的双射表示学习
Arkady Gonoskov
2026
Learning a Contracting KKL-observer with Local Optimal Guarantees
Clara Lucía Galimberti, Johan Peralez, Daniele Astolfi 等 5 位作者
2026