Seeking Physics in Diffusion Noise
Chujun Tang, Lei Zhong, Fangqiang Ding
- 发表年份
- 2026
- 访问权限
- 开放获取
摘要
Do video diffusion models encode signals predictive of physical plausibility? We probe intermediate denoising representations of a pretrained Diffusion Transformer (DiT) and find that physically plausible and implausible videos are partially separable in mid-layer feature space across noise levels. This separability cannot be fully attributed to visual quality or generator identity, suggesting recoverable physics-related cues in frozen DiT features. Leveraging this observation, we introduce progressive trajectory selection, an inference-time strategy that scores parallel denoising trajectories at a few intermediate checkpoints using a lightweight physics verifier trained on frozen features, and prunes low-scoring candidates early. Extensive experiments on PhyGenBench demonstrate that our method improves physical consistency while reducing inference cost, achieving comparable results to Best-of-K sampling with substantially fewer denoising steps.
关键词
相关论文
一种面向线弧增材制造的电动汽车结构可制造性拓扑优化的双环框架
Qiang Cui, Chuan Yu, Daoqian Yang 等 5 位作者
Robotics and Computer-Integrated Manufacturing · 2026
几何数字孪生:一种用于航空发动机装配精度预测的数字智能模型
Ke Shang, Xin Jin, Teli Xu 等 7 位作者
Robotics and Computer-Integrated Manufacturing · 2026
面向安全约束控制的机器人集成电池制造中剩余使用寿命感知的物理信息贝叶斯数字孪生
Faizanbasha A., U. Rizwan, Syed Tahir Hussainy 等 5 位作者
Robotics and Computer-Integrated Manufacturing · 2026
利用大模型与小模型协作实现智能制造的高级自动化
Qunlong Chen, Yuyi Zhang, Wei Qin 等 7 位作者
Robotics and Computer-Integrated Manufacturing · 2026