Diffusionff Cvpr 2026

DiffusionFF (CVPR 2026)

DiffusionFF

Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios Zhipeng Sui, ...

ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers.

Title:MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene ...

Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset.

Joonki Min, Chaeyun Kim, Hyungwook Choi, Yejin Kim, Kihyun Kim, Yohan Jo, Joonseok Lee. Fine-Grained Multi-Image Object ...

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.

[

How much do video diffusion models know about the 4D world? By introducing a 4D VAE, we jointly estimate geometry and ...

Paper: https://arxiv.org/abs/2512.01390 Project Page: https://cmlab-korea.github.io/FRAMER/ Authors/Affiliations: [Seungho ...

Poster Presentation

Hyun Lee, Hyemin Jeong, Yejin Kim, Hyungwook Choi, Hyunsoo Cho, Soo Kyung Kim, Joonseok Lee. A More Word-like Image ...

[

Video2Robo: 3DGS-based Synthetic Data from One Video Enables Scalable Robot Learning Project page: ...

We introduce our

Video for FG-Portrait: 3D Flow Guided Editable Portrait Animation (

In this video, we introduce a novel video object detection framework called D2FANet. D2FANet is the first framework to jointly ...

Title: Agentic Retoucher for Text-to-Image Generation Authors: Shaocheng Shen, Jianfeng Liang, Chunlei Cai, Cong Geng, Huiyu ...