Papers·1개월 전

KAIST AI, Tweedie matching 기반 훈련 없는 긴 비디오 생성 — 길이 4배, temporal consistency 30% 향상

KAIST AI 팀이 추가 학습 없이 비디오 생성 모델의 출력 길이를 몇 배로 늘리는 inference-time 기법을 제안했습니다. 겹치는 sliding window 사이를 Tweedie matching으로 정합하고, stochastic early-phase sampling으로 window 간 궤적을 동기화하는 방식입니다. 기존 training-free 및 autoregressive baseline 대비 temporal consistency와 시각 품질에서 우위를 보였으며, 오디오-비디오 공동 생성이나 text-to-3DGS에도 fine-tuning 없이 확장 가능합니다.

#video-generation
#diffusion-models
#tweedie-matching
#kaist
#long-video

KAIST AI

원문 보기 →

KAIST AI, Tweedie matching 기반 훈련 없는 긴 비디오 생성 — 길이 4배, temporal consistency 30% 향상

Comments