Accepted to ECCV 2026 — REGLUE Your Latents 🎉

We’re excited to announce that our paper “REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion” has been accepted to ECCV 2026.

Authors: Giorgos Petsangourakis, Christos Sgouropoulos, Bill Psomas, Theodoros Giannakopoulos, Giorgos Sfikas, Ioannis Kakogeorgiou.

Highlights:

  • REGLUE is a unified framework that jointly models VAE latents, global semantics, and local Vision Foundation Model semantics for faster, higher-fidelity diffusion-based image generation.
  • Existing methods such as REPA and REG inject only a narrow slice of VFM features. REGLUE jointly models compact, non-linearly compressed patch-level semantics alongside VAE latents, with a global [CLS] token and an alignment loss providing complementary gains.
  • REGLUE with SiT-XL/2 matches 1M-step state of the art in just 700k iterations, using roughly 30% fewer training steps, with consistent FID improvements across training budgets.

Preprint: arXiv:2512.16636

Code: github.com/giorgospets/reglue

Many thanks to the co-authors and reviewers for their feedback.