TiPAI-TSPO

Tournament Inpainting for Patch-level Alignment in text-to-image diffusion models with Tournament Sampling Policy Optimization

EMNLP 2026 (Under Review) · Diffusion · RL
  • TiPAI: a decoding-time alignment framework for diffusion models using localized timestep-aware auditing and targeted inpainting for policy-compliant generation.
  • TSPO (Tournament Sampling Policy Optimization): a lightweight reinforcement learning framework for selecting optimal inpainting configurations under quality and compute constraints.
  • A guarded tournament decoding pipeline with monotone non-regression guarantees on faithfulness, safety, and seam quality.
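
One way to read the monotone non-regression guard is as an acceptance rule: a candidate replaces the current result only if none of the guarded metrics gets worse. The sketch below illustrates that reading with toy scores. It is not the project's implementation: the names `guarded_tournament`, `non_regressing`, and `GUARDED_METRICS`, the higher-is-better score convention, and the sum-of-scores tie-break are all assumptions made for illustration.

```python
from typing import Callable, Dict, Iterable, Tuple, TypeVar

T = TypeVar("T")
Scores = Dict[str, float]

# Assumed metric names, taken from the guarantees listed above (higher is better).
GUARDED_METRICS = ("faithfulness", "safety", "seam_quality")


def non_regressing(current: Scores, candidate: Scores, tol: float = 0.0) -> bool:
    """True if the candidate is no worse than the current result on every guarded metric."""
    return all(candidate[m] >= current[m] - tol for m in GUARDED_METRICS)


def guarded_tournament(
    initial: T,
    candidates: Iterable[T],
    score: Callable[[T], Scores],
) -> Tuple[T, Scores]:
    """Keep the incumbent unless a candidate passes the guard and improves the total score.

    Because a candidate is accepted only when no guarded metric drops,
    each metric of the returned result is >= that metric of `initial`.
    """
    best, best_scores = initial, score(initial)
    for cand in candidates:
        cand_scores = score(cand)
        improves = sum(cand_scores.values()) > sum(best_scores.values())
        if non_regressing(best_scores, cand_scores) and improves:
            best, best_scores = cand, cand_scores
    return best, best_scores


# Toy usage: strings stand in for images, a lookup table stands in for the auditors.
TOY_SCORES: Dict[str, Scores] = {
    "base": {"faithfulness": 0.5, "safety": 0.9, "seam_quality": 0.6},
    "a": {"faithfulness": 0.8, "safety": 0.7, "seam_quality": 0.9},  # safety regresses
    "b": {"faithfulness": 0.6, "safety": 0.9, "seam_quality": 0.7},  # no regression
}

winner, winner_scores = guarded_tournament("base", ["a", "b"], TOY_SCORES.__getitem__)
```

In this toy run, candidate `a` has the highest total score but is rejected because its safety score drops, so the guard trades some peak quality for the guarantee that no guarded metric ever moves backwards.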