TiPAI-TSPO
Tournament Inpainting for Patch-level Alignment in text-to-image diffusion models with Tournament Sampling Policy Optimization
EMNLP 2026 (Under Review) · Diffusion · RL
- TiPAI: a decoding-time alignment framework for diffusion models that uses localized, timestep-aware auditing and targeted inpainting to produce policy-compliant generations.
- TSPO (Tournament Sampling Policy Optimization): a lightweight reinforcement learning framework for selecting optimal inpainting configurations under quality and compute constraints.
- Guarded tournament decoding pipeline with monotone non-regression guarantees on faithfulness, safety, and seam quality.
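
The guarded tournament idea in the last bullet can be illustrated with a minimal sketch. This is not the project's implementation: it assumes that "monotone non-regression" means a challenger replaces the current winner only if it scores no worse on every tracked metric and strictly better on at least one. The names `Candidate`, `METRICS`, `non_regressing`, and `guarded_tournament`, and the convention that higher scores are better, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Iterable

# Hypothetical metric names taken from the bullet above; higher is assumed better.
METRICS = ("faithfulness", "safety", "seam_quality")


@dataclass
class Candidate:
    """An image candidate (e.g. one inpainting result) with per-metric scores."""
    name: str
    scores: Dict[str, float]


def non_regressing(new: Candidate, old: Candidate, tol: float = 0.0) -> bool:
    """True if `new` is no worse than `old` on every metric (within `tol`)."""
    return all(new.scores[m] >= old.scores[m] - tol for m in METRICS)


def improves(new: Candidate, old: Candidate) -> bool:
    """True if `new` is strictly better than `old` on at least one metric."""
    return any(new.scores[m] > old.scores[m] for m in METRICS)


def guarded_tournament(incumbent: Candidate,
                       challengers: Iterable[Candidate]) -> Candidate:
    """Sequentially compare challengers against the current winner.

    Assumed guard: a challenger wins only if it does not regress on any
    metric and improves at least one, so the winner's scores never
    decrease from round to round.
    """
    for challenger in challengers:
        if non_regressing(challenger, incumbent) and improves(challenger, incumbent):
            incumbent = challenger
    return incumbent
```

Under this assumed guard, a challenger that gains faithfulness at the cost of safety is rejected, so the final output is never worse than the starting image on any tracked metric.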