Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Paper
•
2410.09347
•
Published
•
5
Paper:arxiv.org/abs/2410.09347
Github: https://github.com/thu-ml/CCA/tree/main
(TL;DR) We propose CCA as a finetuning technique for AR visual models so that they can generate high-quality images without CFG, cutting sampling costs by half. CCA and CFG have the same theoretical foundations and thus similar features, though CCA is inspired from LLM alignment instead of guided sampling.
Features of CCA:
Base model
FoundationVision/LlamaGen