Adam Ghanem
Adam Ghanem
Home
Posts
Projects
Talks
Publications
Contact
Light
Dark
Automatic
3
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
In the era of large models, scaling up model size and the integration with large language models have further improved the performance of TTI models, resulting the generation result nearly indistinguishable from real-world images, revolutionizing the way we retrieval images. Our explorative study has incentivised us to think that there are further ways of scaling text-to-image models with the combination of innovative model architectures and prediction enhancement techniques.
Feixiang Bie
,
Yibo Yang
,
Zhongzhu Zhou
,
Adam Ghanem
,
Minjia Zhang
,
Zhewei Yao
,
Xiaoxia Wu
,
Connor Holmes
,
Pareesa Golnari
,
David A. Clifton
,
Yuxiong He
,
Dacheng Tao
,
Shuaiwen Leon Song
PDF
Arxiv
Cite
×