A Comprehensive Evaluation of 26 State-of-the-Art Text-to-Image Models

We evaluate 26 recent text-to-image models spanning diffusion, autoregressive, and GAN architectures, with sizes from 0.4B to 13B parameters. The models also differ in organization and accessibility (open or closed), and each is run with the default inference configuration from its API, GitHub, or Hugging Face repository. Table 4 summarizes the key model properties.


This content originally appeared on HackerNoon and was authored by Auto Encoder: How to Ignore the Signal Noise

:::info Authors:

(1) Tony Lee, Stanford (equal contribution);

(2) Michihiro Yasunaga, Stanford (equal contribution);

(3) Chenlin Meng, Stanford (equal contribution);

(4) Yifan Mai, Stanford;

(5) Joon Sung Park, Stanford;

(6) Agrim Gupta, Stanford;

(7) Yunzhi Zhang, Stanford;

(8) Deepak Narayanan, Microsoft;

(9) Hannah Benita Teufel, Aleph Alpha;

(10) Marco Bellagente, Aleph Alpha;

(11) Minguk Kang, POSTECH;

(12) Taesung Park, Adobe;

(13) Jure Leskovec, Stanford;

(14) Jun-Yan Zhu, CMU;

(15) Li Fei-Fei, Stanford;

(16) Jiajun Wu, Stanford;

(17) Stefano Ermon, Stanford;

(18) Percy Liang, Stanford.

:::

Abstract and 1 Introduction

2 Core framework

3 Aspects

4 Scenarios

5 Metrics

6 Models

7 Experiments and results

8 Related work

9 Conclusion

10 Limitations

Author contributions, Acknowledgments and References

A Datasheet

B Scenario details

C Metric details

D Model details

E Human evaluation procedure

6 Models

We evaluate 26 recent text-to-image models that vary in type (e.g., diffusion, autoregressive, GAN), size (0.4B to 13B parameters), organization, and accessibility (open or closed). Table 4 gives an overview of the models and their properties. In our evaluation, we use the default inference configuration provided in each model’s API, GitHub, or Hugging Face repository.
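As a rough illustration of the properties tracked per model (type, size, organization, accessibility), the following is a minimal Python sketch of how such a table could be represented and summarized. The `ModelEntry` class, its field names, and all three entries are hypothetical placeholders invented for this example; they are not rows from Table 4 and do not describe any real model.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One row of a model-overview table (hypothetical schema)."""
    name: str
    model_type: str          # e.g. "diffusion", "autoregressive", "GAN"
    params_billions: float   # model size in billions of parameters
    organization: str
    open_access: bool        # True if weights/code are open, False if closed


# Placeholder entries for illustration only; NOT taken from Table 4.
MODELS = [
    ModelEntry("example-diffusion", "diffusion", 0.4, "Org A", True),
    ModelEntry("example-autoregressive", "autoregressive", 13.0, "Org B", False),
    ModelEntry("example-gan", "GAN", 1.0, "Org C", True),
]


def count_by_type(models):
    """Count how many models fall under each model type."""
    return Counter(m.model_type for m in models)


def open_models(models):
    """Return the names of openly accessible models, in input order."""
    return [m.name for m in models if m.open_access]


def size_range(models):
    """Return the (smallest, largest) model size in billions of parameters."""
    sizes = [m.params_billions for m in models]
    return min(sizes), max(sizes)


if __name__ == "__main__":
    print(count_by_type(MODELS))
    print(open_models(MODELS))
    print(size_range(MODELS))
```

Keeping the properties in one typed record makes it easy to group or filter models along any of the axes the section lists, assuming the real table is loaded into the same shape.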


:::info This paper is available on arXiv under the CC BY 4.0 DEED license.

:::


