Curating 62 Practical Scenarios to Test AI Text-to-Image Models

To evaluate 12 key aspects of text-to-image models, HEIM curates 62 practical scenarios. These include established ones like MS-COCO and new ones for originality, aesthetics, bias, and fairness. The scenarios provide robust tests for aspects such as toxicity, quality, and creativity, offering comprehensive model assessment across varied tasks.


This content originally appeared on HackerNoon and was authored by Auto Encoder: How to Ignore the Signal Noise

:::info Authors:

(1) Tony Lee, Stanford with Equal contribution;

(2) Michihiro Yasunaga, Stanford with Equal contribution;

(3) Chenlin Meng, Stanford with Equal contribution;

(4) Yifan Mai, Stanford;

(5) Joon Sung Park, Stanford;

(6) Agrim Gupta, Stanford;

(7) Yunzhi Zhang, Stanford;

(8) Deepak Narayanan, Microsoft;

(9) Hannah Benita Teufel, Aleph Alpha;

(10) Marco Bellagente, Aleph Alpha;

(11) Minguk Kang, POSTECH;

(12) Taesung Park, Adobe;

(13) Jure Leskovec, Stanford;

(14) Jun-Yan Zhu, CMU;

(15) Li Fei-Fei, Stanford;

(16) Jiajun Wu, Stanford;

(17) Stefano Ermon, Stanford;

(18) Percy Liang, Stanford.

:::

Abstract and 1 Introduction

2 Core framework

3 Aspects

4 Scenarios

5 Metrics

6 Models

7 Experiments and results

8 Related work

9 Conclusion

10 Limitations

Author contributions, Acknowledgments and References

A Datasheet

B Scenario details

C Metric details

D Model details

E Human evaluation procedure

4 Scenarios

To evaluate the 12 aspects (§3), we curate diverse and practical scenarios. Table 2 presents an overview of all the scenarios and their descriptions. Each scenario is a set of textual inputs and can be used to evaluate certain aspects. For instance, the “MS-COCO” scenario can be used to assess the alignment, quality, and efficiency aspects, and the “Inappropriate Image Prompts (I2P)” scenario [8] can be used to assess the toxicity aspect. Some scenarios may include sub-scenarios, indicating the sub-level categories or variations within them, such as “Hate” and “Violence” within I2P. We curate these scenarios by leveraging existing datasets and creating new prompts ourselves. In total, we have 62 scenarios, including the sub-scenarios.

\ Notably, we create new scenarios (indicated with “New” in Table 2) for aspects that were previously underexplored and lacked dedicated datasets. These aspects include originality, aesthetics, bias, and fairness. For example, to evaluate originality, we develop scenarios to test the artistic creativity of these models with textual inputs to generate landing pages, logos, and magazine covers.

\

:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

\


This content originally appeared on HackerNoon and was authored by Auto Encoder: How to Ignore the Signal Noise


Print Share Comment Cite Upload Translate Updates
APA

Auto Encoder: How to Ignore the Signal Noise | Sciencx (2024-10-12T22:44:13+00:00) Curating 62 Practical Scenarios to Test AI Text-to-Image Models. Retrieved from https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/

MLA
" » Curating 62 Practical Scenarios to Test AI Text-to-Image Models." Auto Encoder: How to Ignore the Signal Noise | Sciencx - Saturday October 12, 2024, https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/
HARVARD
Auto Encoder: How to Ignore the Signal Noise | Sciencx Saturday October 12, 2024 » Curating 62 Practical Scenarios to Test AI Text-to-Image Models., viewed ,<https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/>
VANCOUVER
Auto Encoder: How to Ignore the Signal Noise | Sciencx - » Curating 62 Practical Scenarios to Test AI Text-to-Image Models. [Internet]. [Accessed ]. Available from: https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/
CHICAGO
" » Curating 62 Practical Scenarios to Test AI Text-to-Image Models." Auto Encoder: How to Ignore the Signal Noise | Sciencx - Accessed . https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/
IEEE
" » Curating 62 Practical Scenarios to Test AI Text-to-Image Models." Auto Encoder: How to Ignore the Signal Noise | Sciencx [Online]. Available: https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/. [Accessed: ]
rf:citation
» Curating 62 Practical Scenarios to Test AI Text-to-Image Models | Auto Encoder: How to Ignore the Signal Noise | Sciencx | https://www.scien.cx/2024/10/12/curating-62-practical-scenarios-to-test-ai-text-to-image-models/ |

Please log in to upload a file.




There are no updates yet.
Click the Upload button above to add an update.

You must be logged in to translate posts. Please log in or register.