To evaluate the linguistic and aesthetic quality of the artificial poems, we collected behavioral ratings through online crowdsourcing evaluation, which enabled us to sample a wide range of population and acquire a large amount of data. The 150 artificial poems were evaluated.

We designed five questions to probe five aspects of poem appreciation:

  1. Aesthetics: 这首诗美吗?(translation: is this poem beautiful?)
  2. Fluency: 这首诗顺口吗?(translation: does the poem read fluently?)
  3. Emotion: 这首诗触动了你吗?(translation: are you moved by this poem?)
  4. Meaning: 你觉得这首诗有意义吗?(translation: is this poem meaningful?)
  5. Value: 你愿意给这首诗打多少赏?(translation: how much would you tip the poet?)
(See The demo of rating)

Rating results

The sheet of the rating result were organized as the following format:

Data samples (Download the full data)

poem subject score_1 score_2 score_3 score_4 score_5 rt_1 rt_2 rt_3 rt_4 rt_5
1471233444173219161513215852594
16125354513167062621414641220
651233544216312331835105071996
92123332213621154778314661718
56298988919760136511357857052309