To output synthesized sounds rich in emotion.
A text-forming section 31 forms an utterance text to form synthesized sounds from the text as a word string included in action command information in accordance with this information, while an emotion checking section 39 checks the emotion model value of a pet robot and determines as to whether or not the emotion of the pet robot is rising according to the emotion model value. Further, the emotion checking section 39 instructs alternation of word order to the text-forming section 31, when the emotion of the pet robot is rising. The text forming section 31 alternates the word order of the utterance text according to the instruction of the emotion checking section 39. Namely, the word order is thereby changed to 'lovely, you', when, for example, the utterance text is 'you are lovely'.