Text-to-image generation | HearLore