Giving Agents a Visual Voice: ... Note