CLAIR-A: Leveraging Large Language Models to Judge Audio Captions

Published in arXiv, 2024