Semantic Analysis

Measuring semantic similarity between LLM responses and gold standard answers using multiple embedding models including SBERT, Cohere, Voyage, OpenAI, and BERTScore.