TriEval: A Resource-Efficient Pipeline for LLM Bias, Toxicity, and Truthfulness Assessment
TriEval is an LLM evaluation pipeline assessing bias, toxicity, and truthfulness simultaneously with minimal resources. Compatible with open-source and closed-source models, runs on standard laptop without GPU. Tested on Llama 3 8B, Mistral 7B, Gemma 2 9B, and Claude Haiku, revealing toxicity and truthfulness differences between models.