arXiv cs.AI

TriEval: A Resource-Efficient Pipeline for LLM Bias, Toxicity, and Truthfulness Assessment

Signal: 72
Hype: 25
In three lines: TriEval is an LLM evaluation pipeline that assesses bias, toxicity, and truthfulness simultaneously with minimal resources. It is compatible with both open-source and closed-source models and runs on a standard laptop without a GPU. It was tested on Llama 3 8B, Mistral 7B, Gemma 2 9B, and Claude Haiku, and the results revealed differences in toxicity and truthfulness between the models.
Tags: Evals, AI safety, Open source, Llama, Mistral

Summary generated by Claude — human-verified