Not Specifying Model Versions Is Dangerous: A Case Study of 40 Models in Production
https://ericksgreatdigest.iamarrows.com/evaluating-llm-hallucinations-for-production-a-practical-cto-s-roadmap
This case study documents a real-world evaluation run by an ML operations team preparing a conversational assistant for enterprise use.