Hi everyone,
To evaluate LLMs and the RAG (context) support of the AI LLM application, @ppantiru and I will develop a small evaluation framework and run a benchmark. We discussed this and think it would be best to create a new contrib repository for it, since we would also like to store the evaluation results in the same repository, and it seems odd to mix such results with the actual extension code, especially as the result files could be fairly large.
What do you think about application-ai-llm-benchmark as the repository name, to indicate its relationship to the existing application-ai-llm repository?
Thank you very much!