A powerful, production-ready Streamlit web application for comprehensive LLM response evaluation and benchmarking. Features multi-dimensional scoring across 7 key criteria, interactive analytics ...
A Python-based static website generator specifically designed for web novels, with support for GitHub Actions and GitHub Pages deployment. You can see a demo build ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results