siemens_ragas

Files

wangwei f8e308b7dc fix: use max_tokens=8 for chat model connectivity test

max_tokens=1 triggers 'min-output limit' errors on gpt-5.x models.
Using 8 tokens is still cheap but satisfies all known model minimums.
Falls back to max_completion_tokens=8 if max_tokens is not supported.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

2026-06-23 15:03:27 +08:00

api

fix: use max_tokens=8 for chat model connectivity test

2026-06-23 15:03:27 +08:00

services

feat: add InlineScorer service with LLM client caching

2026-06-22 15:03:43 +08:00

static

fix: restore LLM profile test connectivity buttons (lost from git)

2026-06-23 13:58:43 +08:00

__init__.py

Add RAGAS evaluation web console (FastAPI + vanilla JS)

2026-06-15 15:53:57 +08:00

models.py

fix: change ScoreRequest json_schema_extra from examples list to example dict

2026-06-23 10:03:46 +08:00

server.py

feat: add detailed logging to all API routes and global access log middleware

2026-06-23 10:35:00 +08:00