siemens_ragas

Files

wangwei 629304aa6d feat(logging): add structured evaluation logs for metric-level debugging

- pipeline.py: log each metric score/timeout/error with sample_id,
  elapsed time, and score value; log NaN list per sample; progress
  counter N/total after each sample completes
- evaluator.py: log eval start, dataset counts, adapter enrichment
  progress (per-sample OK/FAIL with elapsed), metric scoring summary,
  and per-metric NaN rate at end of run
- runner.py: _setup_logging() helper writes to stderr + optional file;
  ragas/httpx/openai noisy loggers throttled to WARNING
- main.py: add --log-file and --log-level CLI flags

Usage:
  python main.py --scenario scenarios/online/... --log-file logs/eval.log --log-level DEBUG

Co-Authored-By: Claude <noreply@anthropic.com>

2026-06-16 10:48:41 +08:00

adapters

first commit

2026-06-12 14:02:15 +08:00

config

first commit

2026-06-12 14:02:15 +08:00

dataset_builder

feat(dataset-builder): add retry logic and ASCII-safe logging for Siemens PDF pipeline

2026-06-15 23:06:33 +08:00

datasets

Add RAGAS evaluation web console (FastAPI + vanilla JS)