ollama37

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-17 19:27:00 +00:00

Files

Shang Chieh Tseng c2f4f378cc Add dual-judge mode to test runner

New options:
- --dual-judge: Run both simple and LLM judge, fail if either fails
- --judge-url: Separate LLM Judge server URL (default: localhost:11435)
- --judge-model: Model for LLM judging (default: gemma3:4b)

Dual judge logic:
- Simple judge checks exit codes
- LLM judge analyzes logs semantically
- Final result: FAIL if either judge says FAIL
- Combines reasons from both judges on failure

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2025-12-15 22:58:28 +08:00

src

Add dual-judge mode to test runner

2025-12-15 22:58:28 +08:00

testcases

Add model warmup step to TC-INFERENCE-001

2025-12-15 21:38:09 +08:00

package-lock.json

Add GitHub Actions CI/CD pipeline and test framework

2025-12-15 14:06:44 +08:00

package.json

Add GitHub Actions CI/CD pipeline and test framework

2025-12-15 14:06:44 +08:00

tsconfig.json

Add GitHub Actions CI/CD pipeline and test framework

2025-12-15 14:06:44 +08:00