ollama37

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-20 12:47:00 +00:00

Files

Shang Chieh Tseng 806232d95f Add multi-model inference tests for gemma3 12b and 27b

- TC-INFERENCE-004: gemma3:12b single GPU test
- TC-INFERENCE-005: gemma3:27b dual-GPU test (K80 layer split)
- Each test unloads previous model before loading next
- Workflows unload all 3 model sizes after inference suite
- 27b test verifies both GPUs have memory allocated

2025-12-17 17:01:25 +08:00

workflows

Add multi-model inference tests for gemma3 12b and 27b

2025-12-17 17:01:25 +08:00