ollama37

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-10 15:57:04 +00:00

Author	SHA1	Message	Date
Shang Chieh Tseng	c8f6b24358	Update tesla-k80-multi-gpu-tests.yml	2025-10-30 17:48:42 +08:00
Shang Chieh Tseng	d9d3f7b0b4	Fix GitHub Actions workflows to upload build libraries and remove LD_LIBRARY_PATH Changes: - Update tesla-k80-ci.yml to upload build/lib/ollama/ containing CUDA backend - Remove all LD_LIBRARY_PATH environment variables (no longer needed with RPATH) - Test workflows now receive libggml-cuda.so enabling GPU offload This fixes the issue where test workflows couldn't offload to GPU because the CUDA backend library wasn't included in the artifact. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-30 15:08:34 +08:00
Shang Chieh Tseng	c022e79e77	Add LD_LIBRARY_PATH to GitHub Actions workflows for CUDA library discovery Set LD_LIBRARY_PATH in all workflow steps to ensure CUDA 11.4 libraries are found during both compile time and runtime. This fixes the issue where the GGML CUDA backend (libggml-cuda.so) fails to load when running 'ollama serve'. Library paths added: - /usr/local/cuda-11.4/lib64 - /usr/local/cuda-11.4/targets/x86_64-linux/lib - /usr/lib64 - /usr/local/lib64 Updated workflows: - tesla-k80-ci.yml: CMake configure, C++/CUDA build, Go build, binary verify - tesla-k80-single-gpu-tests.yml: All test execution steps - tesla-k80-multi-gpu-tests.yml: All test execution steps	2025-10-30 13:28:44 +08:00
Shang Chieh Tseng	5895b414f4	Fix cross-workflow artifact download using dawidd6/action-download-artifact - Replace actions/download-artifact@v4 with dawidd6/action-download-artifact@v6 - The default download-artifact action only works within same workflow run - Third-party action enables downloading artifacts from different workflow - Both test workflows now download from latest successful tesla-k80-ci.yml run	2025-10-30 12:12:59 +08:00
Shang Chieh Tseng	a171c8a087	Fix test workflows to use build artifacts instead of local binary - Build workflow now uploads ollama binary as artifact with 7-day retention - Test workflows download artifact instead of expecting local binary - Eliminates 'ollama binary not found' error when running tests - Enables build-once, test-multiple-times workflow pattern - Added binary verification step to confirm artifact download	2025-10-30 12:07:28 +08:00
Shang Chieh Tseng	6c3876a30d	Add multi-GPU test workflow and rename single-GPU workflow - Rename tesla-k80-tests.yml to tesla-k80-single-gpu-tests.yml for clarity - Add new tesla-k80-multi-gpu-tests.yml workflow for large models - Add multi-gpu profile to test/config/models.yaml with gemma3:27b and gpt-oss:20b - Multi-GPU workflow includes GPU count verification and weekly schedule - Profile-specific validation allows multi-GPU splits for large models - Separate workflows optimize CI efficiency: quick tests vs. thorough tests	2025-10-30 12:04:50 +08:00

6 Commits