Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564)

Blake Mizerany
2024-04-09 15:57:45 -07:00
committed by GitHub
parent fccf3eecaa
commit 1524f323a3
11 changed files with 61 additions and 251 deletions


@@ -23,7 +23,13 @@ export OLLAMA_DEBUG=1
Get the required libraries and build the native LLM code:
```bash
go run build.go
go generate ./...
```
Then build ollama:
```bash
go build .
```
Now you can run `ollama`:
@@ -32,16 +38,6 @@ Now you can run `ollama`:
./ollama
```
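As a quick smoke test (a sketch, assuming the `serve` and `run` subcommands and that you substitute a model you have pulled for `llama2`):
```bash
# Terminal 1: start the server from the freshly built binary
./ollama serve

# Terminal 2: pull (if needed) and chat with a model
./ollama run llama2
```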
### Rebuilding the native code
If at any point you need to rebuild the native code, you can run the
build.go script again using the `-f` flag to force a rebuild, and,
optionally, the `-d` flag to skip building the Go binary:
```bash
go run build.go -f -d
```
### Linux
#### Linux CUDA (NVIDIA)
@@ -57,10 +53,16 @@ specifying an environment variable `CUDA_LIB_DIR` to the location of the shared
libraries, and `CUDACXX` to the location of the nvcc compiler. You can customize
the set of target CUDA architectures by setting `CMAKE_CUDA_ARCHITECTURES` (e.g. "50;60;70")
Then generate dependencies:
```
go generate ./...
```
Then build the binary:
```
go run build.go
go build .
```
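For instance, a build against a CUDA toolkit in a non-default location might look like the following; the paths and architecture list are assumptions, so adjust them to your installation:
```bash
# Point the build at your CUDA toolkit (assumed paths shown)
export CUDA_LIB_DIR=/usr/local/cuda/lib64
export CUDACXX=/usr/local/cuda/bin/nvcc
export CMAKE_CUDA_ARCHITECTURES="50;60;70"

go generate ./...
go build .
```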
#### Linux ROCm (AMD)
@@ -76,17 +78,21 @@ install (typically `/opt/rocm`), and `CLBlast_DIR` to the location of the
CLBlast install (typically `/usr/lib/cmake/CLBlast`). You can also customize
the AMD GPU targets by setting AMDGPU_TARGETS (e.g. `AMDGPU_TARGETS="gfx1101;gfx1102"`)
```
go generate ./...
```
Then build the binary:
```
go run build.go
go build .
```
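As a sketch, a build that narrows the AMD GPU targets against the default install locations might look like this; the paths and targets are assumptions, so match them to your system:
```bash
# Point the build at your ROCm and CLBlast installs (assumed defaults shown)
export ROCM_PATH=/opt/rocm
export CLBlast_DIR=/usr/lib/cmake/CLBlast
export AMDGPU_TARGETS="gfx1101;gfx1102"

go generate ./...
go build .
```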
ROCm requires elevated privileges to access the GPU at runtime. On most distros you can add your user account to the `render` group, or run as root.
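For example, on distros that gate GPU access behind the `render` group, something like the following typically works (group names vary by distro, so treat this as an assumption; log out and back in afterwards):
```bash
# Add the current user to the render group (some distros use "video" instead)
sudo usermod -a -G render $USER
```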
#### Advanced CPU Settings
By default, running `go run build.go` will compile a few different variations
By default, running `go generate ./...` will compile a few different variations
of the LLM library based on common CPU families and vector math capabilities,
including a lowest common denominator build which should run on almost any 64-bit CPU,
if somewhat slowly. At runtime, Ollama will auto-detect the optimal variation to
@@ -96,7 +102,8 @@ like to use. For example, to compile an optimized binary for an Intel i9-9880H,
you might use:
```
OLLAMA_CUSTOM_CPU_DEFS="-DLLAMA_AVX=on -DLLAMA_AVX2=on -DLLAMA_F16C=on -DLLAMA_FMA=on" go run build.go
OLLAMA_CUSTOM_CPU_DEFS="-DLLAMA_AVX=on -DLLAMA_AVX2=on -DLLAMA_F16C=on -DLLAMA_FMA=on" go generate ./...
go build .
```
#### Containerized Linux Build
@@ -117,7 +124,8 @@ Install required tools:
```powershell
$env:CGO_ENABLED="1"
go run build.go
go generate ./...
go build .
```
#### Windows CUDA (NVIDIA)
@@ -134,4 +142,4 @@ In addition to the common Windows development tools described above, install AMD
- [AMD HIP](https://www.amd.com/en/developer/resources/rocm-hub/hip-sdk.html)
- [Strawberry Perl](https://strawberryperl.com/)
Lastly, add `ninja.exe` included with MSVC to the system path (e.g. `C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\Common7\IDE\CommonExtensions\Microsoft\CMake\Ninja`).