llama server wrapper

This commit is contained in:
Bruce MacDonald
2023-06-23 13:10:13 -04:00
parent 8fa91332fa
commit 0758cb2d4b
7 changed files with 83 additions and 138 deletions

34
server/README.md Normal file
View File

@@ -0,0 +1,34 @@
# Server
🙊
## Installation
If using Apple silicon, you need a Python version that supports arm64:
```bash
wget https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-MacOSX-arm64.sh
bash Miniforge3-MacOSX-arm64.sh
```
Get the dependencies:
```bash
pip install llama-cpp-python
pip install -r requirements.txt
```
## Running
Put your model in `models/` and run:
```bash
python server.py
```
## API
### `POST /generate`
model: `string` - The name of the model to use in the `models` folder.
prompt: `string` - The prompt to use.