Revision as of 11:58, 12 July 2024

Links

HOWTO

List models

curl http://localhost:8080/v1/models

Audio to text

Apply model

curl http://localhost:8080/models/apply -H "Content-Type: application/json" -d '{
  "id": "huggingface@TheBloke/Yarn-Mistral-7B-128k-GGUF/yarn-mistral-7b-128k.Q5_K_M.gguf"
}'

Scripts

Talk to the chat interface

#!/bin/bash
echo -n "Ask me anything: "
read A
curl -s http://localhost:8080/v1/chat/completions \
   -H "Content-Type: application/json" \
   -d '{ "model": "gpt-4", "messages": [{"role": "user", "content": "'"$A"'", "temperature": 0.1}] }' |\
    jq     '.choices[].message.content' | sed 's/\\n/\n/g' | sed 's/\\"/"/g'

FAQ

File/directory locations

Local-ai log

 /usr/share/local-ai/llama.log

Messages

GPU device found but no CUDA backend present

If running in docker, try restarting docker

@@ Line 32: / Line 32: @@
 ===Local-ai log===
    /usr/share/local-ai/llama.log
+==Messages==
+===GPU device found but no CUDA backend present===
+If running in docker, try restarting docker

Anonymous

Search

LocalAI: Difference between revisions

Namespaces

More

Page actions

Revision as of 11:58, 12 July 2024

Contents

Links

HOWTO

List models

Audio to text

Apply model

Scripts

Talk to the chat interface

FAQ

File/directory locations

Local-ai log

Messages

GPU device found but no CUDA backend present

Navigation

Navigation

Wiki tools

Wiki tools

Anonymous

Search

LocalAI: Difference between revisions

Revision as of 11:58, 12 July 2024

Links

HOWTO

List models

Audio to text

Apply model

Scripts

Talk to the chat interface

FAQ

File/directory locations

Local-ai log

Messages

GPU device found but no CUDA backend present

Navigation

Wiki tools

Page tools