LocalAI: Difference between revisions
From DWIKI
m →FAQ Tag: wikieditor |
m →HOWTO |
||
| (6 intermediate revisions by the same user not shown) | |||
| Line 2: | Line 2: | ||
*[https://localai.io LocalAI homepage] | *[https://localai.io LocalAI homepage] | ||
*[https://localai.io/faq/ LocalAI FAQ] | *[https://localai.io/faq/ LocalAI FAQ] | ||
*[https://github.com/open-webui/open-webui Check out Open webui] | |||
==Installing localai== | |||
*https://localai.io/installation/index.print | |||
=HOWTO= | =HOWTO= | ||
| Line 16: | Line 19: | ||
"id": "huggingface@TheBloke/Yarn-Mistral-7B-128k-GGUF/yarn-mistral-7b-128k.Q5_K_M.gguf" | "id": "huggingface@TheBloke/Yarn-Mistral-7B-128k-GGUF/yarn-mistral-7b-128k.Q5_K_M.gguf" | ||
}' | }' | ||
==CLI== | |||
https://localai.io/reference/cli-reference/ | |||
===Reset conversation=== | |||
? | |||
=Scripts= | =Scripts= | ||
| Line 32: | Line 41: | ||
===Local-ai log=== | ===Local-ai log=== | ||
/usr/share/local-ai/llama.log | /usr/share/local-ai/llama.log | ||
===Models=== | |||
/usr/share/local-ai/models | |||
==Messages== | ==Messages== | ||
===GPU device found but no CUDA backend present=== | ===GPU device found but no CUDA backend present=== | ||
If running in docker, try restarting docker | If running in docker, try restarting docker | ||
===WARNING: failed to read int from file: open /sys/class/drm/card0/device/numa_node: no such file or directory=== | |||
===failed to load model with internal loader: grpc service not ready=== | |||
?? | |||
==Error messages== | |||
===Failed to process stream=== | |||
something broken? | |||
Latest revision as of 19:53, 27 May 2026
Links
Installing localai
HOWTO
List models
curl http://localhost:8080/v1/models
Audio to text
- https://localai.io/features/audio-to-text/
- https://docs.llamaindex.ai/en/stable/examples/llm/localai/
Apply model
curl http://localhost:8080/models/apply -H "Content-Type: application/json" -d '{ "id": "huggingface@TheBloke/Yarn-Mistral-7B-128k-GGUF/yarn-mistral-7b-128k.Q5_K_M.gguf" }'
CLI
https://localai.io/reference/cli-reference/
Reset conversation
?
Scripts
Talk to the chat interface
#!/bin/bash echo -n "Ask me anything: " read A curl -s http://localhost:8080/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "gpt-4", "messages": [{"role": "user", "content": "'"$A"'", "temperature": 0.1}] }' |\ jq '.choices[].message.content' | sed 's/\\n/\n/g' | sed 's/\\"/"/g'
FAQ
File/directory locations
Local-ai log
/usr/share/local-ai/llama.log
Models
/usr/share/local-ai/models
Messages
GPU device found but no CUDA backend present
If running in docker, try restarting docker
WARNING: failed to read int from file: open /sys/class/drm/card0/device/numa_node: no such file or directory
failed to load model with internal loader: grpc service not ready
??
Error messages
Failed to process stream
something broken?
