Daniel Hiltgen
|
b51e3b63ac
Statically link c++ and thread lib
|
5 月之前 |
jmorganca
|
a08f20d910
release: remove unwanted mingw dll.a files
|
5 月之前 |
jmorganca
|
6cea036027
Revert "llm: only statically link libstdc++"
|
5 月之前 |
jmorganca
|
5796bfc401
llm: only statically link libstdc++
|
5 月之前 |
jmorganca
|
f1a379aa56
llm: statically link pthread and stdc++ dependencies in windows build
|
5 月之前 |
Jeffrey Morgan
|
5304b765b2
llm: put back old include dir (#5507)
|
5 月之前 |
Jeffrey Morgan
|
78fb33dd07
fix typo in cgo directives in `llm.go` (#5501)
|
5 月之前 |
Jeffrey Morgan
|
8f8e736b13
update llama.cpp submodule to `d7fd29f` (#5475)
|
5 月之前 |
Michael Yang
|
829ff87bd1
revert tokenize ffi (#4761)
|
6 月之前 |
Jeffrey Morgan
|
763bb65dbb
use `int32_t` for call to tokenize (#4738)
|
6 月之前 |
Michael Yang
|
bf54c845e9
vocab only
|
6 月之前 |
Michael Yang
|
26a00a0410
use ffi for tokenizing/detokenizing
|
7 月之前 |
Michael Yang
|
01811c176a
comments
|
8 月之前 |
Michael Yang
|
9685c34509
quantize any fp16/fp32 model
|
8 月之前 |
Hernan Martinez
|
86e67fc4a9
Add import declaration for windows,arm64 to llm.go
|
8 月之前 |
Michael Yang
|
9502e5661f
cgo quantize
|
8 月之前 |
Daniel Hiltgen
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
9 月之前 |
Michael Yang
|
91b3e4d282
update memory calcualtions
|
9 月之前 |
Michael Yang
|
d338d70492
refactor model parsing
|
9 月之前 |
Patrick Devine
|
1b272d5bcd
change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347)
|
9 月之前 |
Jeffrey Morgan
|
f9cd55c70b
disable gpu for certain model architectures and fix divide-by-zero on memory estimation
|
9 月之前 |
Daniel Hiltgen
|
6c5ccb11f9
Revamp ROCm support
|
10 月之前 |
Daniel Hiltgen
|
a1dfab43b9
Ensure the libraries are present
|
10 月之前 |
Jeffrey Morgan
|
4458efb73a
Load all layers on `arm64` macOS if model is small enough (#2149)
|
11 月之前 |
Daniel Hiltgen
|
fedd705aea
Mechanical switch from log to slog
|
11 月之前 |
Michael Yang
|
eaed6f8c45
add max context length check
|
11 月之前 |
Daniel Hiltgen
|
7427fa1387
Fix up the CPU fallback selection
|
11 月之前 |
Daniel Hiltgen
|
de2fbdec99
Merge pull request #1819 from dhiltgen/multi_variant
|
11 月之前 |
Michael Yang
|
f4f939de28
Merge pull request #1552 from jmorganca/mxyng/lint-test
|
11 月之前 |
Daniel Hiltgen
|
39928a42e8
Always dynamically load the llm server library
|
11 月之前 |