Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Daniel Hiltgen | b51e3b63ac | Statically link c++ and thread lib | 5 months ago |
| jmorganca | a08f20d910 | release: remove unwanted mingw dll.a files | 5 months ago |
| jmorganca | 6cea036027 | Revert "llm: only statically link libstdc++" | 5 months ago |
| jmorganca | 5796bfc401 | llm: only statically link libstdc++ | 5 months ago |
| jmorganca | f1a379aa56 | llm: statically link pthread and stdc++ dependencies in windows build | 5 months ago |
| Jeffrey Morgan | 5304b765b2 | llm: put back old include dir (#5507) | 5 months ago |
| Jeffrey Morgan | 78fb33dd07 | fix typo in cgo directives in `llm.go` (#5501) | 5 months ago |
| Jeffrey Morgan | 8f8e736b13 | update llama.cpp submodule to `d7fd29f` (#5475) | 5 months ago |
| Michael Yang | 829ff87bd1 | revert tokenize ffi (#4761) | 6 months ago |
| Jeffrey Morgan | 763bb65dbb | use `int32_t` for call to tokenize (#4738) | 6 months ago |
| Michael Yang | bf54c845e9 | vocab only | 6 months ago |
| Michael Yang | 26a00a0410 | use ffi for tokenizing/detokenizing | 7 months ago |
| Michael Yang | 01811c176a | comments | 8 months ago |
| Michael Yang | 9685c34509 | quantize any fp16/fp32 model | 8 months ago |
| Hernan Martinez | 86e67fc4a9 | Add import declaration for windows,arm64 to llm.go | 8 months ago |
| Michael Yang | 9502e5661f | cgo quantize | 8 months ago |
| Daniel Hiltgen | 58d95cc9bd | Switch back to subprocessing for llama.cpp | 9 months ago |
| Michael Yang | 91b3e4d282 | update memory calcualtions | 9 months ago |
| Michael Yang | d338d70492 | refactor model parsing | 9 months ago |
| Patrick Devine | 1b272d5bcd | change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347) | 9 months ago |
| Jeffrey Morgan | f9cd55c70b | disable gpu for certain model architectures and fix divide-by-zero on memory estimation | 9 months ago |
| Daniel Hiltgen | 6c5ccb11f9 | Revamp ROCm support | 10 months ago |
| Daniel Hiltgen | a1dfab43b9 | Ensure the libraries are present | 10 months ago |
| Jeffrey Morgan | 4458efb73a | Load all layers on `arm64` macOS if model is small enough (#2149) | 11 months ago |
| Daniel Hiltgen | fedd705aea | Mechanical switch from log to slog | 11 months ago |
| Michael Yang | eaed6f8c45 | add max context length check | 11 months ago |
| Daniel Hiltgen | 7427fa1387 | Fix up the CPU fallback selection | 11 months ago |
| Daniel Hiltgen | de2fbdec99 | Merge pull request #1819 from dhiltgen/multi_variant | 11 months ago |
| Michael Yang | f4f939de28 | Merge pull request #1552 from jmorganca/mxyng/lint-test | 11 months ago |
| Daniel Hiltgen | 39928a42e8 | Always dynamically load the llm server library | 11 months ago |