Historique des commits

Auteur SHA1 Message Date
  Jeffrey Morgan f8241bfba3 gpu: report system free memory instead of 0 (#5521) il y a 7 mois
  Daniel Hiltgen 6f351bf586 review comments and coverage il y a 8 mois
  Daniel Hiltgen fc37c192ae Refine CPU load behavior with system memory visibility il y a 8 mois
  Daniel Hiltgen 30a7d7096c Bump VRAM buffer back up il y a 8 mois
  Michael Yang 4736391bfb llm: add minimum based on layer size il y a 9 mois
  Jeffrey Morgan f0c454ab57 gpu: add 512MiB to darwin minimum, metal doesn't have partial offloading overhead (#4068) il y a 9 mois
  Daniel Hiltgen 34b9db5afc Request and model concurrency il y a 10 mois
  Michael Yang 26df674785 scale graph based on gpu count il y a 9 mois
  Michael Yang 41a272de9f darwin: no partial offloading if required memory greater than system il y a 9 mois
  Michael Yang 7e33a017c0 partial offloading il y a 10 mois
  Daniel Hiltgen be330174dd Allow setting max vram for workarounds il y a 11 mois
  peanut256 a189810df6 Determine max VRAM on macOS using `recommendedMaxWorkingSetSize` (#2354) il y a 11 mois
  Daniel Hiltgen 7427fa1387 Fix up the CPU fallback selection il y a 1 an
  Daniel Hiltgen 39928a42e8 Always dynamically load the llm server library il y a 1 an
  Daniel Hiltgen d88c527be3 Build multiple CPU variants and pick the best il y a 1 an
  Jeffrey Morgan c336693f07 calculate overhead based number of gpu devices (#1875) il y a 1 an
  Jeffrey Morgan 08f1e18965 Offload layers to GPU based on new model size estimates (#1850) il y a 1 an
  Jeffrey Morgan c7ea8f237e set `num_gpu` to 1 only by default on darwin arm64 (#1771) il y a 1 an
  Daniel Hiltgen a2ad952440 Fix windows system memory lookup il y a 1 an
  Daniel Hiltgen d966b730ac Switch windows build to fully dynamic il y a 1 an
  Daniel Hiltgen 7555ea44f8 Revamp the dynamic library shim il y a 1 an
  Daniel Hiltgen 6558f94ed0 Fix darwin intel build il y a 1 an
  Daniel Hiltgen 35934b2e05 Adapted rocm support to cgo based llama.cpp il y a 1 an