Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| thonypythony | 2c7f7ace8f | Update README.md | 5 months ago |
| thonypythony | 02a0458c94 | imgs | 5 months ago |
| Jeffrey Morgan | 791650ddef | sched: only error when over-allocating system memory (#5626) | 5 months ago |
| Jeffrey Morgan | efbf41ed81 | llm: dont link cuda with compat libs (#5621) | 5 months ago |
| Michael Yang | cf15589851 | Merge pull request #5620 from ollama/mxyng/templates | 5 months ago |
| Michael Yang | 19753c18c0 | update embedded templates | 5 months ago |
| Michael Yang | 41be28096a | add system prompt to first legacy template | 5 months ago |
| Michael Yang | 37a570f962 | Merge pull request #5612 from ollama/mxyng/mem | 5 months ago |
| Michael Yang | 5a739ff4cb | chatglm graph | 5 months ago |
| Jeffrey Morgan | 4e262eb2a8 | remove `GGML_CUDA_FORCE_MMQ=on` from build (#5588) | 5 months ago |
| Daniel Hiltgen | 4cfcbc328f | Merge pull request #5124 from dhiltgen/amd_windows | 5 months ago |
| Daniel Hiltgen | 79292ff3e0 | Merge pull request #5555 from dhiltgen/msvc_deps | 5 months ago |
| Daniel Hiltgen | 8ea500441d | Merge pull request #5580 from dhiltgen/cuda_overhead | 5 months ago |
| Daniel Hiltgen | b50c818623 | Merge pull request #5607 from dhiltgen/win_rocm_v6 | 5 months ago |
| Daniel Hiltgen | b99e750b62 | Merge pull request #5605 from dhiltgen/merge_glitch | 5 months ago |
| Daniel Hiltgen | 1f50356e8e | Bump ROCm on windows to 6.1.2 | 5 months ago |
| Daniel Hiltgen | 22c81f62ec | Remove duplicate merge glitch | 5 months ago |
| Daniel Hiltgen | 2d1e3c3229 | Merge pull request #5503 from dhiltgen/dual_rocm | 5 months ago |
| royjhan | 4918fae535 | OpenAI v1/completions: allow stop token list (#5551) | 5 months ago |
| royjhan | 0aff67877e | separate request tests (#5578) | 5 months ago |
| Daniel Hiltgen | f6f759fc5f | Detect CUDA OS Overhead | 5 months ago |
| Daniel Hiltgen | 9544a57ee4 | Merge pull request #5579 from dhiltgen/win_static_deps | 5 months ago |
| Daniel Hiltgen | b51e3b63ac | Statically link c++ and thread lib | 5 months ago |
| Michael Yang | 6bbbc50f10 | Merge pull request #5440 from ollama/mxyng/messages-templates | 5 months ago |
| Michael Yang | 9bbddc37a7 | Merge pull request #5126 from ollama/mxyng/messages | 5 months ago |
| Jeffrey Morgan | e4ff73297d | server: fix model reloads when setting `OLLAMA_NUM_PARALLEL` (#5560) | 5 months ago |
| Daniel Hiltgen | b44320db13 | Bundle missing CRT libraries | 5 months ago |
| Daniel Hiltgen | 0bacb30007 | Workaround broken ROCm p2p copy | 5 months ago |
| Jeffrey Morgan | 53da2c6965 | llm: remove ambiguous comment when putting upper limit on predictions to avoid infinite generation (#5535) | 5 months ago |
| Jeffrey Morgan | d8def1ff94 | llm: allow gemma 2 to context shift (#5534) | 5 months ago |