diff options
| author | Danilo M. <danix@danix.xyz> | 2026-04-03 18:17:29 +0200 |
|---|---|---|
| committer | Danilo M. <danix@danix.xyz> | 2026-04-03 18:17:29 +0200 |
| commit | ebb26eac2948e02def3c7ac1ac23c4ecd345a5a7 (patch) | |
| tree | c54b2a6d28a89333b771bdee05e6baa45fe0c94f /SlackBuilds/llama.cpp-vulkan/README | |
| parent | 1045963959ddfb697898fa90476f837aae4e2881 (diff) | |
| download | my-slackbuilds-ebb26eac2948e02def3c7ac1ac23c4ecd345a5a7.tar.gz my-slackbuilds-ebb26eac2948e02def3c7ac1ac23c4ecd345a5a7.zip | |
repo: flatten layout — move packages to root, extras to .extras/
- Move all packages from SlackBuilds/ to repo root
- Move hooks/, docs/, nvchecker.toml to .extras/
- Update CLAUDE.md and README.md to reflect new structure
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Diffstat (limited to 'SlackBuilds/llama.cpp-vulkan/README')
| -rw-r--r-- | SlackBuilds/llama.cpp-vulkan/README | 22 |
1 files changed, 0 insertions, 22 deletions
diff --git a/SlackBuilds/llama.cpp-vulkan/README b/SlackBuilds/llama.cpp-vulkan/README deleted file mode 100644 index 5509d44..0000000 --- a/SlackBuilds/llama.cpp-vulkan/README +++ /dev/null @@ -1,22 +0,0 @@ -llama.cpp - -LLM inference in C/C++ - -The main goal of llama.cpp is to enable LLM inference with minimal -setup and state-of-the-art performance on a wide range of hardware -locally and in the cloud. - - - Plain C/C++ implementation without any dependencies - - Apple silicon is a first-class citizen - optimized via ARM NEON, - Accelerate and Metal frameworks - - AVX, AVX2, AVX512 and AMX support for x86 architectures - - RVV, ZVFH, ZFH, ZICBOP and ZIHINTPAUSE support for RISC-V - architectures - - 1.5-bit, 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, and 8-bit integer - quantization for faster inference and reduced memory use - - Custom CUDA kernels for running LLMs on NVIDIA GPUs (support for - AMD GPUs via HIP and Moore Threads GPUs via MUSA) - - Vulkan and SYCL backend support - - CPU+GPU hybrid inference to partially accelerate models larger than - the total VRAM capacity - |
