# llamacpp
Here are 52 public repositories matching this topic.
- **Kernels & AI inference engine for phone chips**
  Updated Oct 1, 2025 · C++
- **Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O**
  Updated Sep 13, 2025 · C++
- **Inference Vision Transformer (ViT) in plain C/C++ with ggml**
  Topics: c, cpu, ai, computer-vision, cpp, image-classification, edge-computing, vision-transformer, whisper-cpp, llamacpp, ggml
  Updated Apr 11, 2024 · C++
- **Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).**
  Topics: bloom, falcon, moe, gemma, mistral, mixture-of-experts, model-quantization, multi-gpu-inference, m2m100, llamacpp, llm-inference, internlm, llama2, qwen, baichuan2, mixtral, phi-2, deepseek, minicpm
  Updated Mar 15, 2024 · C++
- **LLM in Godot**
  Updated Jun 23, 2024 · C++
- **Local LLMs in your DAW!**
  Updated Sep 25, 2024 · C++
- **Getting an LLM to work with Godot.**
  Updated Oct 11, 2023 · C++