Skip to content
View mgoin's full-sized avatar
🤠
🤠

Organizations

@neuralmagic @vllm-project @llm-d

Block or report mgoin

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 59.3k 10.5k

  2. vllm-project/llm-compressor vllm-project/llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python 2k 246

  3. llm-d/llm-d llm-d/llm-d Public

    llm-d enables high-performance distributed LLM inference on Kubernetes

    Makefile 1.8k 175

  4. neuralmagic/deepsparse neuralmagic/deepsparse Public archive

    Sparsity-aware deep learning inference runtime for CPUs

    Python 3.2k 191

  5. advos advos Public

    RISC-V OS in Rust with hardware support for SiFive's HiFive1 board

    Rust

  6. torch_bitmask torch_bitmask Public

    Implementations of bitmask compression for weight sparsity in PyTorch

    Python 4 1