numa

Here are 73 public repositories matching this topic...

guqiong96 / Lvllm

LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.

cpu gpu model inference parallelism decode moe numa hybrid prefill vllm

Updated Mar 25, 2026
Python

eXascaleInfolab / PyExPool

Star

Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing and profiling capabilities of the external apps on NUMA architecture

multiprocessing parallel-computing numa monitoring-server cache-control task-queue application-framework parallel-processing execution-pool benchmarking-framework load-balancing in-memory-computations

Updated Aug 28, 2019
Python

Scottcjn / ram-coffers

Star

RAM Coffers: Conditional Memory via NUMA-Distributed Weight Banking - O(1) lookup routing for LLM inference (Dec 16, 2025 - predates DeepSeek Engram by 27 days)

ai ram memory-management numa neuromorphic hacktoberfest ppc64le first-timers-only cognitive-computing power8 good-first-issue cpu-inference llm llama-cpp

Updated Mar 21, 2026
C

domargan / awesome-numa

Star

A community-oriented list of useful NUMA-related libraries, tools, and other resources

multiprocessing multithreading awesome-list numa shared-memory numa-systems non-uniform-memory-access numa-benchmarks numa-aware

Updated Sep 28, 2020

lsds / LightSaber

Star

Multi-core Window-Based Stream Processing Engine

compression cpp llvm stream-processing ssd rdma numa multi-core aggregation sliding-windows incremental-computation libaio

Updated Oct 20, 2021
C++

Scottcjn / llama-cpp-power8

Star

AltiVec/VSX optimized llama.cpp for IBM POWER8

machine-learning ai numa ibm powerpc hacktoberfest altivec vsx ppc64le first-timers-only power8 good-first-issue cpu-inference llm llama-cpp ggml

Updated Mar 21, 2026
C

memtt / numaprof

Star

NUMAPROF is a NUMA memory profliler based on Pintool to track your remote memory accesses.

profiler memory instrumentation numa

Updated Jun 20, 2025
C++

peterosterlund2 / texel

Star

Texel chess engine

android windows linux cmake chess-engine cluster cpp14 mpi smp numa

Updated Nov 21, 2025
C++

guqiong96 / Lsglang

Star

Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parallel architecture, suitable for MOE model hybrid inference.

cpu gpu model inference parallelism decode moe numa hybird prefill sglang

Updated Mar 12, 2026
Python

HadrienG2 / hwlocality

Star

Rust bindings to Open MPI Portable Hardware Locality "hwloc" library, covering version 2.0 and above.

cache os locality memory-management numa hwloc ffi-bindings hardware-support

Updated Mar 25, 2026
Rust

k13132 / openwrt-dpdk

Star

Data Plane Development Kit (DPDK) integration into OpenWrt

dpdk kernel-module openwrt numa openwrt-package openwrt-feed dpdk-driver

Updated Apr 17, 2024
Makefile

tklauser / numcpus

Star

Go package providing information about the number of CPUs in the system

go linux golang unix cpu online offline bsd numa cputopology

Updated Mar 10, 2026
Go

numamma / numamma

Star

NumaMMA is a lightweight memory profiler for parallel applications

profile memory numa pebs

Updated Jun 10, 2025
C

bastion-rs / numanji

Star

Local-affinity first NUMA-aware allocator with optional fallback.

rust allocator mmap numa numa-aware globalallocator

Updated Apr 29, 2021
Rust

c3sr / comm_scope

Star

NUMA-aware multi-CPU multi-GPU data transfer benchmarks

performance gpu cuda bandwidth numa hip benchmark-suite nvlink

Updated Oct 26, 2023
C++

numap-library / numap

Star

profile memory numa pebs

Updated May 27, 2024
C

ct-clmsn / mesos-cpusets

Star

cgroups-based cpuset isolator and resource estimator modules for mesos

cloud mesos cloud-computing numa hardware-topology

Updated Jan 9, 2017
C++

latentPrion / zambesii

Star

Non-unix, custom-API hybrid OS kernel written in C++ which can be thought of as an emulated microkernel. The native API is almost fully asynchronous and the kernel is aimed at high-scaling, high-throughput-requiring multiprocessor workloads, with working support for SMP and NUMA already implemented. Join the IRC channel, #zbz-dev on freenode!

c-plus-plus kernel os operating-system smp numa hybrid-kernel operating-system-kernel uniform-driver-interface udi symmetric-multiprocessing non-uniform-memory-access

Updated Mar 25, 2026
C++

yhaenggi / numad

Star

numad for debian/ubuntu

daemon numa numad

Updated Apr 28, 2016
C

flashxio / knor

Star

A repo to allow validation of performance results in the knor paper and provide a fast, scalable k-means implementation.

streaming algorithm cluster distributed-computing numa external-memory kmeans-clustering

Updated Mar 31, 2020
C++

Improve this page

Add a description, image, and links to the numa topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the numa topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

numa

Here are 73 public repositories matching this topic...

guqiong96 / Lvllm

eXascaleInfolab / PyExPool

Scottcjn / ram-coffers

domargan / awesome-numa

lsds / LightSaber

Scottcjn / llama-cpp-power8

memtt / numaprof

peterosterlund2 / texel

guqiong96 / Lsglang

HadrienG2 / hwlocality

k13132 / openwrt-dpdk

tklauser / numcpus

numamma / numamma

bastion-rs / numanji

c3sr / comm_scope

numap-library / numap

ct-clmsn / mesos-cpusets

latentPrion / zambesii

yhaenggi / numad

flashxio / knor

Improve this page

Add this topic to your repo