Anubis icon

Anubis OSS

Local LLM benchmarking for Apple Silicon with real-time hardware telemetry

macOS 15+ Apple Silicon SwiftUI GPL-3.0
Anubis benchmark dashboard Anubis expanded metrics export

Why Anubis?

The local LLM ecosystem on macOS is fragmented. Chat wrappers focus on conversation, performance monitors are CLI-only, and no tool correlates hardware metrics with inference speed in real time.

Chat wrappers
Ollama, LM Studio, Jan focus on conversation — not systematic testing
Performance monitors
asitop, macmon, mactop are CLI-only and lack LLM context
Eval frameworks
promptfoo requires YAML configs and terminal expertise
Anubis
Correlates GPU, CPU, power, memory, and frequency with inference — in a native macOS app

Features

Benchmark

Real-time dashboard with 8 metric cards, 7 live charts, power telemetry, and configurable prompt presets. Stream responses with live hardware overlay.

Arena

Side-by-side A/B model comparison. Sequential or parallel execution. Vote for a winner — results are persisted with full stats.

Vault

Unified model management across all backends. Pull, delete, inspect, and unload models. Automatic metadata enrichment from HuggingFace and LM Studio caches.

Power Telemetry

GPU, CPU, ANE, and DRAM power in watts via IOReport. See watts-per-token efficiency — compare quantizations by their actual power cost.

Process Monitoring

Auto-detects the backend process by port. Tracks real memory footprint including Metal/GPU allocations. Manual override available.

Export & History

Session history with full replay. Export as CSV, Markdown, or shareable 2x retina PNG images with one click. Respects light/dark mode.

Supported Backends

Ollama LM Studio mlx-lm vLLM LocalAI OpenWebUI Docker Models Any OpenAI-compatible

Hardware Metrics

MetricSourceDescription
GPU UtilizationIOReportGPU active residency percentage
CPU Utilizationhost_processor_infoUsage across all cores
GPU PowerIOReport Energy ModelGPU power consumption in watts
CPU PowerIOReport Energy ModelCPU (E-cores + P-cores) power in watts
ANE PowerIOReport Energy ModelNeural Engine power consumption
DRAM PowerIOReport Energy ModelMemory subsystem power
GPU FrequencyIOReport GPU StatsWeighted average from P-state residency
Process Memoryproc_pid_rusageBackend phys_footprint (includes Metal/GPU)
Thermal StateProcessInfoSystem thermal pressure level
Anubis arena comparison Anubis vault

Requirements

macOS 15.0+
Sequoia or later
Apple Silicon
M1 / M2 / M3 / M4 / M5+
8 GB+
16 GB recommended
Ollama
Or any openAI API endpoint comaptible backend

Try Anubis OSS

Free and open source. Download the notarized app or build from source.