Your own AI agent server

Dedicated hardware. Pre-configured. Running in minutes.

Managed

Agent Hosting

Starter

BYOK agents for development and small workloads

$25 /mo

2 vCPU / 4 GB RAM / 40 GB NVMe

  • Bring your own API key (Anthropic, OpenAI, etc.)
  • Solon agent runtime pre-installed
  • Web dashboard
  • Tenant isolation + auto-TLS
  • Basic monitoring
Deploy Starter
Most popular

Pro

Production agents with higher throughput

$49 /mo

4 vCPU / 16 GB RAM / 80 GB NVMe

  • Everything in Starter
  • Higher throughput + multi-model routing
  • Request logging + analytics
  • Priority provisioning
  • Email alerts
Deploy Pro

GPU L4

Local inference with NVIDIA L4

$299 /mo

8 vCPU / 32 GB RAM / 160 GB NVMe / NVIDIA L4 24 GB

  • Everything in Pro
  • Run open-source models locally
  • No cloud API dependency
  • Full data sovereignty
  • Custom model deployment
Deploy GPU L4
Dedicated GPU

Run Your Own Models

NVIDIA GPUs in European datacenters. Run any open-source model. Your data never leaves your server.

GPU A100

Run Llama 70B, Mixtral, DeepSeek locally

$3.49 /hr

~$2,549/mo

NVIDIA A100 80 GB SXM4

10 vCPU / 120 GB RAM / 200 GB NVMe

  • Full Solon agent platform
  • Run any HuggingFace model
  • No API costs, no rate limits
  • Data never leaves your server
  • Finland or Iceland datacenter
Deploy A100

GPU H100

Fastest inference for demanding workloads

$5.49 /hr

~$3,999/mo

NVIDIA H100 80 GB SXM5

20 vCPU / 200 GB RAM / 400 GB NVMe

  • Everything in A100 tier
  • Run any model at peak speed
  • 2-3x faster than A100
  • Ideal for multi-agent workflows
  • Finland or Iceland datacenter
Deploy H100

GPU H200

Maximum VRAM for the largest models

$6.89 /hr

~$4,999/mo

NVIDIA H200 141 GB SXM5

20 vCPU / 200 GB RAM / 400 GB NVMe

  • Everything in H100 tier
  • 141 GB VRAM — fit far larger models and longer contexts than 80 GB cards
  • Largest context windows
  • Fine-tuning capable
  • Finland or Iceland datacenter
Deploy H200

Dedicated GPU tiers (A100, H100, H200) are billed per hour, on dedicated hardware in EU datacenters (Finland or Iceland). Prices exclude VAT.

FAQ

What does "managed" mean?

We handle server provisioning, security hardening, TLS, monitoring, and updates. You get a running Solon instance with a web dashboard. Your server, your data — we just keep it running.

What's the difference between BYOK and GPU tiers?

Starter and Pro tiers use your existing API keys (Anthropic, OpenAI, etc.) for inference — affordable and fast to start. GPU tiers run open-source models locally on dedicated NVIDIA hardware — no API costs, full data sovereignty.
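For BYOK, the official provider SDKs read well-known environment variables; a minimal sketch is below. (How Solon itself ingests the keys, whether via a dashboard field or a config file, isn't specified on this page; the key values shown are placeholders.)

```shell
# Standard environment variables read by the official provider SDKs.
export ANTHROPIC_API_KEY="sk-ant-..."   # Anthropic SDK default
export OPENAI_API_KEY="sk-..."          # OpenAI SDK default
```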

Where are the servers located?

Monthly tiers (Starter, Pro, GPU L4) run on Hetzner in Germany, Finland, or US East. Hourly GPU tiers (A100, H100, H200) run on dedicated NVIDIA hardware in Finland or Iceland.

Is each instance isolated?

Every customer gets a dedicated server. No shared compute, no noisy neighbors. Full tenant isolation with automatic TLS and firewall.

Can I switch tiers later?

Yes. Upgrade or downgrade anytime. We'll migrate your instance to the new server type with minimal downtime.

How does GPU billing work?

Dedicated GPU tiers (A100, H100, H200) are billed per hour; you pay only while the instance is running. Starter, Pro, and GPU L4 are billed monthly at a flat rate.
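As a rough check on the monthly estimates shown in the GPU tier cards (assuming ~730 hours in an average month; the listed figures include rounding and appear slightly discounted at the high end):

```python
HOURS_PER_MONTH = 730  # 24 h * 365 days / 12 months, approximately

def monthly_estimate(hourly_rate: float) -> int:
    """Approximate full-time monthly cost (in whole dollars) from an hourly GPU rate."""
    return round(hourly_rate * HOURS_PER_MONTH)

print(monthly_estimate(3.49))  # A100: 2548, close to the listed ~$2,549/mo
print(monthly_estimate(5.49))  # H100: 4008 (listed ~$3,999/mo)
print(monthly_estimate(6.89))  # H200: 5030 (listed ~$4,999/mo)
```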

Can I self-host instead?

Absolutely. Solon is open source (MIT). Install it anywhere with one command. Managed hosting is for teams who'd rather focus on building than managing infrastructure.

Self-hosted

Solon Cloud

Already running Solon? Connect your instances to the cloud dashboard for remote access, team management, and monitoring.

Free

$0
  • 1 self-hosted instance
  • 60 req/min
  • Cloudflare Tunnel

Pro

$19 /mo
  • 5 instances
  • 300 req/min
  • Persistent relay URLs
  • Team access

Team

$49 /mo
  • 50 instances
  • 1,000 req/min
  • 25 team members
  • Priority support

Ready to deploy?

Get a managed AI agent instance running in minutes. No infrastructure experience needed.