LLM Price Runner

Workload (input : output ratio)

Latency sensitivity

Your location

AI Value = log₁₀(quality / blended $) − log₁₀((ms + k/4) / (k + k/4))
ms = TTFT + city→region RTT + α·gen_time; bonus below knee k, penalty above; +1.0 ≈ 10× value

Cheapest Input

—

per 1M tokens

Best AI Value (chat)

—

AI value score

Fastest from Vilnius 🇱🇹

—

estimated RTT

Best Reasoning AI Value

—

AI value score

EU-Native Models

—

with European endpoints

🏆 Best AI Value per Category

Provider	Model	In / 1M	Out / 1M	Ctx	Latency 🇱🇹	Quality Index	AI Value	Links

📊 Methodology: Quality = Artificial Analysis Intelligence Index (rebased 2026, top frontier ~57–60). Cost = workload-weighted blend of input and output prices, in $ per 1M tokens. Latency = TTFT (inference) + estimated RTT from your selected city to the model's datacenter region, followed by a log adjustment centered on the sensitivity knee (bonus below the knee, penalty above it). Throughput (tok/s) is shown per model and folded into the typical response time for the chosen workload.
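The cost and latency inputs described above can be sketched in Python. This is a minimal illustration, not the site's actual code: the function names, the linear weighting by ratio parts, the conversion of throughput into generation time, and the default value of α are all assumptions, since the page does not state them.

```python
def blended_cost(input_price, output_price, input_parts, output_parts):
    """Workload-weighted $ per 1M tokens for an input:output ratio such as 3:1.

    Assumption: a plain weighted average by ratio parts.
    """
    total = input_parts + output_parts
    if total <= 0:
        raise ValueError("ratio parts must sum to a positive number")
    return (input_price * input_parts + output_price * output_parts) / total


def effective_latency_ms(ttft_ms, rtt_ms, output_tokens, tokens_per_second, alpha=1.0):
    """TTFT + city-to-region RTT + alpha * generation time, in milliseconds.

    Assumption: generation time is output_tokens / throughput; the value of
    alpha is not given on the page, so 1.0 is only a placeholder default.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    gen_time_ms = output_tokens / tokens_per_second * 1000.0
    return ttft_ms + rtt_ms + alpha * gen_time_ms


# Hypothetical prices and timings, for illustration only.
print(blended_cost(3.0, 15.0, input_parts=3, output_parts=1))       # 6.0
print(effective_latency_ms(400, 40, 200, 100, alpha=0.5))           # 1440.0
```

With a 3:1 workload, a model priced at $3 in and $15 out blends to $6 per 1M tokens, so output-heavy workloads shift the blend toward the output price.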
Formula: AI Value = log₁₀(quality_index / blended_cost) − log₁₀((latency_ms + knee/4) / (knee + knee/4)). The score is absolute, with no rescaling. LMSYS Chatbot Arena Elo is shown separately for human-preference context. Data auto-refreshes every 6 hours.
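The formula translates directly into a few lines of Python. The sketch below follows the expression as written; the function and parameter names are illustrative, and the input numbers are hypothetical rather than taken from the table.

```python
import math


def ai_value(quality_index, blended_cost, latency_ms, knee_ms):
    """AI Value = log10(quality / cost) - log10((ms + k/4) / (k + k/4))."""
    if quality_index <= 0 or blended_cost <= 0 or knee_ms <= 0 or latency_ms < 0:
        raise ValueError("quality, cost and knee must be positive; latency non-negative")
    value_term = math.log10(quality_index / blended_cost)
    latency_term = math.log10((latency_ms + knee_ms / 4) / (knee_ms + knee_ms / 4))
    return value_term - latency_term


# Hypothetical inputs: quality 60, $6 per 1M blended, knee at 1000 ms.
at_knee = ai_value(60, 6.0, latency_ms=1000, knee_ms=1000)   # latency term is 0 at the knee
faster = ai_value(60, 6.0, latency_ms=500, knee_ms=1000)     # below the knee: bonus
slower = ai_value(60, 6.0, latency_ms=4000, knee_ms=1000)    # above the knee: penalty
cheaper = ai_value(60, 0.6, latency_ms=1000, knee_ms=1000)   # 10x cheaper: about +1.0

print(round(at_knee, 3), round(cheaper, 3))
print(faster > at_knee > slower)
```

Two properties follow from the expression itself. At latency equal to the knee, the second term vanishes and the score reduces to log₁₀(quality / cost). Because of the knee/4 offset, the latency bonus is bounded: at 0 ms the term equals log₁₀(0.2), so the bonus never exceeds log₁₀(5) ≈ 0.70, while the penalty keeps growing as latency rises.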