
feat(ai): Add LiteLLM-like router infrastructure (#286)#439

Open
matiasmagni wants to merge 1 commit into arakoodev:ts from matiasmagni:feature/litellm-router-286-v2

Conversation

@matiasmagni matiasmagni commented Mar 18, 2026

Summary

  • Implement load balancing between multiple LLM deployments (OpenAI, Google Palm/Gemini, Cohere)
  • Routing strategies: least-tokens (default), simple-shuffle, latency-based
  • Timeout/retry with exponential backoff via axios interceptors
  • Streaming support for all providers
  • Token usage tracking with cost calculation
  • Sentry and Posthog logging callbacks
  • JSONNet configuration support
  • Mock servers for testing
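The timeout/retry behavior described above can be sketched as a backoff schedule. This is a minimal illustration only, assuming a classic doubling delay; `backoffDelayMs`, the 500 ms base, and the 30 s cap are hypothetical names and values, not necessarily what the PR's axios interceptors use.

```typescript
// Hypothetical sketch of exponential backoff, as described in the summary.
// Base delay and cap are illustrative, not the PR's actual values.
function backoffDelayMs(attempt: number, baseMs = 500, maxMs = 30000): number {
    // Delay doubles with each attempt: 500ms, 1000ms, 2000ms, ...
    return Math.min(baseMs * 2 ** attempt, maxMs);
}

// Delays for each retry, given the router's numRetries setting.
function backoffSchedule(numRetries: number): number[] {
    return Array.from({ length: numRetries }, (_, i) => backoffDelayMs(i));
}
```

With `numRetries: 3` as in the usage example below, this would wait roughly 500 ms, 1 s, then 2 s between attempts.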

Demo Video

https://youtu.be/Zij1XabtJnk

Features Implemented

  1. Load Balancing: Picks the deployment that is below its rate limit and has the fewest tokens used
  2. Reliability: Timeouts, retries, exponential backoff
  3. Streaming: Full streaming support
  4. Token Usage: Tracks prompt/completion/total tokens and cost
  5. Logging: Sentry + Posthog callbacks
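The least-tokens strategy from item 1 can be modeled in a few lines. This is a sketch under stated assumptions: the `Deployment` shape, `tokensUsed`/`requestsUsed` counters, and `pickLeastTokens` are hypothetical illustrations of the idea, not the PR's actual internals.

```typescript
// Hypothetical model of least-tokens routing: among deployments still under
// their TPM/RPM limits for the current window, pick the one with the fewest
// tokens consumed so far.
interface Deployment {
    apiKey: string;
    tpm: number; // tokens-per-minute limit
    rpm: number; // requests-per-minute limit
    tokensUsed: number; // tokens consumed in the current window
    requestsUsed: number; // requests made in the current window
}

function pickLeastTokens(deployments: Deployment[]): Deployment | undefined {
    return deployments
        .filter((d) => d.tokensUsed < d.tpm && d.requestsUsed < d.rpm)
        .sort((a, b) => a.tokensUsed - b.tokensUsed)[0];
}
```

If every deployment is at its limit, no candidate remains and the call would have to wait or fail.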

Tests

  • 8 passing E2E tests covering all features
  • Mock servers for OpenAI, Gemini, Cohere

Usage Example

import { Router } from "@arakoodev/edgechains.js/ai";

const router = new Router({
  modelList: [
    // Two deployments of the same model behind different API keys,
    // each with its own requests-per-minute and tokens-per-minute limits.
    { modelName: "gpt-3.5-turbo", provider: "openai", apiKey: "sk-xxx", rpm: 3000, tpm: 90000 },
    { modelName: "gpt-3.5-turbo", provider: "openai", apiKey: "sk-yyy", rpm: 3000, tpm: 90000 },
  ],
  routingStrategy: "least-tokens", // default strategy
  numRetries: 3,
  timeout: 30000, // ms
});

const response = await router.completion({
  model: "gpt-3.5-turbo",
  messages: [{ role: "user", content: "Hello!" }],
});
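The token-usage and cost tracking mentioned above amounts to multiplying token counts by per-token prices. The following is a minimal sketch; `calcCostUsd`, the `Usage` shape, and the per-1K prices shown are illustrative assumptions, not the PR's actual pricing table or API.

```typescript
// Hypothetical cost calculation from prompt/completion token counts.
interface Usage {
    promptTokens: number;
    completionTokens: number;
}

// Prices are expressed per 1K tokens, as most providers quote them.
function calcCostUsd(
    usage: Usage,
    promptPricePer1k: number,
    completionPricePer1k: number
): number {
    return (
        (usage.promptTokens / 1000) * promptPricePer1k +
        (usage.completionTokens / 1000) * completionPricePer1k
    );
}
```

For example, 1000 prompt tokens and 500 completion tokens at $0.0015/$0.002 per 1K would cost $0.0025.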

/claim #286

@matiasmagni
Author

I have read the Arakoo CLA Document and I hereby sign the CLA.
