Great, thank you. Side topic: does anyone know of a way to run a centralized proxy in front of all LLM services, online or local, so our services connect to it and we manage access to the LLMs in only one place? It should also record the calls made to the LLMs. That would make the whole UX of switching LLMs weekly much easier, since we would only reconfigure the proxy. The only option I know is LiteLLM, but its recording of LLM calls is a bit clunky to use properly.
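
To make it concrete, this is the shape I mean: every service speaks the OpenAI API to a single proxy endpoint, and only the proxy knows about the real providers. A minimal sketch, assuming a LiteLLM-style OpenAI-compatible proxy on localhost:4000 (the key and the model alias are placeholder values):

```python
# All services point at the proxy instead of any provider. Swapping the
# underlying model then only means changing the proxy's config.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:4000",  # the proxy, not a provider (assumed address)
    api_key="sk-proxy-key",            # key issued by the proxy; placeholder
)

response = client.chat.completions.create(
    model="gpt-4o",  # an alias the proxy maps to whatever backend is current
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
```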

LiteLLM is definitely your best bet. For recording, you can probably vibe-code a proxy in front of it that MITMs the traffic and dumps each request into whatever format you need.
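
Something like the following, say: a minimal FastAPI sketch that forwards chat completions to an assumed upstream LiteLLM proxy and appends each request/response pair to a JSONL file. The upstream URL and log path are placeholders, and streaming responses are not handled:

```python
# Tiny MITM-style logging proxy: forward the call, dump request + response.
import json
import time

import httpx
from fastapi import FastAPI, Request
from fastapi.responses import Response

UPSTREAM = "http://localhost:4000"  # assumed LiteLLM proxy address
app = FastAPI()

@app.post("/v1/chat/completions")
async def proxy(request: Request) -> Response:
    body = await request.body()
    async with httpx.AsyncClient(timeout=120) as client:
        upstream = await client.post(
            f"{UPSTREAM}/v1/chat/completions",
            content=body,
            headers={
                "Authorization": request.headers.get("authorization", ""),
                "Content-Type": "application/json",
            },
        )
    # Append one JSON line per call; swap this for whatever sink you need.
    with open("llm_calls.jsonl", "a") as f:
        f.write(json.dumps({
            "ts": time.time(),
            "request": json.loads(body),
            "response": upstream.json(),
            "status": upstream.status_code,
        }) + "\n")
    return Response(
        content=upstream.content,
        status_code=upstream.status_code,
        media_type="application/json",
    )
```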

LiteLLM can log things pretty well on its own.
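
If I remember the pattern right, the Python SDK lets you register a custom success callback that receives each call's kwargs and response, so you can write records wherever you like (the log destination here is a placeholder):

```python
# Sketch of LiteLLM's custom success-callback hook.
import json

import litellm

def log_call(kwargs, completion_response, start_time, end_time):
    # Called by LiteLLM after each successful completion.
    with open("llm_calls.jsonl", "a") as f:
        f.write(json.dumps({
            "model": kwargs.get("model"),
            "messages": kwargs.get("messages"),
            "latency_s": (end_time - start_time).total_seconds(),
        }) + "\n")

litellm.success_callback = [log_call]

resp = litellm.completion(
    model="gpt-4o-mini",  # example model name
    messages=[{"role": "user", "content": "Hello"}],
)
```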

I’ve been looking for this for my team but haven’t found it. Providers like OpenAI and Anthropic offer admin tokens to manage team accounts, and you can hook into Ollama or another self-managed service for local AI.

Seems like a great way to roll out AI to a medium-sized team: a very small group can coordinate access to the best available tools so the entire team doesn’t need to keep pace at the current breakneck speed.


I'm a maintainer of Opik, an open source LLM eval/observability framework. If you use something like LiteLLM or OpenRouter to handle the proxying of requests, Opik basically provides an out-of-the-box recording layer via its integrations with both:

https://github.com/comet-ml/opik
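
For a rough idea of the shape, here is a sketch using the OpenAI integration pointed at an OpenAI-compatible proxy; calls made through the wrapped client get recorded as traces (the base URL and key are placeholders):

```python
# Wrap an OpenAI client with Opik's tracker, then point it at the proxy.
from openai import OpenAI
from opik.integrations.openai import track_openai

client = track_openai(OpenAI(
    base_url="http://localhost:4000",  # your LiteLLM/OpenRouter-style proxy
    api_key="sk-proxy-key",            # placeholder
))

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}],
)
```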


Could you maybe make use of Simon Willison's [LLM lib/app](https://github.com/simonw/llm)? It has great LLM support (just pass in the model to use) and records everything by default.
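
To be precise, the record-everything default is a CLI behavior: every prompt and response lands in a local SQLite log you can browse with `llm logs`. The Python API is a couple of lines (the model name is just an example):

```python
# Sketch of the llm Python API; the CLI equivalent is `llm -m gpt-4o-mini "Say hello"`.
import llm

model = llm.get_model("gpt-4o-mini")
response = model.prompt("Say hello")
print(response.text())
```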

The one missing feature in LLM core for this right now is serving models over a local OpenAI-compatible HTTP server. There's a plugin you can try for that here, though: https://github.com/irthomasthomas/llm-model-gateway
