Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Wrapperone
Explore how Wrapperone handles parallel, rate‑limited LLM requests, immutable versioned conversation state, automatic tool integration, and flexible multi‑step, schema-based response formats.
A technical library designed for power users who need:
Parallel, rate-limited request processing to LLM providers (OpenAI, Anthropic, VLLM, LiteLLM, and OpenRouter).
A system to store conversation states (chat threads) with built-in immutability and versioning.
Automatic tool integration (both function-based and schema-based).
Flexible response formats: text, JSON, structured schemas, or multi-step workflows.
The core design revolves around Entities. Each conversation, message, or piece of configuration is an Entity that undergoes forking and versioning whenever modified. This ensures you have a complete lineage of conversation states without accidental in-place mutations.
This Python library orchestrates concurrent, immutable LLM inference with integrated tools.