AI Partner V1 is a self-hosted, open-source autonomous agent. Give it a goal in plain English — it researches, codes, generates documents, and delivers the finished result on your own machine. Need multi-tenancy, governance, and an AI proxy for your team? Meet V2 →
Agent pauses before running any script. Approve or reject via Telegram inline buttons. Also handles CAPTCHA hand-off, clarifications, and mid-execution input requests.
Full Puppeteer browser with 15fps live screencast. SPA-aware rendering, stealth mode. CAPTCHA detected? Agent pauses, you solve in the live panel, it resumes.
Every script runs inside a persistent container with Python, Node.js, and Bash. pandas, yfinance, matplotlib pre-loaded. Network isolation per-goal.
PDF via Chromium, multi-sheet XLSX, PowerPoint, and DOCX — all production-ready, tracked in the dashboard, and instantly downloadable.
After a successful goal, the system generalizes the solution into a reusable parameterized template. Versioned, deduplicated, and promotable to a callable tool.
Orchestrators spin up to 5 parallel sub-agents with budget controls. When a research agent exhausts its cap, it writes a handoff summary — the parent picks up seamlessly.
Cron-expression tasks persist across restarts. The Proactive Agenda reads your task list and memory — uses LLM judgment to pick the best action per tick.
Episodic log, vector search, persona traits, and periodic consolidation. Works with Ollama, OpenAI, Cohere, or pure-JS TF-IDF fallback — zero API keys required.
Browser, Shell, GitHub, Gmail, Calendar, Drive, Notion, Trello, Twitter, Spotify, Apify, Image Gen, Knowledge Base, Scheduler, and more. Plus any external MCP server.
A 5-stage pipeline that turns natural language into validated, delivered outcomes.
Your natural-language goal is converted into a structured definition with typed, measurable success criteria — file existence, content patterns, delivery receipts — before a single action is taken.
Complex goals are broken into ordered sub-tasks with declared outputs. The right specialist agent is auto-selected by keyword or explicitly @mentioned.
Agents reason, pick tools, execute, and assess in a tight loop. Up to 5 sub-agents run in parallel. When a script fails, a semantic repair engine fixes it and retries.
After every iteration the system checks real outcomes. The agent cannot self-report "done" — it's done when filesystem, content, and messaging evidence all pass.
Results delivered to Telegram, Discord, Slack, or your workspace. State is checkpointed — a restart picks up exactly where it stopped, all artifacts intact.
Goals close only when real-world evidence passes — no self-reporting allowed.
Fails a step? Replans, tries alternatives, semantically repairs scripts, retries.
State saved after each iteration. Restarts pick up exactly where they stopped.
Send goals, approve scripts, receive files — all from Telegram, no UI needed.
Anthropic, Gemini, OpenAI, Ollama, LM Studio, Groq, DeepSeek, Perplexity.
Type @handle or just a keyword — orchestrator routes to the right specialist.
Every tool call, file write, script run, and approval logged. Full run history.
Goals triggered by webhooks, Google Calendar events, or Gmail arrivals.
DALL-E 3 or Stability AI from inside any agent workflow.
Upload PDFs, URLs — chunked, embedded, searched for grounded answers.
URL safety checks, path guards, AES-256-GCM secrets, optional JWT auth.
Points to local Ollama by default. Start free, add keys progressively.
One env var to activate any integration. Auto-discovered at startup.
Python + yfinance runs in Docker sandbox. Styled multi-sheet Excel with embedded chart. File criterion confirms output.
Scheduler fires → GitHub data fetched → report written → Slack delivery. Set once, runs forever.
First run: Python from scratch. After success, system extracts a reusable template. Second run: template fills in → instant execution.
V2 is the multi-tenant, governed edition. Deploy once for your company — every member gets isolated agents, admins get full audit and control, and an opt-in AI proxy can act on your behalf, always behind your approval. Invite-only.
No scripts. No staging. AI Partner joins a live call, listens in real time, and responds.
Double-booked by default? AI Partner attends meetings you can't make, follows up on your behalf, monitors your market while you sleep. Your time is your revenue — stop wasting it on tasks that don't need you.
Spend 6-8 hours daily in meetings most don't need your judgment — just your presence. Send the proxy. It attends, flags blockers against your OKRs, captures decisions, and reports back in minutes.
AI Partner drafts replies to your emails, Slack DMs, and Telegram messages in your style. Nothing sends on its own — every action on your behalf is gated by AuthorityPolicy and a one-tap approval, and AI involvement is always disclosed.
Joins your Meet/Teams/Zoom calls with AI presence disclosed, listens, takes notes, and contributes only what you've authorized — then generates summaries and extracts action items automatically.
V2 doesn't wait to be asked. A background loop evaluates what to do every 15-30 minutes using your goals, memory, and real-time context.
From 17 servers in V1 to 36 in V2. Plus 4-tier computer use (DOM → Vision → Container Desktop → Host Control) with live CAPTCHA handoff.
Screens calls for you, then records, transcribes, and summarizes. Places approved outbound calls from your Twilio number — every call authorized by you, with AI disclosure on connect.
Episodic + Biographic + Counterparty + Vector + RAG. Understands who's who, what you've discussed, and maintains relationship context across all channels.
From 8 providers in V1. Now including NVIDIA NIM, Cerebras (~100k tokens/s), OpenRouter (100+ models), MiniMax, and LiteLLM Proxy.
Declarative rules define what the agent does automatically vs. what needs your approval. Telegram inline buttons for every gray-zone decision.
V2 is multi-tenant and governed from the core — the reason it's safe to deploy across a whole company.
One install hosts your whole team. Every user's chats, files, memory, and screenshots are fully isolated — no cross-user leak, enforced at the data, event, and identity layers.
A control room with a live activity feed, users & invites, an org audit log, per-user usage & cost, skill governance, knowledge sources, and agent access.
Every tool call, file write, and approval is logged. Declarative rules decide what runs automatically vs. what needs a human — per action, per relationship.
No default logins. First registrant becomes admin; everyone else joins by single-use invite code. JWT auth, encrypted credential vault, AES-256 secrets.
Expose your agents to other systems over authenticated agent-to-agent calls. Service-account consumers, metered usage, CSV billing, hostile-agent detection.
Sync Notion, Slack, and Drive into a shared knowledge base on a schedule. Every member's agent answers grounded in your company's own documents.
The same platform, pointed at one job. Email in → reconciled books → P&L emailed back. This is what "AI that works for your clients" looks like in production.
Each client gets their own isolated tenant and a dedicated address. They email bank and credit-card CSVs — attachments are pulled in automatically.
The bookkeeper profile categorizes transactions, reconciles against prior balances, and flags anomalies — in a sandbox, building a live-formula workbook.
Before anything reaches the client, you get a one-tap human approval. Review the numbers, then release — nothing goes out on its own.
A finished Excel workbook with live formulas and a zero-difference reconciliation lands back in the client's inbox. Month-end close, on autopilot.
V1 is live and production-ready. V2 is the next chapter.
| Capability | V1 — Available Now | V2 — Coming Soon |
|---|---|---|
| LLM Providers | 8 providers | 18+ providers including NVIDIA NIM, Cerebras |
| Tools / MCP Servers | 100+ tools, 17 servers | 100+ tools, 36 servers |
| Agent Profiles | 16 agents | ✓ Same + custom profiles |
| Inbox & Chat Proxy | — | Drafts on your behalf, you approve sends |
| Meeting Assistant | — | ✓ Built & running — private beta Q2 2026 |
| Phone Assistant | — | Call screening + approved outbound |
| Proactive Engine | — | Heartbeat + LLM agenda |
| Memory System | Vector + RAG | 5-layer (episodic + biographic + counterparty) |
| Computer Use | Basic browser automation | 4-tier (DOM → Vision → T3 Container → Host) |
| Skill Learning | ✓ Automatic | ✓ Enhanced + version control |
| Authority Policy | — | Per-action gating with Telegram approval |
| Self-Correction | ✓ 3-retry semantic | ✓ Enhanced with stuck detection |
Secure your spot now. Early access members get first invites, exclusive onboarding, and direct founder contact.
🔥 Spots are limited — once beta fills, waitlist closes.
Single Docker Compose. One API key minimum — or use local Ollama for free.
Book a walkthrough for your team, or get a service pilot scoped to one of your clients. Drop your email and we'll reach out.
🔒 No spam — we'll only reach out about your demo or pilot.
Self-hosted. Local-first. No subscriptions. No cloud lock-in.