Open source · Self-hosted · Free to run

The agent that
works for you.

AI Partner V1 is a self-hosted, open-source autonomous agent. Give it a goal in plain English — it researches, codes, generates documents, and delivers the finished result on your own machine. Need multi-tenancy, governance, and an AI proxy for your team? Meet V2 →

⚡ Deploy Free ▶ See It Work ⭐ Star on GitHub

curl -fsSL https://raw.githubusercontent.com/AmitkrPaiwal/AI-Partner/main/setup.sh | bash

iwr -useb https://raw.githubusercontent.com/AmitkrPaiwal/AI-Partner/main/install.ps1 | iex

ai-partner — goal-execution

Core Features

Built different. Works different.

🧠

Human-in-the-Loop

Agent pauses before running any script. Approve or reject via Telegram inline buttons. Also handles CAPTCHA hand-off, clarifications, and mid-execution input requests.

🌐

Live Browser Automation

Full Puppeteer browser with 15fps live screencast. SPA-aware rendering, stealth mode. CAPTCHA detected? Agent pauses, you solve in the live panel, it resumes.

🐳

Sandboxed Docker Execution

Every script runs inside a persistent container with Python, Node.js, and Bash. pandas, yfinance, matplotlib pre-loaded. Network isolation per-goal.

📄

Document Generation

PDF via Chromium, multi-sheet XLSX, PowerPoint, and DOCX — all production-ready, tracked in the dashboard, and instantly downloadable.

⚡

Skill Learning

After a successful goal, the system generalizes the solution into a reusable parameterized template. Versioned, deduplicated, and promotable to a callable tool.

🤝

Multi-Agent Delegation

Orchestrators spin up to 5 parallel sub-agents with budget controls. When a research agent exhausts its cap, it writes a handoff summary — the parent picks up seamlessly.

🕐

Scheduler + Proactive Agenda

Cron-expression tasks persist across restarts. The Proactive Agenda reads your task list and memory — uses LLM judgment to pick the best action per tick.

🧬

Vector Memory + RAG

Episodic log, vector search, persona traits, and periodic consolidation. Works with Ollama, OpenAI, Cohere, or pure-JS TF-IDF fallback — zero API keys required.

🔌

17 Built-in MCP Servers

Browser, Shell, GitHub, Gmail, Calendar, Drive, Notion, Trello, Twitter, Spotify, Apify, Image Gen, Knowledge Base, Scheduler, and more. Plus any external MCP server.

How It Works

Goal in. Result out.

A 5-stage pipeline that turns natural language into validated, delivered outcomes.

Goal Extraction

Your natural-language goal is converted into a structured definition with typed, measurable success criteria — file existence, content patterns, delivery receipts — before a single action is taken.

Task Decomposition

Complex goals are broken into ordered sub-tasks with declared outputs. The right specialist agent is auto-selected by keyword or explicitly @mentioned.

Parallel Execution

Agents reason, pick tools, execute, and assess in a tight loop. Up to 5 sub-agents run in parallel. When a script fails, a semantic repair engine fixes it and retries.

Criteria-Based Validation

After every iteration the system checks real outcomes. The agent cannot self-report "done" — it's done when filesystem, content, and messaging evidence all pass.

Delivery + Checkpoint

Results delivered to Telegram, Discord, Slack, or your workspace. State is checkpointed — a restart picks up exactly where it stopped, all artifacts intact.

Agent Profiles

16 specialists. Zero generic.

@web-researcher

Web Researcher

Research

@fact-checker

Fact Checker

Research

@trend-spotter

Trend Spotter

Research

@python-dev

Python Developer

Dev

@node-dev

Node.js Developer

Dev

@debugger

Debugger

Dev

@shell-op

Shell Operator

Dev

@fin-analyst

Financial Analyst

Data

@data-analyst

Data Analyst

Data

@excel-builder

Excel Builder

Data

@report-gen

Report Generator

Content

@summarizer

Summarizer

Content

@tech-writer

Tech Writer

Content

@prompt-arch

Prompt Architect

Content

@task-planner

Task Planner

Content

@telegram-rep

Telegram Reporter

Delivery

Full Capabilities

What it can actually do

🎯

Criteria-Based Completion

Goals close only when real-world evidence passes — no self-reporting allowed.

🔁

Dynamic Replanning

Fails a step? Replans, tries alternatives, semantically repairs scripts, retries.

💾

Checkpoint & Resume

State saved after each iteration. Restarts pick up exactly where they stopped.

📲

Telegram as Control Panel

Send goals, approve scripts, receive files — all from Telegram, no UI needed.

🤖

8 LLM Providers

Anthropic, Gemini, OpenAI, Ollama, LM Studio, Groq, DeepSeek, Perplexity.

🔗

Auto-Routing

Type @handle or just a keyword — orchestrator routes to the right specialist.

📊

Full Audit Log

Every tool call, file write, script run, and approval logged. Full run history.

🌐

Webhook + Event Triggers

Goals triggered by webhooks, Google Calendar events, or Gmail arrivals.

🖼️

AI Image Generation

DALL-E 3 or Stability AI from inside any agent workflow.

📚

Knowledge Base (RAG)

Upload PDFs, URLs — chunked, embedded, searched for grounded answers.

🔒

Security-First

URL safety checks, path guards, AES-256-GCM secrets, optional JWT auth.

⚙️

Zero-Config Local Mode

Points to local Ollama by default. Start free, add keys progressively.

Integrations

Connected to everything

One env var to activate any integration. Auto-discovered at startup.

📬

Tasks, approvals, files

💬

Discord

Text & file delivery

💼

Slack

Socket Mode

📱

QR auth

🐙

GitHub

Issues, PRs, code

📝

Notion

Read, create, query

📅

Google Calendar

Events + triggers

📧

Gmail

Send, search, trigger

💾

Google Drive

Search, create

🐦

Twitter / X

Search, post, reply

🎵

Spotify

Play, pause, queue

🕷️

Apify

Proxy scraping

🎨

Image Gen

DALL-E 3 / Stability

Demo Flows

Real tasks. Real outcomes.

Daily Tech Briefing → Telegram

"Research today's top AI stories from HN + GitHub. Write a report and send to Telegram."

Full pipeline: research → PDF generation → validated Telegram delivery. Goal closes only after the file actually arrives.

Trend SpotterWeb SearchReport GenPDFTelegram

Live Stock Portfolio Snapshot

"Fetch closing prices for RELIANCE.NS, TCS.NS, INFY.NS. Build Excel with price, change%, 5-day chart."

Python + yfinance runs in Docker sandbox. Styled multi-sheet Excel with embedded chart. File criterion confirms output.

Financial AnalystyfinanceExcel BuilderXLSX

CAPTCHA-Protected Research

"Go to LinkedIn and find the top 5 posts about AI agents this week."

Browser navigates → CAPTCHA detected → Inspector opens → you solve in live preview → agent resumes and extracts content.

BrowserCAPTCHA HITLLive Screencast

Automated Weekly GitHub Report

Every Monday 9am → "Summarize commits, note open PRs, post summary to Slack."

Scheduler fires → GitHub data fetched → report written → Slack delivery. Set once, runs forever.

SchedulerGitHubReport GenSlack

Skill Learning: Instant Second Run

"Fetch NIFTY50 data and plot the last 30 days."

First run: Python from scratch. After success, system extracts a reusable template. Second run: template fills in → instant execution.

Skill LearningTemplatesTool Promotion

The Next Generation

🔒 Private — not open-source

📅 Beta opens Q2 2026

An AI workforce
for your whole team.

V2 is the multi-tenant, governed edition. Deploy once for your company — every member gets isolated agents, admins get full audit and control, and an opt-in AI proxy can act on your behalf, always behind your approval. Invite-only.

🔔 Get Early Access Explore V2 Features

ai-partner — human-proxy mode

Live Demo

Watch V2 Join a Real Meeting

No scripts. No staging. AI Partner joins a live call, listens in real time, and responds.

Who Is This For?

Built for busy professionals

🎯 Entrepreneurs & Founders

Double-booked by default? AI Partner attends meetings you can't make, follows up on your behalf, monitors your market while you sleep. Your time is your revenue — stop wasting it on tasks that don't need you.

💼 Corporate CXO & Senior Leaders

Spend 6-8 hours daily in meetings most don't need your judgment — just your presence. Send the proxy. It attends, flags blockers against your OKRs, captures decisions, and reports back in minutes.

What V2 Adds

Everything V1 does. Plus this.

New

🎭

Inbox & Chat Proxy — With Your Approval

AI Partner drafts replies to your emails, Slack DMs, and Telegram messages in your style. Nothing sends on its own — every action on your behalf is gated by AuthorityPolicy and a one-tap approval, and AI involvement is always disclosed.

New

📹

Meeting Assistant — In the Room For You

Joins your Meet/Teams/Zoom calls with AI presence disclosed, listens, takes notes, and contributes only what you've authorized — then generates summaries and extracts action items automatically.

New

💡

Proactive Heartbeat Engine

V2 doesn't wait to be asked. A background loop evaluates what to do every 15-30 minutes using your goals, memory, and real-time context.

Enhanced

🔧

100+ Tools, 36 MCP Servers

From 17 servers in V1 to 36 in V2. Plus 4-tier computer use (DOM → Vision → Container Desktop → Host Control) with live CAPTCHA handoff.

New

📱

Phone Assistant — Voicemail + Outbound

Screens calls for you, then records, transcribes, and summarizes. Places approved outbound calls from your Twilio number — every call authorized by you, with AI disclosure on connect.

Enhanced

🧠

5-Layer Memory System

Episodic + Biographic + Counterparty + Vector + RAG. Understands who's who, what you've discussed, and maintains relationship context across all channels.

Enhanced

🤖

18+ LLM Providers

From 8 providers in V1. Now including NVIDIA NIM, Cerebras (~100k tokens/s), OpenRouter (100+ models), MiniMax, and LiteLLM Proxy.

New

📋

Authority Policy + HITL

Declarative rules define what the agent does automatically vs. what needs your approval. Telegram inline buttons for every gray-zone decision.

Built for Teams

AI you can actually roll out.

V2 is multi-tenant and governed from the core — the reason it's safe to deploy across a whole company.

👥

Multi-Tenant by Design

One install hosts your whole team. Every user's chats, files, memory, and screenshots are fully isolated — no cross-user leak, enforced at the data, event, and identity layers.

🛡️

Admin Console

A control room with a live activity feed, users & invites, an org audit log, per-user usage & cost, skill governance, knowledge sources, and agent access.

📋

Audit & Authority Policy

Every tool call, file write, and approval is logged. Declarative rules decide what runs automatically vs. what needs a human — per action, per relationship.

🔑

Invite-Only Access

No default logins. First registrant becomes admin; everyone else joins by single-use invite code. JWT auth, encrypted credential vault, AES-256 secrets.

🔌

Agent API (A2A)

Expose your agents to other systems over authenticated agent-to-agent calls. Service-account consumers, metered usage, CSV billing, hostile-agent detection.

📚

Org Knowledge Sources

Sync Notion, Slack, and Drive into a shared knowledge base on a schedule. Every member's agent answers grounded in your company's own documents.

Done-For-You Vertical

A real example: a bookkeeper that runs itself.

The same platform, pointed at one job. Email in → reconciled books → P&L emailed back. This is what "AI that works for your clients" looks like in production.

Client emails statements

Each client gets their own isolated tenant and a dedicated address. They email bank and credit-card CSVs — attachments are pulled in automatically.

The agent reconciles

The bookkeeper profile categorizes transactions, reconciles against prior balances, and flags anomalies — in a sandbox, building a live-formula workbook.

You approve

Before anything reaches the client, you get a one-tap human approval. Review the numbers, then release — nothing goes out on its own.

Client gets their P&L

A finished Excel workbook with live formulas and a zero-difference reconciliation lands back in the client's inbox. Month-end close, on autopilot.

Evolution

V1 vs V2 — What's Changing?

V1 is live and production-ready. V2 is the next chapter.

Capability	V1 — Available Now	V2 — Coming Soon
LLM Providers	8 providers	18+ providers including NVIDIA NIM, Cerebras
Tools / MCP Servers	100+ tools, 17 servers	100+ tools, 36 servers
Agent Profiles	16 agents	✓ Same + custom profiles
Inbox & Chat Proxy	—	Drafts on your behalf, you approve sends
Meeting Assistant	—	✓ Built & running — private beta Q2 2026
Phone Assistant	—	Call screening + approved outbound
Proactive Engine	—	Heartbeat + LLM agenda
Memory System	Vector + RAG	5-layer (episodic + biographic + counterparty)
Computer Use	Basic browser automation	4-tier (DOM → Vision → T3 Container → Host)
Skill Learning	✓ Automatic	✓ Enhanced + version control
Authority Policy	—	Per-action gating with Telegram approval
Self-Correction	✓ 3-retry semantic	✓ Enhanced with stuck detection

Early Access

Be First for V2

📅 Private Beta — Q2 2026 🔒 Not open-source — invitation only

Secure your spot now. Early access members get first invites, exclusive onboarding, and direct founder contact.

🔥 Spots are limited — once beta fills, waitlist closes.

The agent thatworks for you.

Built different. Works different.

Human-in-the-Loop

Live Browser Automation

Sandboxed Docker Execution

Document Generation

Skill Learning

Multi-Agent Delegation

Scheduler + Proactive Agenda

Vector Memory + RAG

17 Built-in MCP Servers

Goal in. Result out.

Goal Extraction

Task Decomposition

Parallel Execution

Criteria-Based Validation

Delivery + Checkpoint

16 specialists. Zero generic.

What it can actually do

Criteria-Based Completion

Dynamic Replanning

Checkpoint & Resume

Telegram as Control Panel

8 LLM Providers

Auto-Routing

Full Audit Log

Webhook + Event Triggers

AI Image Generation

Knowledge Base (RAG)

Security-First

Zero-Config Local Mode

Connected to everything

Real tasks. Real outcomes.

Daily Tech Briefing → Telegram

Live Stock Portfolio Snapshot

CAPTCHA-Protected Research

Automated Weekly GitHub Report

Skill Learning: Instant Second Run

An AI workforcefor your whole team.

Watch V2 Join a Real Meeting

Built for busy professionals

🎯 Entrepreneurs & Founders

💼 Corporate CXO & Senior Leaders

Everything V1 does. Plus this.

Inbox & Chat Proxy — With Your Approval

Meeting Assistant — In the Room For You

Proactive Heartbeat Engine

100+ Tools, 36 MCP Servers

Phone Assistant — Voicemail + Outbound

5-Layer Memory System

18+ LLM Providers

Authority Policy + HITL

AI you can actually roll out.

Multi-Tenant by Design

Admin Console

Audit & Authority Policy

Invite-Only Access

Agent API (A2A)

Org Knowledge Sources

A real example: a bookkeeper that runs itself.

Client emails statements

The agent reconciles

You approve

Client gets their P&L

V1 vs V2 — What's Changing?

Be First for V2

One command. Done.

See it run on your workflow

Your AI partner is ready.Are you?

The agent that
works for you.

An AI workforce
for your whole team.

Your AI partner is ready.
Are you?