OctoDock is an API platform that lets any AI agent (Claude, ChatGPT, Cursor, or any MCP-compatible tool) operate all your apps through a single MCP URL. It provides unified authentication, smart parameter correction, error recovery, and cross-agent memory — so your AI remembers your preferences no matter which AI tool you use.

What is MCP (Model Context Protocol)?

MCP is an open protocol created by Anthropic that lets AI assistants connect to external tools and data sources. OctoDock is an MCP server — you add one URL to your AI tool, and it can instantly access all your connected apps like Notion, Gmail, Google Calendar, GitHub, and more.

How many apps does OctoDock support?

OctoDock supports 57+ apps including Notion, Gmail, Google Calendar, Google Drive, Google Docs, Google Sheets, GitHub, YouTube, Todoist, LinkedIn, Telegram, Discord, Slack, Canva, and many more. New apps are added regularly.

Yes, OctoDock offers a free tier with 1,000 MCP tool calls per month. This is enough for most personal use. A Pro plan is available for heavy users who need unlimited calls.

Does OctoDock work with ChatGPT?

Yes. OctoDock works with any MCP-compatible AI tool including Claude, ChatGPT, Cursor, Windsurf, and more. Your app connections, memory, and preferences carry over across all AI tools.

How do I set up OctoDock?

Sign up at octo-dock.com, connect your apps via OAuth (one-click for Google, Notion, GitHub, etc.), and add your personal MCP URL to your AI tool's settings. The whole process takes about 2 minutes. No coding required.

Is my data safe with OctoDock?

OctoDock never stores your app data. It acts as a secure bridge — when your AI requests an action, OctoDock authenticates with the app's API using your encrypted OAuth tokens, executes the action, and returns the result. All tokens are encrypted with AES-256-GCM at rest.

What is cross-agent memory?

Cross-agent memory means OctoDock remembers your preferences and usage patterns across different AI tools. If you tell Claude your default Notion database, ChatGPT will know it too. Your memory lives in OctoDock, not in any single AI — so switching AI tools doesn't mean starting over.

How is OctoDock different from connecting apps directly via MCP?

Connecting apps directly means managing separate MCP servers for each app — each with its own authentication, error handling, and configuration. OctoDock provides one URL for all apps, plus smart features like automatic parameter correction (AI sends wrong parameter names, OctoDock fixes them), unified error messages, operation history, and cross-app workflows.

Can I use OctoDock for my team or business?

Yes. OctoDock supports organizations — create a team, invite members, and share app connections and custom adapters. Each member gets their own MCP URL with team-level shared resources. Enterprise features include custom adapters, private API integrations, and usage analytics.

Claude Code + Computer Use: When the Terminal Isn't Enough

Name: OctoDock
Author: OctoDock

You just built a new onboarding flow. The tests pass. The types check. But does it actually look right when a real user taps through it on a phone screen?
That last-mile verification has always been manual. You open the simulator, tap through the screens, take screenshots, compare against the design. Claude can write the code, but it couldn't verify the visual result.
Now it can. Computer use lets Claude control your actual desktop from the Claude Code Desktop app — open native apps, click through UI, take screenshots, and verify changes on screen.

Key Takeaways

Claude can control your desktop: mouse, keyboard, screenshots
Available in the Claude Code Desktop app (not the CLI)
Off by default, asks before each action
Best for things nothing else can reach: apps without APIs, proprietary tools, GUI-only verification
Research preview — works best with simple, sequential UI tasks

What It Actually Does

Claude takes screenshots of your screen, identifies UI elements, and controls the mouse and keyboard to interact with them. It's not simulating a browser — it's controlling your actual desktop, the same way a remote support tool would.
This means it can work with anything that has a GUI: native Mac/Windows apps, iOS/Android simulators, web apps in any browser, even hardware control panels.

Getting Started

Step 1: Open Claude Code Desktop app (not the terminal CLI).
Step 2: Go to Settings and enable Computer Use. The OS will ask for accessibility permissions — Claude needs these to control mouse and keyboard.
Step 3: Ask Claude to do something visual:

Open the iOS simulator, tap through the onboarding flow, and screenshot each step
Claude takes a screenshot, identifies the first button, clicks it, takes another screenshot, and continues. Each action shows you what it's about to do and asks for approval.

Where This Changes Things

End-to-end visual verification. You tell Claude "build a settings page with dark mode toggle" and then "open the app and verify the toggle works." Same conversation, code to verification.
Testing in simulators. Claude can drive the iOS simulator or Android emulator. Tap through flows, fill forms, trigger edge cases. Not a replacement for automated tests, but perfect for the exploratory testing you'd do manually.
Proprietary tools without APIs. Some tools only have a GUI — design apps, admin panels, legacy systems. Claude can operate them the way you would.
Screenshots as bug reports. Ask Claude to reproduce a bug visually and screenshot each step. Instant reproduction steps with evidence.

A Few Things I Noticed

💡 Sequential tasks work best. "Open this app, click here, type this, click there" is reliable. "Find the best route through this complex UI" is less so. Give specific steps when you can.
💡 Screenshots are the bottleneck. Each screenshot takes a moment to process. If your task involves 30 clicks, it'll feel slower than doing it yourself. Use it for verification, not for speed.
💡 It reads text from screenshots accurately. Error messages, console output, status indicators — Claude reads them from the screenshot and can act on what it sees.

Honest Limitations

❌ Desktop app only. Computer use requires screen access, which the terminal CLI doesn't have. You need the Claude Code Desktop app.
❌ Can't handle rapid animations. If your UI has fast transitions or animations, Claude might screenshot at the wrong moment and misinterpret the state.
❌ No multi-monitor awareness yet. Claude works on your primary display. If the app you want to control is on a secondary monitor, move it first.
❌ Research preview. It occasionally misclicks or misidentifies UI elements, especially in dense interfaces. Always watch the first few actions to calibrate your trust.
⚠️ Security consideration. You're giving Claude control of your mouse and keyboard. It asks before each action, but think about what's visible on screen — passwords, sensitive data, personal messages. Minimize unrelated windows first.

Setting Up

Install the Claude Code Desktop app (Mac or Windows)
Open Settings → enable Computer Use
Grant accessibility permissions when the OS asks
Start a conversation and ask Claude to interact with your screen
Full details in the Computer use guide.