Orchestrating Sovereign AI Operations with Claude CLI and the Model Context Protocol (MCP)
Architect: Daniel Brody · Yom Shore Ventures LLC dba CTO Rescues · June 11, 2026
As organizations scale, executive cognitive load increases non-linearly. Fractional CTOs, project managers, and venture operators are frequently required to perform manual data routing tasks — extracting action items from email, auditing project boards, and cross-referencing calendar schedules. This perpetual manual synchronization introduces significant cognitive context-switching overhead, operational latency, and systemic tracking omissions. When multiple high-velocity initiatives are managed concurrently, the traditional human-in-the-loop PMO model often fails to scale effectively.
To address this bottleneck, the Autonomic PMO was engineered by Daniel Brody, Principal Architect at CTO Rescues. Constructed upon the open-source Model Context Protocol (MCP), this architecture functions as a state-aware, sovereign AI collaborator. Operating locally within a macOS environment, it executes complex, multi-system operational loops across remote server environments — integrating WhatsApp, OpenProject, AWS S3, and Google Workspace to convert unstructured daily communications into structured, verifiable execution paths. The resulting pipeline reconciles raw conversational data with database states, operating entirely within a secure, self-hosted boundary.
Standalone web-based chat interfaces and autocomplete plugins are fundamentally limited because they operate as contextual silos. Without continuous, secure access to live enterprise systems, an AI assistant remains isolated from the active state of the business — inducing hallucinations, manual copying errors, and a lack of proactive oversight.
Dynamic calendar modifications, meeting durations, blocked focus time, and scheduled project milestones.
Active coordination threads, client commitments, and team action items buried in unstructured chat.
Structured execution milestones, work package states, time logs, and sprint progress boards.
Remote cloud storage, audio recordings, meeting transcripts, and shared document libraries.
To resolve these limitations, a hybrid local-to-remote architecture was designed — positioning low-latency, privacy-centric interfaces (the Claude CLI and local stdio bridges) on a local macOS workstation, while offloading high-compute workloads to a remote, containerized AWS EC2 instance. This decoupling ensures that sensitive credentials and session keys remain within the physical boundary of the local workstation, while heavy media processing and database transactions are handled elastically on secure cloud nodes.
This structural separation respects the Principle of Least Privilege. By containing authentication states — local OAuth tokens and Go-socket session keys — to the physical macOS workstation, the external attack surface of the remote container infrastructure is minimized. No static credentials reside on the remote server, preventing the single-point-of-failure breaches common in fully cloud-integrated SaaS orchestrators.
The source code in dzbrody/claude-assistant-config is structured to decouple local prompt instructions from remote infrastructure configurations — allowing system administrators to modify system prompts locally without rebuilding or redeploying remote containers.
The division between local and remote boundaries enforces clean modularity. The Configuration Domain (.claude/) is maintained locally — system prompts are structured markdown files that run on top of standard CLI execution engines. The Server Domain (mcp-servers/) is an independent service container running behind a secure reverse-proxy layer on AWS, processing complex tasks away from the local workstation.
To maintain high throughput on standard cloud hardware such as a modest AWS t3.large instance, several key performance optimizations were implemented.
Standard audio transcription models consume significant memory and CPU cycles, often leading to Out-Of-Memory (OOM) failures on entry-level cloud instances. To resolve this, the remote container utilizes faster-whisper with 8-bit integer quantization (INT8).
CTranslate2 optimizes transformer models by mapping 32-bit floating-point weights (FP32) to 8-bit signed integers (INT8). The quantized value q for each weight x is computed as:
where x is the original weight, S is the scale factor, Z is the zero-point offset, and q is the resulting quantized value. This compression reduces memory bandwidth constraints by approximately 75% while maximizing CPU SIMD instruction efficiency via AVX2 and AVX-512 register operations — enabling parallel matrix multiplications without GPU compute layers. The result: transcription at up to 4× real-time speeds on a standard CPU, keeping monthly infrastructure costs at approximately $5.00/month while guaranteeing processing isolation for sensitive voice memos, stakeholder briefings, and meeting recordings.
This section outlines the deployment methodology required to replicate this architecture using the dzbrody/claude-assistant-config repository.
Clone the codebase onto the local macOS workstation.
Authorize the Claude CLI to communicate with Google Workspace, the local filesystem, and chat accounts.
Edit ~/.claude/settings.json to configure: google-workspace (calendar & email search), whatsapp-bridge (local authenticated Go socket), document-loader (DOCX, XLSX, PDF via markitdown-mcp).
Provision the secure EC2 instance and S3 buckets using the included Terraform configurations. The Terraform files establish an isolated VPC, private subnets, security groups, and an IAM instance profile — permitting the EC2 instance to read S3 objects on-demand without hardcoded access keys.
Connect the local macOS workstation to the remote EC2 instance without exposing port 39128 to the public internet. This script wraps SSH port-forwarding over an AWS Systems Manager session — traffic is routed through an encrypted WebSocket tunnel authenticated by IAM, eliminating the need for open SSH (port 22) or API (port 39128) ports and reducing the host's external attack surface to zero.
Once configured, the local Claude CLI terminal serves as a unified, command-driven operations center. Running /pmo-menu parses the modular instruction files in .claude/commands/ and displays the available workspace utilities:
/pmo-track-timeUpon executing /pmo-track-time, Claude queries Google Calendar for the current day's events, compares event durations against the active task list, and automatically logs those hours to corresponding work packages in OpenProject. The system parses calendar subjects, fuzzy-matches string sequences against active work package titles, calculates time deltas of calendar blocks, and logs matching hours directly — fully automating the time-tracking process.
/pmo-track-time/ingest-meeting-notesWhen an automated summary arrives from gemini-notes@google.com, it typically contains unstructured, passive descriptions of action items. Running /ingest-meeting-notes initiates a structured sequence:
Identified names are matched against the active OpenProject team directory to route tasks to the correct assignees.
Project context is parsed to categorize items under administration, engineering, or operations boards.
A semantic deduplication check runs against active packages before issuing the task creation payload — no redundant work items are created.
Sensitive IP, corporate agreements, and calendar events are processed exclusively within an AWS VPC and the local macOS filesystem — bypassing public training loops entirely.
Workflows like /ingest-meeting-notes and /pmo-dedup bypass web UI page loads entirely — completing in seconds via direct API transactions.
Local stdio bridges + lightweight quantized remote models keep monthly infrastructure fixed at ~$5.00/month, eliminating unpredictable metered commercial API costs.
Daniel Brody, MBA, is a fractional CTO, enterprise architect, and venture operator specializing in high-integrity software platforms and secure, sovereign AI integrations. As the founder of CTO Rescues, Daniel helps fast-growing companies and national entities align complex technical architectures with strict regulatory requirements.
If your organization is looking to replace manual, fragmented workflows with high-efficiency, sovereign AI systems, contact CTO Rescues to schedule a consultation.
Repository: github.com/dzbrody/claude-assistant-config
Release: v2.0 (Agentic Execution Update) · Date: June 11, 2026
Copyright © 2026 Yom Shore Ventures LLC dba CTO Rescues. All Rights Reserved. · 8 The Green – Suite 12060, Dover, Delaware 19901, United States
The Autonomic PMO is an open-source blueprint for executives and operators who need to eliminate manual data routing without sacrificing data privacy. Fork the repo or bring Daniel Brody in to architect a custom deployment.
Start the Conversation Fork on GitHub