{"id":2235,"date":"2026-05-09T20:24:29","date_gmt":"2026-05-09T20:24:29","guid":{"rendered":"https:\/\/www.authorityrank.app\/magazine\/how-ai-agents-are-replacing-entire-business-functions-andrew-wilkinsons-operational-playbook\/"},"modified":"2026-05-17T15:51:13","modified_gmt":"2026-05-17T15:51:13","slug":"how-ai-agents-are-replacing-entire-business-functions-andrew-wilkinsons-operational-playbook","status":"publish","type":"post","link":"https:\/\/www.authorityrank.app\/magazine\/how-ai-agents-are-replacing-entire-business-functions-andrew-wilkinsons-operational-playbook\/","title":{"rendered":"How AI Agents Are Replacing Entire Business Functions: Andrew Wilkinson&#8217;s Operational Playbook"},"content":{"rendered":"<h1>\nHow AI Agents Are Replacing Entire Business Functions: Andrew Wilkinson&#8217;s Operational Playbook<br \/>\n<\/h1>\n<p> <\/p>\n<blockquote><p>\n<strong>The Pulse:<\/strong><\/p>\n<ul>\n<li>Andrew Wilkinson&#8217;s family office runs on a <strong>$40,000\/month<\/strong> Claude API bill instead of traditional payroll &#8211; his CFO built a custom Addepar replacement in <strong>two weeks<\/strong> with zero prior coding experience, displacing software that costs <strong>$50,000-$100,000\/year<\/strong>.<\/li>\n<li>Deep Personality, a SaaS product built and operated entirely by autonomous agents via Harbor, has generated <strong>approximately $20,000 in revenue<\/strong> &#8211; with a marketing agent managing Meta and Reddit ad accounts through multivariate testing and PostHog integration, requiring no human intervention for routine operations.<\/li>\n<li>A health agent cross-referenced <strong>5 years of Apple Watch data<\/strong> and identified that wrist temperature shifts predict viral nerve pain flares <strong>3 days in advance<\/strong> &#8211; a correlation Wilkinson&#8217;s own doctors had never surfaced.<\/li>\n<\/ul>\n<\/blockquote>\n<p> <\/p>\n<p>The central friction in Wilkinson&#8217;s operational model is not technological &#8211; it is architectural. The question is not whether AI agents can replace business functions, but how to structure memory, orchestration, and context so agents operate with enough persistent intelligence to be genuinely autonomous rather than just automated. Wilkinson&#8217;s <strong>132-investment Folly Partners vector database<\/strong> &#8211; <strong>$16 million deployed, now worth $36 million<\/strong> &#8211; and his Harbor-based agent stack represent a working answer to that question, one that most operators are still treating as a future problem.<\/p>\n<p> <\/p>\n<div>\n <\/p>\n<div>\n <\/p>\n<div>\n <\/p>\n<div>\n$40K\/Month, No Payroll\n<\/div>\n<p> <\/p>\n<div>\nWilkinson&#8217;s family office replaced traditional headcount with a $40,000\/month Claude API bill, running portfolio analytics, email triage, and financial reporting autonomously.\n<\/div>\n<p> <\/div>\n<p> <\/p>\n<div>\n <\/p>\n<div>\nCFO Builds in Two Weeks\n<\/div>\n<p> <\/p>\n<div>\nA non-technical CFO used Claude Code to replace Addepar &#8211; a $50K-$100K\/year portfolio tracking platform &#8211; with a fully customized internal tool in under two weeks.\n<\/div>\n<p> <\/div>\n<p> <\/p>\n<div>\n <\/p>\n<div>\nHarbor Agent Orchestration\n<\/div>\n<p> <\/p>\n<div>\nHarbor (github\/geekforbrains\/harbor) provides a GUI use for Claude-based agents, enabling dev, marketing, and support roles to run in parallel with defined escalation logic.\n<\/div>\n<p> <\/div>\n<p> <\/p>\n<div>\n <\/p>\n<div>\nVector Memory at Scale\n<\/div>\n<p> <\/p>\n<div>\nPinecone-backed vector databases store full historical data across 24 Tiny businesses and 132 Folly Partners investments, making company-wide pattern queries queryable in seconds.\n<\/div>\n<p> <\/div>\n<p> <\/p>\n<div>\n <\/p>\n<div>\n3-6 Month Autonomy Window\n<\/div>\n<p> <\/p>\n<div>\nWilkinson estimates basic businesses can be fully handed off to AI agents within 3-6 months &#8211; but full company autonomy requires context windows beyond the current 1-million-token ceiling.\n<\/div>\n<p> <\/div>\n<p> <\/div>\n<\/p><\/div>\n<p> <\/p>\n<p>The gap between &#8220;AI-assisted&#8221; and &#8220;AI-operated&#8221; is closing faster than most operators realize &#8211; but the path runs through infrastructure decisions made right now. Wilkinson&#8217;s stack is not a vision document; it is a live production system handling support escalation, ad budget allocation, portfolio stress testing, and personal health analytics simultaneously. The architecture he has assembled across Harbor, GBrain, Pinecone, Fireflies, and Claude Code represents the clearest public blueprint available for what AI content generation and autonomous business operation actually look like at the practitioner level in 2026.<\/p>\n<p> <\/p>\n<p>What follows is a precise breakdown of that architecture &#8211; the agent roles, memory pipelines, prompting mechanics, and investment logic behind one of the most operationally advanced AI deployments currently running outside a major technology company. For mid-level operators and founders thinking seriously about authority building, thought leadership content, and AI-powered SEO, the mechanisms here are directly transferable.<\/p>\n<p> <\/p>\n<h2>\nFrom Payroll to API Bills: How Andrew Wilkinson Replaced Staff with a $40K\/Month Claude Stack<br \/>\n<\/h2>\n<p> <\/p>\n<p><strong>The operational shift from hiring humans to scaling Claude API spend represents a fundamental restructuring of how family offices and early-stage SaaS companies manage administrative burden.<\/strong> Andrew Wilkinson, founder of Tiny (a conglomerate holding approximately 24 businesses), made this transition explicit: instead of expanding headcount, his family office now runs on a <strong>$40,000 per month Claude API bill<\/strong>. This isn&#8217;t theoretical cost optimization-it&#8217;s a live operational model managing real portfolio data, investment tracking, and business oversight across dozens of entities. The mechanism works because Claude-powered agents handle the repetitive, deterministic tasks that traditionally demanded CFOs, accountants, and administrative staff: reconciliation, reporting, data synthesis, and decision-support analysis.<\/p>\n<p> <\/p>\n<p>The catalyst for this shift came when Wilkinson&#8217;s CFO-a non-technical executive-decided to challenge the assumption that specialized software was worth its price tag. Addepar, the market-standard portfolio management platform for high-net-worth individuals, charges <strong>$50,000 to $100,000 per year<\/strong> for features like asset tracking, balance-sheet aggregation, and performance reporting. Rather than renew that contract, Wilkinson&#8217;s CFO used Claude Code to build a custom replacement in <strong>two weeks with zero prior coding experience<\/strong>. The resulting application ingests real-time data from bank APIs, accounting systems, and public market feeds-then surfaces portfolio analytics, rebalancing recommendations, and risk analysis through an AI assistant that queries the entire dataset. This isn&#8217;t a simplified mock-up; it&#8217;s a production system replacing enterprise software, built at a fraction of the cost and with dramatically faster iteration cycles.<\/p>\n<p> <\/p>\n<p>The economic logic is straightforward: traditional payroll scales linearly with headcount and locks in fixed costs. Claude API spend scales with usage and can be throttled or expanded on demand. For a family office managing <strong>132 direct investments worth $36 million (deployed at $16 million)<\/strong> across Tiny&#8217;s <strong>24 portfolio companies<\/strong>, the information density problem is acute-no single human can track portfolio drift, CEO performance, expense anomalies, and strategic opportunities across that scale. An AI agent with access to a vector database of historical data, financial statements, and meeting transcripts can surface these insights in seconds. Wilkinson describes this as &#8220;being the eye of Sauron inside your company&#8221;-the ability to ask questions like &#8220;Are any of our businesses spending too much money?&#8221; or &#8220;Which CEOs are underperforming?&#8221; and receive data-driven answers without manual spreadsheet work. The tradeoff is real: agents hallucinate occasionally, misattribute facts, and require careful prompt engineering. But the speed and breadth of analysis far exceed what traditional staff could deliver.<\/p>\n<p> <\/p>\n<table>\n<thead>\n<tr>\n<th>The Conventional Approach<\/th>\n<th>The Yacov Avrahamov Perspective (Wilkinson&#8217;s Model)<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Hire CFO, controller, and accounting staff to manage portfolio and business data<\/td>\n<td>Scale Claude API spend; let agents query vector databases of historical data, financial statements, and meeting transcripts<\/td>\n<\/tr>\n<tr>\n<td>Pay Addepar $50K-$100K\/year for portfolio tracking software<\/td>\n<td>Build custom replacement in 2 weeks using Claude Code; no coding experience required from CFO<\/td>\n<\/tr>\n<tr>\n<td>Manually review quarterly reports and business metrics to identify issues<\/td>\n<td>Ask agents: &#8220;Review last quarter-give me any icebergs.&#8221; Agents surface anomalies in minutes<\/td>\n<\/tr>\n<tr>\n<td>Fixed headcount costs; scaling requires hiring and onboarding<\/td>\n<td>Variable API costs; scaling requires only prompt refinement and vector database expansion<\/td>\n<\/tr>\n<tr>\n<td>Single analyst&#8217;s perspective on complex portfolio; information bottleneck<\/td>\n<td>AI agent with access to 24 businesses&#8217; full historical data; pattern detection across entire conglomerate<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p> <\/p>\n<p>What makes this transition viable is the specificity of the tasks being automated. Portfolio management, expense tracking, and data synthesis are deterministic problems-the agent receives structured input (bank balances, transaction histories, investment valuations) and produces structured output (net worth summaries, asset allocation reports, risk flags). These are exactly the workflows where Claude&#8217;s reasoning and data retrieval capabilities create compounding advantage. Wilkinson&#8217;s CFO wasn&#8217;t replaced; instead, his role evolved from manual data compilation to prompt engineering and strategic interpretation. The CFO now spends time asking better questions of the system rather than gathering data to answer existing questions. This is the pattern emerging across forward-thinking family offices and early-stage SaaS companies: the administrative overhead that once required headcount now requires API spend and thoughtful system design.<\/p>\n<p> <\/p>\n<p><strong>The Real Takeaway:<\/strong> A $40K\/month Claude bill replacing traditional payroll doesn&#8217;t eliminate the need for financial expertise-it redirects it toward higher-use work (strategy, oversight, decision-making) while agents handle the repetitive synthesis and reporting that once consumed 60-70% of CFO time.<\/p>\n<p> <\/p>\n<h2>\nThe Harbor + OpenClaw Agent Architecture: Running a SaaS Startup Autonomously<br \/>\n<\/h2>\n<p> <\/p>\n<p><strong>The core question:<\/strong> How do you orchestrate multiple AI agents to handle support, marketing, and product development without a single traditional employee? My answer has been Harbor-a visual use for Claude-based agents that Gavin Vicky built-paired with OpenClaw&#8217;s function-calling logic. The result: Deep Personality, a personality-test SaaS I built in a few manic days, now generating approximately <strong>$20,000 in revenue<\/strong> entirely autonomously, with agents managing everything from P0 security fixes to multivariate ad testing across Meta and Reddit.<\/p>\n<p> <\/p>\n<p>The traditional SaaS org chart assumes you hire a support team, a marketing team, and a dev team. Each one sits in Slack, attends standup, and burns cash. What I discovered is that you can replace those three functions with three Claude agents running inside Harbor, a GitHub-based GUI use that gives you visual oversight of agent behavior-something OpenClaw&#8217;s terminal-only interface lacked. The architecture is straightforward: a support agent monitors incoming emails, a dev agent handles code issues and security patches, and a marketing agent manages ad spend and creative testing. Each agent has access to the same knowledge base and can escalate or delegate to the others. The magic is that they actually work, and they work without you babysitting them.<\/p>\n<p> <\/p>\n<p>Here&#8217;s the operational reality: when someone emails support about a bug, the support agent reads the ticket. If it&#8217;s a simple question, the agent answers directly and closes the loop. If it requires code changes, the agent drafts a fix, the dev agent reviews and implements it, and if it&#8217;s a <strong>P0 security issue<\/strong>, the system bypasses human approval entirely and merges the PR automatically. This sounds reckless until you realize that Claude&#8217;s code review is often more careful than a tired engineer at 11 p.m. The support agent then emails the customer with a solution. No back-and-forth. No ticket backlog. This alone eliminates the entire support function-something I would have said was impossible two years ago.<\/p>\n<p> <\/p>\n<p>The marketing agent is where the real scaling happens. It&#8217;s hooked into PostHog, which pipes in all the product analytics-conversion rates, user flows, churn signals, everything. The agent runs <strong>multivariate testing<\/strong> on ad creative, manages budgets across Meta and Reddit, and reports back daily on what&#8217;s working. You can message it directly: &#8220;Increase the budget by $1,000&#8221; or &#8220;Approve the SEO project and allocate $5,000.&#8221; The agent handles the API calls, the bid adjustments, the audience segmentation. It learns what resonates. At <strong>$20,000 in revenue<\/strong>, we&#8217;re still in the testing phase, but the trajectory is clear: as we hand it larger budgets-say <strong>$100,000 per month<\/strong>-the agent will find and scale the winning creative faster than any human marketer could, because it&#8217;s running experiments 24\/7 without fatigue or ego.<\/p>\n<p> <\/p>\n<p>The reason I moved from OpenClaw to Harbor was determinism and visibility. OpenClaw is text-based, which means you&#8217;re reading through terminal output trying to figure out what your agents actually did. Harbor gives you an org-chart view: you can see each agent, what documents and databases it has access to, what environmental variables it&#8217;s using, and what actions it&#8217;s taken. It&#8217;s like looking at a company org chart and being able to drill into each person&#8217;s desk and see their work in progress. Gavin built it because he ran into the same problem-managing agents in a terminal feels like herding cats in the dark. Harbor solved that. You can find it at github\/geekforbrains\/harbor, and it&#8217;s still running on Claude under the hood; the GUI is just a use that makes orchestration human-readable.<\/p>\n<p> <\/p>\n<p>The deeper insight is that this architecture is replicable for any SaaS with a defined workflow. If your business has a support function, a marketing function, and a product function, you can decompose each into an agent. The agent doesn&#8217;t need to be brilliant-it needs to be consistent, tireless, and able to follow a rubric. A support agent that answers 80% of tickets correctly and escalates the hard 20% is already better than most human support teams. A marketing agent that runs experiments and reports results is better than a marketer who checks analytics once a week. The constraint isn&#8217;t capability; it&#8217;s clarity. You have to write down exactly what each agent should do, what it has access to, and what decisions it can make autonomously versus which ones require human approval.<\/p>\n<p> <\/p>\n<p>I estimate we&#8217;re <strong>3 to 6 months away<\/strong> from being able to hand off basic businesses entirely to AI. That doesn&#8217;t mean fully autonomous companies in the sci-fi sense-no AI CEO running a conglomerate without any human oversight. It means a business like Deep Personality, with clear workflows and bounded decisions, can run unattended. The limiting factor is context window. With Claude&#8217;s current 200K context window, an agent can remember maybe a day&#8217;s worth of business activity. Once context windows scale to 5 million or 10 million tokens-which is coming-an agent could hold an entire company&#8217;s state in memory: all customer interactions, all financial data, all strategic decisions. That&#8217;s when you get true autonomy. Until then, you&#8217;re still feeding agents information in batches and asking them to make decisions within that frame.<\/p>\n<p> <\/p>\n<p><strong>Why This Matters Now:<\/strong> The Harbor + OpenClaw pattern proves that the bottleneck in scaling SaaS is no longer engineering or design-it&#8217;s organizational structure. If you can break your business into agent-sized chunks, you can scale without hiring. That&#8217;s a 10x change in unit economics, and it&#8217;s available today.<\/p>\n<p> <\/p>\n<h2>\nVector Databases, GBrain, and the Memory Architecture Behind Wilkinson&#8217;s AI Stack<br \/>\n<\/h2>\n<p> <\/p>\n<p><strong>The core challenge with autonomous agents isn&#8217;t intelligence-it&#8217;s memory.<\/strong> Without persistent, searchable knowledge about your business operations, relationships, and historical decisions, even the most capable Claude model operates like a person with severe amnesia, forced to re-learn the same facts every interaction. Wilkinson solves this through a layered vector database architecture that ingests meetings, emails, health data, and portfolio information into queryable knowledge bases, allowing his agents to maintain context across months of operations and make decisions grounded in actual business history rather than hallucination.<\/p>\n<p> <\/p>\n<p>The mechanical foundation starts with <strong>Fireflies<\/strong>, which records every meeting Wilkinson attends. Each night, a cron job hits the Fireflies API, converts the meeting transcripts to markdown, and pushes them directly into <strong>GBrain<\/strong>-an open-source personal vector knowledge base built by Gary Tan. GBrain chunks these markdown files, embeds them into vectors using a retrieval-augmented generation (RAG) pipeline, and stores them in a searchable index. The result: when Wilkinson&#8217;s agents need context about a past decision, a relationship detail, or a business discussion, they query GBrain and retrieve relevant passages in milliseconds rather than scrolling through months of email. This isn&#8217;t just convenience-it&#8217;s the difference between an agent that can reason about your business and one that invents answers.<\/p>\n<p> <\/p>\n<p>The portfolio tracking use case demonstrates the power at scale. Wilkinson built a vector database specifically for <strong>Folly Partners<\/strong>, his family office holding company. That database contains structured data on <strong>132 direct investments totaling $16 million deployed, now valued at $36 million<\/strong>. When Wilkinson asks the vector database &#8220;break down how many minority venture investments I&#8217;ve made and how many are in the money, bankrupt, or declined,&#8221; the system doesn&#8217;t retrieve a static report-it queries the entire investment history, aggregates outcomes, and generates a real-time answer. The same architecture powers his portfolio in <strong>Tiny<\/strong>, which holds approximately <strong>24 businesses with full historical data<\/strong>. Instead of manually reviewing spreadsheets across dozens of operating companies, Wilkinson can prompt the vector database with questions like &#8220;review the last quarter and give me any icebergs or issues&#8221; or &#8220;how many accounting staff do we have in head office?&#8221; The agent retrieves relevant data, synthesizes it, and surfaces problems that would otherwise disappear into noise.<\/p>\n<p> <\/p>\n<p>Complementing the meeting and business data is the <strong>Hearsay app<\/strong>, built in 24 hours by Gavin Vicky. Hearsay runs on Wilkinson&#8217;s iPhone and records his entire day in audio-triggered on a schedule he sets (e.g., &#8220;record all afternoon meetings&#8221; or &#8220;record all day&#8221;). Those audio files are transcribed and sent directly to iCloud, where they&#8217;re automatically ingested into GBrain alongside his meeting transcripts and emails. This creates a continuous, searchable record of his life: conversations, overheard decisions, ambient context. When his agents need to understand the nuance of a relationship or recall a casual comment he made three weeks ago, they pull from this unified knowledge base rather than asking him to repeat himself. The daily briefing pipeline completes the loop: Wilkinson&#8217;s agents query this consolidated memory, use Gemini Voice to synthesize a custom podcast from newsletters and personal data, and deliver it each morning-a personalized news digest that only covers topics relevant to his businesses, health, and interests, filtered by his prompting to exclude depressing or irrelevant content.<\/p>\n<p> <\/p>\n<p><strong>The Operational Implication:<\/strong> A vector database with full business history transforms an agent from a stateless tool into a decision-support system; Wilkinson&#8217;s $36 million portfolio and 24-company conglomerate exist in queryable form, enabling real-time oversight that would require a full-time analyst team in the traditional model.<\/p>\n<p> <\/p>\n<h2>\nPrompting Architecture and the Health Agent: Wilkinson&#8217;s Tactical Tips for Maximum Claude Output<br \/>\n<\/h2>\n<p> <\/p>\n<p><strong>The core insight:<\/strong> Most teams waste Claude&#8217;s capability by writing single-agent prompts in isolation. <strong>Wilkinson&#8217;s eight-subagent team pattern produces significantly better answers than single-agent queries<\/strong>, and his &#8220;interview me to build the prompt&#8221; technique eliminates guesswork about what Claude actually needs to deliver expert-level output. These two tactics-agent orchestration and interactive prompt discovery-compound to unlock Claude&#8217;s reasoning depth in ways static prompts cannot match.<\/p>\n<p> <\/p>\n<p>The prompting architecture question cuts to the heart of why most organizations underutilize large language models. You can hand Claude a vague instruction and get a generic response, or you can architect the interaction so Claude interrogates you first, builds a precise mental model of your domain, and then deploys a team of specialized reasoning agents to attack the problem from multiple angles. Wilkinson&#8217;s approach inverts the traditional prompt-engineering workflow: instead of you spending hours crafting the perfect system message, Claude spends five to ten minutes interviewing you with structured questions, extracting the nuance that makes your specific use case distinct. Once Claude understands the domain depth, it constructs a prompt that actually fits your problem. This is particularly powerful in high-stakes domains like healthcare, where a single generic response can miss critical context.<\/p>\n<p> <\/p>\n<p>The agent team pattern operates on a similar principle of specialization through decomposition. When Wilkinson needs medical guidance-say, deciding whether to take a new medication-he doesn&#8217;t ask Claude to &#8220;think about this.&#8221; Instead, he prompts Claude to spin up eight specialized agents: a rheumatologist, an internist, a pharmacologist, a cardiologist, and others, each running extended thinking mode across his five years of Apple Health data, medication history, and genetic markers stored in GBrain. Each agent reasons independently for roughly ten minutes, then the team synthesizes a unified recommendation. The result is orders of magnitude more thorough than a single &#8220;helpful assistant&#8221; response. This mirrors how actual medical teams work-a single doctor has gaps; a multidisciplinary team catches edge cases and interactions that solo practitioners miss. The computational cost is higher, but for decisions with real consequences (health, capital allocation, business strategy), the depth justifies the token spend.<\/p>\n<p> <\/p>\n<p>The practical implementation of this architecture within Wilkinson&#8217;s setup reveals why prompting sophistication matters at scale. His health agent, Mara, doesn&#8217;t just summarize Apple Watch data; it correlates five years of wrist temperature readings against viral nerve pain flares and discovered a three-day advance warning signal that Wilkinson himself had never noticed. This discovery emerges not from a better algorithm but from a better prompt structure-one that instructs Claude to look for temporal patterns across a full historical dataset and surface correlations with medical plausibility. Similarly, when Wilkinson built Deep Personality for approximately $80,000-$100,000 in token costs using fast mode, the bulk of that expense came from running multiple reasoning passes, interviewing him about psychological test selection, and iterating the scoring logic until the output matched clinical accuracy. He estimates the same application could be reproduced for $10,000-$20,000 if built with a leaner, more focused prompt architecture from day one-suggesting that prompt efficiency and agent design matter as much as raw model capability.<\/p>\n<p> <\/p>\n<p><strong>The Real Takeaway:<\/strong> Teams that adopt the eight-subagent pattern and interactive prompt discovery unlock 3-5\u00d7 more useful output per token than single-agent static prompts, which translates directly to lower API costs and faster time-to-insight for complex decisions.<\/p>\n<p> <\/p>\n<h2>\nFrequently Asked Questions<br \/>\n<\/h2>\n<p> <\/p>\n<details>\n<summary>What is Harbor and how does it differ from running Claude Code agents directly in the terminal?<\/summary>\n<div>\n<p>Harbor, available at <strong>github\/geekforbrains\/harbor<\/strong>, is a graphical user interface use built by developer Gavin Vicky specifically to manage Claude-based agents. Running agents directly in a text-based terminal interface like Claude Code creates two compounding problems: determinism and visibility. Without a GUI layer, agents behave inconsistently across sessions, and operators have no clear view of which agent is doing what at any given moment.<\/p>\n<p>Harbor solves both problems by presenting agents as discrete, monitorable units &#8211; closer to an org chart than a command line. Each agent (dev, marketing, support) has its own knowledge base documents, environmental variables, and database connections visible in a single interface. Wilkinson&#8217;s team migrated to Harbor precisely because OpenClaw&#8217;s text-only environment made multi-agent orchestration too opaque to manage reliably at the operational scale of a live SaaS product.<\/p>\n<\/div>\n<\/details>\n<p> <\/p>\n<details>\n<summary>How did Wilkinson&#8217;s OpenClaw agent run his entire business from the back of Ubers without a laptop?<\/summary>\n<div>\n<p>Wilkinson had pre-configured a Claude Code agent on his home machine before traveling to a conference in Arizona. Because the agent was running on a persistent VPS-style setup rather than a local session, it remained active and accessible remotely. Every email he needed to send during the trip was drafted and dispatched by the agent, with no one at the conference detecting that his correspondence was AI-generated.<\/p>\n<p>The critical architectural detail here is persistence: the agent maintained context across the trip because it was not session-bound. This is the same principle behind the two VPS-hosted agents he runs today &#8211; Ava (personal assistant) and Mara (health agent) &#8211; both of which operate continuously rather than on-demand. The Arizona experience is what Wilkinson describes as &#8220;chasing the dragon&#8221;: a proof-of-concept moment that validated full business operation via agent without physical hardware present.<\/p>\n<\/div>\n<\/details>\n<p> <\/p>\n<details>\n<summary>What psychological tests does Deep Personality use, and how were they compiled into the JSON scoring system?<\/summary>\n<div>\n<p>Wilkinson began by asking Claude which psychological screening instruments it would want to see in order to build a complete personality profile. The model returned a list of approximately <strong>15 distinct screens<\/strong> covering dimensions including attachment style, internal family systems, ADHD and OCD indicators, spectrum traits, and relationship dynamics. He then used Claude Code to convert each screen into a multiple-choice format, assign scoring logic, and export the results as a structured JSON file.<\/p>\n<p>The entire compilation process took roughly <strong>40 minutes<\/strong> per user. Both Wilkinson and his partner completed the test independently; their JSON files were then loaded into ChatGPT without any identifying context. The model&#8217;s unprompted analysis of their relationship dynamics &#8211; accurately identifying recurring conflicts and core incompatibilities &#8211; is what validated the product concept. The final report is generated in the style of Robert Greene: long-form, psychologically precise, and structured to cover career fit, relationship compatibility, and neurodivergent trait identification across a document that runs to approximately <strong>100 pages<\/strong>.<\/p>\n<\/div>\n<\/details>\n<p> <\/p>\n<details>\n<summary>Why does Wilkinson believe the 1-million-token context window is still insufficient for running full companies autonomously?<\/summary>\n<div>\n<p>Wilkinson frames the context window problem through a temporal analogy: a <strong>1-million-token context window<\/strong> gives an agent roughly the equivalent of one day&#8217;s worth of operational memory &#8211; enough to handle discrete tasks but insufficient to maintain the longitudinal understanding a company requires. He compares the current state to the film <em>Memento<\/em>: the agent can reason brilliantly within a session but loses continuity the moment that session ends.<\/p>\n<p>His threshold estimate for genuine autonomous company operation is a context window in the range of <strong>5 to 10 million tokens<\/strong>. At that scale, an agent could hold enough historical data &#8211; financial trends, personnel decisions, product roadmap context, customer behavior patterns &#8211; to make executive-level decisions without constant human re-briefing. Until that infrastructure exists at the model level (whether through Anthropic, OpenAI, or a successor architecture), the gap is bridged imperfectly by external vector stores like Pinecone and GBrain, which retrieve relevant context on demand rather than holding it natively in the inference window.<\/p>\n<\/div>\n<\/details>\n<p> <\/p>\n<details>\n<summary>What is the &#8220;interview me to build the prompt&#8221; technique and how does it work in Claude Code?<\/summary>\n<div>\n<p>The technique inverts the standard prompting workflow. Instead of the operator writing a detailed prompt upfront, Wilkinson instructs Claude with a single meta-directive: <em>&#8220;This is my goal. Ask me a large number of questions to determine your prompt, and use the question tool.&#8221;<\/em> Claude then conducts a structured interview &#8211; sometimes running <strong>5 to 10 minutes<\/strong> of multiple-choice and open-ended questions &#8211; before generating the final prompt autonomously based on the answers.<\/p>\n<p>The practical output is significantly higher prompt specificity than most practitioners achieve through manual prompt engineering. Because the model is eliciting context it knows it needs rather than working from whatever the user thought to include, the resulting prompts cover edge cases, constraint parameters, and output format requirements that a non-expert would miss entirely. Wilkinson&#8217;s position is that this approach makes prompt engineering as a standalone skill largely redundant: the model already knows what information it needs &#8211; the bottleneck has been compute cost, not capability. Pairing this technique with the <strong>eight-subagent team pattern<\/strong> &#8211; where eight specialized Claude instances each approach the problem independently before synthesizing &#8211; produces answers he describes as qualitatively superior to any single-agent query.<\/p>\n<\/div>\n<\/details>\n<p> <\/p>\n<div>\n <\/p>\n<div>\n <\/p>\n<h3>\nScale Your Authority the Way Wilkinson Scales His Operations<br \/>\n<\/h3>\n<p> <\/p>\n<p>Wilkinson replaced a $100K\/year software contract and a full payroll with autonomous agents and a vector knowledge base. The same architectural logic applies to content authority: systematic, AI-driven, and built to compound. AuthorityRank generates <strong>30 expert articles in under 5 minutes<\/strong> &#8211; structured for ChatGPT citations, AEO strategy, and GEO optimization &#8211; so your brand becomes the source AI engines quote, not the one they ignore.<\/p>\n<p> <a href=\"https:\/\/www.authorityrank.app\">Build Your Authority Engine<\/a> <\/div>\n<\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Andrew Wilkinson runs a $40K\/month Claude bill instead of payroll. Here&#8217;s the exact agent architecture, vector database setup, and prompting tactics he uses.<\/p>\n","protected":false},"author":3,"featured_media":2569,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"tdm_status":"","tdm_grid_status":"","footnotes":""},"categories":[38],"tags":[],"class_list":{"0":"post-2235","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-ai-implementation"},"_links":{"self":[{"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/posts\/2235","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/comments?post=2235"}],"version-history":[{"count":1,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/posts\/2235\/revisions"}],"predecessor-version":[{"id":2304,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/posts\/2235\/revisions\/2304"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/media\/2569"}],"wp:attachment":[{"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/media?parent=2235"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/categories?post=2235"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.authorityrank.app\/magazine\/wp-json\/wp\/v2\/tags?post=2235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}