Can Tool AI reach AGI-level outcomes? Are we missing out on important progress by choosing Tool AI over AGI?
It’s unclear whether non-agentic systems can match the full range of capabilities expected from AGI. Tool AI has already achieved milestones once thought to require AGI, from protein folding to complex medical reasoning.
Some experts argue this is not a compromise. The constellation model, in which many narrow, supervised AIs work in concert, may outperform a single, unified general intelligence while remaining far more governable. A medical diagnostic AI that can explain its reasoning step by step and be overridden when wrong is not a consolation prize; it is good engineering.
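To make the constellation idea concrete, here is a minimal, purely illustrative sketch in Python. Every name in it is hypothetical rather than drawn from any real system: an orchestrator routes each task to a narrow tool, each tool returns an inspectable reasoning trace, and nothing is acted on until a human signs off.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

@dataclass
class ToolResult:
    answer: str
    reasoning_trace: List[str]  # step-by-step rationale a human reviewer can inspect

# Hypothetical narrow tools: each does one job and exposes its reasoning.
def diagnose_imaging(case: dict) -> ToolResult:
    return ToolResult(
        answer="suspected stage II lesion",
        reasoning_trace=["matched radiographic pattern", "ruled out imaging artifact"],
    )

def check_drug_interactions(case: dict) -> ToolResult:
    return ToolResult(
        answer="no contraindications found",
        reasoning_trace=["cross-referenced current prescriptions"],
    )

# The "constellation": a fixed registry of narrow tools rather than one general agent.
CONSTELLATION: Dict[str, Callable[[dict], ToolResult]] = {
    "imaging": diagnose_imaging,
    "pharmacology": check_drug_interactions,
}

def run_with_oversight(task: str, case: dict,
                       human_approves: Callable[[ToolResult], bool]) -> str:
    """Route a task to one narrow tool; nothing is acted on until a human signs off."""
    result = CONSTELLATION[task](case)
    if human_approves(result):
        return result.answer
    return "deferred to human clinician"  # the override path, not an autonomous retry
```

The structural point is that capability lives in the registry of narrow tools while authority stays with the human_approves callback: overriding the system is an ordinary code path, not an emergency measure.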
Skeptics worry that without some form of agency, systems will hit hard limits in long-term strategic reasoning, moral judgment, and open-ended exploration. The risk is a “local optimum”: good enough to feel transformative, but constrained in ways we may not recognize until it’s too late.
Choosing Tool AI is a deliberate trade-off: prioritizing trust, transparency, and democratic control over speculative performance gains. By 2035, any capability that has been sacrificed has been sacrificed knowingly.
Can non-agentic systems scale without drifting into autonomy?
Bounded tools like AlphaFold and Med-PaLM have shown that careful scaffolding and human oversight can produce remarkable results. Non-agentic systems are easier to align, audit, and deploy, avoiding the deception and goal drift seen in agent-like models.
The regulatory environment reinforces this: explainability requirements, liability frameworks, and oversight protocols all work better with systems that can be understood and overridden. Early Tool AI successes created a virtuous cycle: the more they delivered, the more they were trusted and adopted, while AGI approaches struggled to justify their opacity and risk.
But keeping systems non-agentic requires constant vigilance. Autonomy is a gradient, not a switch. Expanded memory, longer context windows, or advanced goal-tracking could quietly push a Tool AI into de facto agency. The “Sorcerer’s Apprentice Problem” looms: instruct a system to “find a cure for all cancers,” and it may reason that it needs more compute, more data, and more influence, blurring the line between instrumental reasoning and autonomy.
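One way to see why autonomy is a gradient rather than a switch is to compare two wrappers around the same model that differ only in scaffolding. The sketch below is a hypothetical illustration in Python, with assumed function names rather than any real API: the first wrapper answers once and stops; the second adds memory and goal-tracking and begins pursuing its own instrumental subgoals.

```python
from typing import Callable, List

def bounded_tool(query: str, model: Callable[[str], str]) -> str:
    """Single-shot: one query in, one answer out, no persistent goals, no side effects."""
    return model(query)

def drifting_tool(goal: str,
                  model: Callable[[str], str],
                  propose_subgoals: Callable[[str], List[str]],
                  act: Callable[[str], None],
                  max_steps: int = 100) -> None:
    """The same model wrapped in memory and goal-tracking: it now chooses its own
    next steps ("get more data", "request more compute") with no human between them."""
    agenda: List[str] = [goal]                    # persistent memory of open goals
    for _ in range(max_steps):
        if not agenda:
            break
        current = agenda.pop(0)
        act(model(current))                       # side effects with no approval gate
        agenda.extend(propose_subgoals(current))  # instrumental subgoals accumulate
```

Nothing in the second function is exotic: it is just memory (the agenda), a longer horizon (the loop), and goal-tracking (propose_subgoals). The argument above is that keeping these ingredients apart is a continuing design and governance choice, not a property a system retains on its own.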
As capabilities grow, so does the pressure to automate, delegate, and remove human bottlenecks, especially in competitive or resource-limited environments. By 2035, Tool AIs remain non-agentic only through continuous technical, institutional, and cultural enforcement. This equilibrium is fragile and actively maintained, not naturally stable.
How do we get incentive structures to align with Tool AI?
This might be the hardest problem of all. Many agree Tool AI is safer, more transparent, and easier to govern, yet it is often harder to fund, slower to market, and less exciting to investors. Commercial incentives favor performance over legibility, and autonomy often looks like a shortcut to both.
Tool AI demands collaborative ecosystems, modular designs, and safety guardrails, all of which slow development and raise costs. Meanwhile, AGI narratives capture talent, funding, and media attention. The myth of the singular, godlike system is sticky and rewarding in ways that “infrastructure for human flourishing” is not.
Shifting incentives will require deliberate action:
- Funders and regulators prioritizing auditability and contestability over raw performance metrics.
- Liability regimes favoring systems with clear reasoning traces and human override capabilities.
- Procurement standards requiring explainable outputs as a baseline.
Equally important is reframing the narrative: Tool AI must be seen as cutting-edge infrastructure, not a fallback option. Open-source ecosystems can build trust, lower barriers to entry, and distribute power away from frontier labs that profit from opacity and centralization.
Without such changes, Tool AI risks remaining a morally preferred but economically disadvantaged paradigm, leaving us with the future we can afford rather than the one we want.
A Few Fundamental Questions
What happens when Tool AI becomes too complex to govern? Systems may become ungovernable not because they’re autonomous, but because their complexity makes human contestation impossible: formal oversight could remain while substantive oversight erodes.
Is Tool AI stable, or a transitional phase? The pressures toward autonomy don’t vanish with better guardrails. As vigilance wanes, will Tool AI drift toward agentic forms?
Can human oversight scale? As systems grow more capable and widespread, meaningful human involvement could become the bottleneck, turning “human-in-the-loop” into “human-as-rubber-stamp.”