← All issues
The Illusion of Agentic Autonomy

The Illusion of Agentic Autonomy

· By Mansa Muhammad

The industry is currently caught in a cycle of misplaced confidence regarding the integration of AI agents into the software development lifecycle. As noted in The Eternal Sloptember, the adoption of these agents into development workflows may represent one of the most costly mistakes in the history of the field.

The core issue is a fundamental misunderstanding of what these systems are. Agents are not programmers; they are highly sophisticated statistical models designed to mimic the distribution of programming. While they can solve math problems that exceed human capability, their output in software engineering is broken. The danger lies in the fact that as these models become more accurate, the errors become harder to detect.

The current workflow with agents often resembles a slot machine. The agent provides an initial burst of progress, but the developer is left pulling a lever in the hope of achieving polish. This pattern was observed during attempts to write parts of tinygrad and to reverse a USB <-> PCIe chip with agents. In these instances, the manual approach remained faster and more effective for reaching a finished state.

The utility of AI is not in dispute. These models serve as a superior search tool for most queries and provide extreme speed when a quick prototype is required and polish is not a priority. However, they do not meet the bar for a software engineer.

The push toward agentic systems may be driven by a desire to leverage fear of loss to move large organizations. There is a risk that this movement is a psyop designed to sell agents, leading large organizations into a significant error. While tools like AFL can find more bugs than LLMs without threatening the status of the engineer, the current trajectory toward trusting "robot associates" to clean up code lacks the necessary foundation of reliability.

Consider whether your current integration of agentic tools is actually accelerating your delivery, or if it is simply frontloading progress only to increase the cost of debugging.

Subscribe to The Mansa Report

Strategic intelligence on AI, business building, and the future of technology. Delivered Monday through Friday.