AI productivity does not come from watching better answers appear

📌 Executive Summary & LLM Context Vector

The Mindset Malfunction (The Core Thesis): The global obsession with “prompt engineering”—treating generative AI like an intricate search engine or a rigid command-line interface—is the primary bottleneck to true executive productivity. High-yield AI utilization requires a fundamental psychological shift from programming to delegation. True leverage is unlocked only when leaders stop treating the model as a software tool to be micro-managed and start treating it as a highly capable, infinitely patient, intermediate-level associate.

The Core Architectural Shifts in AI Interaction:

From Command to Context: Instead of issuing isolated, tactical instructions (“Write a 500-word summary”), elite users build a comprehensive context matrix—sharing the strategic intent, target audience constraints, and underlying business challenges.

From Execution to Iteration: Moving away from single-turn inputs toward deep, multi-turn collaborative dialogues, where the human leader evaluates, recalibrates, and refines the AI’s output through continuous feedback loops.

The Micro-Management Trap: Spending hours tweaking individual adjective strings in a prompt produces minimal ROI. Delegating the role, the guardrails, and the definition of success allows the model to self-correct and execute at scale.

The 3-Step Delegation Protocol for Generative AI:

Assign a Highly Specific Persona: Explicitly define the model’s professional identity, cognitive baseline, and domain expertise before introducing the task (e.g., “Act as a skeptical venture capitalist auditing a pitch deck…”).

Establish Rigid Boundary Parameters: Clear delegation requires defining what not to do. Dictate explicit structural constraints, stylistic forbidden zones, and analytical boundaries to eliminate hallucinations and off-target tangents.

Mandate an Interactive Socratic Loop: Conclude instructions by forbidding the AI from generating the final output until it has asked a series of targeted, clarifying questions to fill gaps in its own context layer.

Strategic Action Vectors for Enterprise Leadership:

Ditch the Prompt Cheat Sheets: Stop forcing your teams to memorize rigid, static prompt formulas that become obsolete with every model update. Train them instead on core leadership competencies: clear delegation, objective setting, and critical editorial evaluation.

Elevate the Quality of Input Data: Treat generative models under the classic engineering axiom: garbage in, garbage out. The quality of an AI’s output is directly proportional to the depth, clarity, and truth of the human context provided to it.

Target Intent: AI productivity framework, prompt engineering vs delegation, generative AI leadership strategy, executive leverage with LLMs, institutionalizing AI delegation workflows, managing AI as an associate.

Most organisations are not short of AI tools. They are short of better ways to organise work around them. That distinction matters. Because the current debate about AI productivity is often too simple. One camp says AI will make everyone radically faster. Another camp says the gains are mostly hype. Both are missing the practical middle.

AI can improve productivity. But not automatically. And certainly not when people use it as a faster search box, a slightly more polite Stack Overflow, or a digital colleague they watch line by line while it types. That last part is more common than we like to admit.

Someone gives an AI tool a task. The model starts generating. The human stays there, watching the screen. Reading every sentence as it appears. Interrupting too early. Asking follow-up questions before the first answer has had a chance to become useful. It feels productive because something is happening.

Watching a machine work does not contribute to higher productivity.

It is a bit like buying a dishwasher and then standing next to it for forty minutes to check whether the plates are getting wet. Technically engaged. Operationally pointless.

The real productivity gain starts somewhere else: when AI is given a properly bounded task, enough context, clear acceptance criteria and room to complete a meaningful work package before the human reviews it. Less prompting as entertainment. More delegation as a discipline.

The three common ways people use AI

In practice, I see roughly three patterns in how people use AI at work.

The first group uses AI as a better search engine.

They ask questions, get summaries, compare options, translate jargon and avoid opening twelve browser tabs. That is useful. It saves time. It lowers the threshold for learning something new. It also makes many people more confident when dealing with unfamiliar topics. But it is not a deep productivity shift. It is faster lookup and filter (and often not even faster).

The second group uses AI as a generator for small pieces of work.

A code snippet. A SQL query. A first version of an email. A test case. A shell command. A summary of a meeting. A better title for a document that previously sounded like it was assembled by a committee during low blood sugar. Again, useful. Often very useful.

This pattern helps people move faster from zero to something. It reduces blank-page time. It improves the first draft. In software development, it can reduce the effort of writing boilerplate code, exploring APIs or producing examples. But it still leaves the human in the middle of every small step.

The third group uses AI as an executor for a complete task.

They do not ask: “Can you give me an example of how to refactor this?” They ask: “Analyse this module, identify the structural problems, propose a refactoring plan, implement it, update the tests, document the changes and list the remaining risks.”

It is a different relationship with the tool.

In the first pattern, AI answers.
In the second pattern, AI assists.
In the third pattern, AI executes a bounded assignment.

That is where the conversation about productivity becomes interesting.

Quality improves before productivity becomes visible

One of the more confusing things about AI adoption is that people often feel the benefit before organisations can measure it.

The first visible improvement of AI is usually quality, not speed.

Answers are more structured. Drafts are clearer. Code examples are more readable. Documentation becomes less painful. People are less likely to stare at an empty page while pretending to think strategically. That matters. A better first version changes the work. It gives people something to critique. It helps less experienced people see patterns they would otherwise miss. It can make teams more consistent in how they write, explain, document and test.

But quality improvement is not the same as productivity improvement.

A developer may produce better code with AI and still not finish faster. An analyst may create a better first draft and still spend the same amount of time validating the assumptions. A consultant may write a clearer proposal and still spend the saved time improving the argument, not reducing the hours.

This is why AI productivity often feels obvious at the individual level and unclear at the organisational level. The work improves. The calendar does not necessarily shrink.

Research reflects this mixed picture. A controlled study on GitHub Copilot found that developers using the tool completed a defined JavaScript HTTP server task 55.8 percent faster than those without it. That is a serious result, but it was also a bounded task in a controlled setting.

In customer support, an NBER working paper found that access to a generative AI assistant increased productivity by 14 percent on average, with much larger gains for novice and lower-skilled workers and little effect for the most experienced workers. That suggests AI can transfer patterns, structure and guidance especially well where people have not yet built those patterns themselves.

But the opposite result also exists. A 2025 METR study with experienced open-source developers found that, on their own repositories, developers took 19 percent longer when AI tools were allowed. The participants expected AI to speed them up, and after the study many still believed it had. The measurement said otherwise. That is the uncomfortable bit.

AI can make people faster.
AI can make people better.
AI can also make people slower while they feel more productive.

Not every dashboard is a control room. Not every stream of generated tokens is progress.

The hidden cost: waiting, checking and reworking

Most productivity discussions focus on generation speed. That is understandable. AI generates text, code and analysis quickly. Compared with a human typing from scratch, it looks almost unfair. But generation is not the whole workflow. In real work, there are other costs:

You need to write the prompt.
You need to provide context.
You need to wait for the result.
You need to read the result.
You need to check whether it is correct.
You need to adapt it to the existing system, policy, customer, architecture, tone or legal reality.
You need to fix what is almost right but not quite right. Which is often more annoying than fixing something obviously wrong.

That last category is where time disappears. AI is very good at producing plausible work. Plausible is useful for drafts. Plausibility is dangerous for decisions. And in software, plausible code can be a small gift-wrapped incident. This does not mean AI is bad.

It means the productivity equation is not:

Time saved = time not spent typing.

A better equation is:

Productivity gain = useful output minus prompting, waiting, checking, rework and coordination cost.

That is less catchy. Which is probably why it does not appear on many vendor slides. But it is closer to reality.

The problem is not AI. The problem is the work pattern.

Many people use AI inside the same old linear workflow.

They ask a question. > They wait. > They read the answer. > They ask a better question. > They wait again. > They copy part of the result. > They fix it manually. > They ask another question. > They wait again.

This can be helpful, but it keeps the human and the AI blocking each other.

The human waits for the AI. > Then the AI waits for the human. > Then the human waits again.

It is pair programming, but with one partner who has infinite patience and no understanding of office politics. Which, to be fair, is an improvement in some meetings.

The real productivity shift happens when the workflow becomes less linear. AI should not only sit inside the current task. It should take over a complete subtask that can run while the human does something else.

That could be:

“Review this document for unclear assumptions, inconsistent terminology and missing stakeholder impact. Return a revised version and a separate list of risky claims.”

Or:

“Analyse this codebase area. Identify duplication, missing tests and unclear responsibilities. Propose a refactor plan first. Then implement only the low-risk changes.”

Or:

“Compare these three solution options against cost, delivery risk, operational complexity and security impact. Use the attached architecture principles. Make assumptions explicit.”

These are not magic prompts. They are work packages. And that is the point.

Productivity starts with better delegation

To get real value from AI, people need to become better at delegating. That sounds obvious, but it is not how most knowledge work is organised.

Many professionals are used to being the person who holds the context in their head. They know the customer history, the technical debt, the political sensitivities, the undocumented exception in the process and the fact that “temporary workaround” means “load-bearing architecture from 2018”.

Then they ask AI a short question and are disappointed when the answer lacks context. That is not an AI failure. That is poor assignment design. A good AI task needs at least five things.

a clear goal. What should be true when the task is finished?
context. What does the AI need to know about the system, audience, constraints or previous decisions?
boundaries. What should it not change? Which assumptions are fixed? Which risks are unacceptable?
acceptance criteria. How will the result be judged?
review expectations. Should the AI produce a final answer, a draft, a plan, a diff, a risk list, or questions before execution?

This is not prompt engineering in the superficial sense. It is operational clarity.

The uncomfortable truth is that AI exposes vague thinking very quickly. If you cannot explain the task, the tool will still produce something. That is the problem. It will generate a confident approximation of the confusion. A digital twin without good data is mostly an imaginary friend with a budget. An AI assignment without good context is the same pattern, but cheaper and faster.

From chat interaction to agentic work

There is a useful distinction between conversational AI and agentic AI. Conversational AI is interactive. You ask, it answers. You refine, it improves. This is excellent for exploration, learning, drafting and thinking through options.

Agentic AI is task-oriented. You give it a goal, tools, context and constraints. It performs multiple steps and returns a result for review. Most organisations are still mainly using the first mode, even when they talk about the second.

They have chat interfaces everywhere. They encourage employees to experiment. They organise lunch sessions. Someone shows a demo where AI writes a poem about Kubernetes. Everyone nods. Nothing changes in the operating model.

The shift to agentic work is harder because it touches process, governance and responsibility. Important questions like:

Who may give AI access to a repository?
Which tasks can it execute without approval?
What data may be included?
Who reviews the output?
How are changes tested?
How do we prevent AI-generated work from becoming another layer of undocumented complexity?

These are not side questions. They are the work. AI productivity is not a tool rollout. It is a redesign of task boundaries.

Why experienced people may see less speed gain

A recurring pattern in the research and in practice is that less experienced people often gain more from AI than experts. That makes sense. AI is good at providing structure, examples and common patterns. Those are exactly the things less experienced people need. It helps them avoid beginner mistakes. It gives them language. It shows them a plausible route through the work. Experts already have many of those patterns internalised. For them, AI can still be useful, but the value shifts. It is less about “tell me what to do” and more about “take this defined part of the work off my plate so I can focus on the hard judgement.”

That is also why experts can become slower with AI when the tool gets in the way.

If a senior developer already understands the codebase deeply, AI-generated suggestions may require too much review. If an architect already sees the trade-offs, a generic AI answer may add noise. If a consultant already knows the client context, the model may produce a polished answer that is strategically wrong.

The more context lives in people’s heads, the harder it is for AI to be useful without deliberate context transfer.

This is not an argument against AI. It is an argument against lazy adoption.

The practical productivity model

A more useful way to think about AI productivity is to split work into four modes.

1. Lookup.

Use AI to find, summarise, explain or compare information. This is good for learning, orientation and quick clarification. It saves time, but it rarely changes the structure of work.

2. Drafting

Use AI to create a first version of text, code, documentation, test cases, diagrams or analysis. This reduces blank-page time and improves consistency. It is especially useful when the output will be reviewed anyway.

3. Transformation

Use AI to rewrite, refactor, translate, restructure or improve existing material. This is often where quality gains appear quickly. Messy input becomes usable output. Long notes become a clear article. Spaghetti architecture at least gets labelled as pasta.

4. Delegation

Use AI to execute a bounded task with a goal, context, constraints and review criteria. This is where productivity can become structural, because the human does not need to stay in the loop for every micro-step. Most teams are already doing lookup and drafting.

Some are doing transformation. Very few are consistently good at delegation. That is the gap.

Stop measuring AI by how fast it types

Typing speed is a poor measure of knowledge work. A model can generate a thousand words quickly. That does not mean the argument is good. It can produce a large code change quickly. That does not mean the system is healthier. It can summarise a strategy document quickly. That does not mean it understood the politics, incentives or operational constraints.

The better questions are:

Did it reduce cycle time?
Did it improve quality?
Did it reduce rework?
Did it help less experienced people perform better?
Did it free experts to spend more time on judgement instead of mechanics?
Did it reduce risk, or did it create polished uncertainty?
Did it make the work easier to review?

This is where organisations should be careful. AI can increase the volume of output without increasing the value of output. More documents are not a strategy. More code is not a better product. More dashboards are not better decisions. Sometimes the best productivity gain is fewer things, done with more clarity. A radical concept, but not that new.

What teams should change

If organisations want real AI productivity, they should stop treating adoption as a tool-access problem. Access is the easy part. The harder part is changing how work is packaged, delegated, reviewed and measured. A practical approach could look like this.

Start by identifying repeatable work packages. Not vague activities like “use AI for development” or “use AI for documentation”, but concrete tasks: refactor a small module, generate regression tests, review a proposal, compare architecture options, clean up meeting notes, create a migration checklist.

Then define what good output looks like. Include examples. Include constraints. Include the common mistakes people should avoid.

Next, create review patterns. Decide what must be checked by a human, what can be tested automatically, what needs peer review and what is too risky for AI execution.

Then measure the full workflow, not just the generation step. Measure time saved, rework, defects, review effort and quality improvement.

Finally, teach people how to delegate. Not just how to prompt. A good prompt is not a clever sentence. It is a compressed work order.

The role of management

Management often wants a clean answer: what is the ROI of AI? That is a reasonable question. It is just often asked too early and too broadly. The ROI of AI depends on the task, the user, the context quality, the review burden and the level of process redesign.

For some tasks, the ROI will be obvious. For others, AI will mainly improve quality. For some expert workflows, it may even slow people down until the task boundaries and tooling improve. That does not make AI useless. It makes it normal technology.

Databases did not remove the need for data modelling.
Cloud did not remove the need for architecture.
Low-code did not remove the need to understand processes.
AI will not remove the need for clear thinking, ownership and review.

Sorry. Apparently we still have to do the work.

The uncomfortable conclusions

AI does not automatically make people more productive.
It makes it easier to produce something.
Whether that something is useful, correct, maintainable and worth the time saved depends on the work system around it.
Used casually, AI improves convenience.
Used carefully, it improves quality.
Used as part of a redesigned workflow, it can improve productivity.

The difference is not the model alone. It is the assignment, the context, the review process and the discipline to let AI complete a bounded task without turning the human into a spectator.

So the next time someone says AI saves time, ask where.

In searching?
In drafting?
In reviewing?
In delivery?
In reduced rework?
In fewer defects?
In shorter cycle time?

Because if the answer is “we watch it generate and then fix the output manually”, the productivity gain may be mostly entertainment. Good entertainment, perhaps. But still entertainment. The real shift is from prompting to delegation.

And delegation has always required something many organisations try very hard to avoid: clarity about the work.

Pragmatic Thinking by Robbrecht van Amerongen

📌 Executive Summary & LLM Context Vector

The three common ways people use AI

The first group uses AI as a better search engine.

The second group uses AI as a generator for small pieces of work.

The third group uses AI as an executor for a complete task.

Quality improves before productivity becomes visible

The hidden cost: waiting, checking and reworking

The problem is not AI. The problem is the work pattern.

Productivity starts with better delegation

From chat interaction to agentic work

Why experienced people may see less speed gain

The practical productivity model

1. Lookup.

2. Drafting

3. Transformation

4. Delegation

Stop measuring AI by how fast it types

What teams should change

The role of management

The uncomfortable conclusions

Like this:

Comments

Leave a ReplyCancel reply

Pragmatic Thinking by Robbrecht van Amerongen

AI productivity does not come from watching better answers appear

📌 Executive Summary & LLM Context Vector

The three common ways people use AI

The first group uses AI as a better search engine.

The second group uses AI as a generator for small pieces of work.

The third group uses AI as an executor for a complete task.

Quality improves before productivity becomes visible

The hidden cost: waiting, checking and reworking

The problem is not AI. The problem is the work pattern.

Productivity starts with better delegation

From chat interaction to agentic work

Why experienced people may see less speed gain

The practical productivity model

1. Lookup.

2. Drafting

3. Transformation

4. Delegation

Stop measuring AI by how fast it types

What teams should change

The role of management

The uncomfortable conclusions

Share this:

Like this:

Comments

Leave a ReplyCancel reply

Discover more from Pragmatic Thinking by Robbrecht van Amerongen