Back to blog
5/30/20266 min readPlannerPoker Team

Agentic Infrastructure Changes Sprint Estimation

AWS, Google, and cloud platforms are redesigning infrastructure for AI agent traffic. Product teams should estimate bursty workloads, permissions, memory, and cost before they ship agent features.

A Scrum board representing sprint estimation for agentic infrastructure work
Scrum board image by Jeff Sutherland, released under CC BY 3.0 via Wikimedia Commons. Source CC BY 3.0

AI agents are changing more than developer tools. They are changing the infrastructure assumptions behind modern software.

This week, AWS announced the next generation of Amazon OpenSearch Serverless, describing it as a search and vector engine designed for customers building agents. AWS says the new version auto-scales much faster than its predecessor, provisions resources in seconds, supports scale-to-zero, and includes integrations with platforms such as Vercel and Kiro.

TechCrunch framed the broader shift clearly: the internet is being rebuilt for machines. Human users browse at human speed. AI agents can call APIs, search documents, fan out into sub-agents, query databases, and then disappear. That creates traffic patterns product teams did not have to estimate a few years ago.

For agile teams, this belongs in sprint planning.

Agent features are not normal features

A traditional user story might involve a person clicking a button, waiting for a response, and maybe trying again later. An agent story can behave very differently.

One user action can trigger:

  • Multiple retrieval calls.
  • Tool use across internal systems.
  • Vector search.
  • Long-running background work.
  • Sandboxed code execution.
  • Permission checks.
  • Costly model calls.
  • Follow-up actions that happen without another click.

That means the implementation estimate is only part of the story. The team also has to estimate operational shape.

Can the system handle a burst? Can it stop work safely? Can it prove what the agent did? Can it avoid paying for idle infrastructure? Can support teams explain the outcome to a customer?

These are planning questions before they become production questions.

What AWS and Google are signaling

AWS's OpenSearch Serverless launch is a useful signal because search and memory sit at the center of many agentic applications. Agents need context, documents, embeddings, history, and retrieval. When that workload is unpredictable, infrastructure needs to scale differently.

Google's I/O 2026 developer announcements point in the same direction from another angle. Google discussed managed agents in the Gemini API, Antigravity tooling, WebMCP for browser-based agent actions, and Chrome DevTools capabilities for agents. The pattern is clear: platforms are adding primitives so agents can act, inspect, debug, and integrate with software more directly.

That is exciting, but it also expands what teams must estimate.

An agent story is rarely just "add AI assistant." It may include retrieval design, tool contracts, permission models, observability, rate limits, cost controls, human escalation, and failure recovery.

Planning poker can expose infrastructure risk

Planning poker is useful here because infrastructure risk is easy to hide behind a polished demo.

The product manager may see a smooth agent workflow. The frontend engineer may see a clean chat interface. The backend engineer may see background jobs, retries, secrets, logs, and unpredictable fan-out. The platform engineer may see cost risk and missing limits.

Those perspectives should not be flattened too early.

If votes split between 3 and 13, that is not a problem with the meeting. It is evidence that the story means different things to different parts of the team.

Use the spread to ask:

  • Is retrieval already available, or does this story need a new index?
  • Does the agent need write access or read-only tools?
  • What is the worst-case number of tool calls per user action?
  • How do we cap model, search, and background-job cost?
  • What should happen when the agent is uncertain?
  • What logs or traces will support and engineering need?
  • What must be reviewed by a human before the agent acts?

The answer to those questions should change the estimate.

Estimate for cost and control

AWS Prescriptive Guidance for serverless agentic AI emphasizes scale, cost alignment, and governed adoption. That is a good checklist for sprint planning too.

When a team estimates an agent feature, include the work to define:

  • Usage limits.
  • Tool permissions.
  • Data boundaries.
  • Monitoring and alerts.
  • Retry behavior.
  • Timeout behavior.
  • Audit logs.
  • Human handoff.
  • Rollout flags.
  • Cost dashboards.

These items may not show up in the happy-path demo, but they decide whether the feature survives real traffic.

Split agent stories before they get too large

Agentic infrastructure stories often become too big because teams combine product experience, AI behavior, retrieval, tool execution, and operations into one ticket.

A healthier backlog might split the work like this:

  • Define the agent task and acceptance criteria.
  • Build the read-only retrieval path.
  • Add the tool contract for a single action.
  • Add permission checks and audit logging.
  • Add observability and cost caps.
  • Run a limited rollout.
  • Expand the agent's allowed actions after review.

Each story can then be estimated with clearer risk. The team gets feedback earlier, and the agent earns more responsibility in stages.

How to write better Jira notes

For agent features, the final estimate should carry infrastructure context.

Good notes might say:

  • "Five points because retrieval exists, but tool permissions and audit logs are new."
  • "Eight points because this can fan out into background work and needs cost limits."
  • "Split before sprint: UI chat, vector index, and write-action permissions are separate risks."
  • "Agent must stay read-only for the first release."
  • "Estimate includes traces, timeout handling, and fallback copy."

Those notes make the next planning session faster. They also protect the team from repeating the same uncertainty when the agent expands.

The takeaway for product teams

The tech news from AWS, Google, and cloud infrastructure teams is pointing in one direction: software is being prepared for more machine-to-machine work.

That does not make planning poker old-fashioned. It makes planning poker more useful.

Agentic features need human agreement about scope, risk, permissions, cost, and review boundaries. A story point estimate is not only a number for capacity. It is a record of what the team believes must be true before the feature should ship.

When agents become part of the product, sprint planning needs to cover the whole operating model, not just the UI or the prompt. That is how teams get the benefits of agents without letting hidden infrastructure work spill into production.

Sources