Blog

How to Map Your Company's Workflows Before Using AI

Every AI workflow problem is a workflow problem first. Here's how to map your company's recurring tasks before configuring AI — and why the sequence matters.

Phos Team ·
Phos AI Labs Operations

Every AI workflow problem that looks like an AI problem is usually a workflow problem.

The outputs are inconsistent because the inputs are inconsistent. The AI misses edge cases because the edge cases were never documented. The human checkpoint is in the wrong place because nobody mapped where the judgment calls actually occur.

Workflow mapping before AI configuration is not bureaucratic overhead. It is the fifteen minutes per workflow that prevents the three months of debugging that follows when the mapping is skipped.

This article describes a practical workflow mapping process that produces one document per workflow; a plain-language map of exactly how the workflow runs today, what decisions it contains, what inputs it requires, and where the human judgment points are.

That document is the specification the AI is built from. Without it, the AI is configured from assumption. With it, the AI is built from the ground truth of how the work actually happens.


Why AI workflows fail without prior mapping: the specific failure modes

Failure mode 1: The incomplete input problem

The AI workflow is built to receive one type of input: the standard case.

The real workflow receives four types of input; including two edge cases that require different processing. The AI handles the standard case well and breaks on the edge cases.

Example: an invoice processing workflow is configured for PDF invoices emailed directly to the AP inbox. The real workflow also receives invoices as image attachments, invoices forwarded from other email accounts, and handwritten delivery receipts that need manual entry.

The AI handles the PDF case correctly. The other three cases surface as “broken” outputs that require the finance lead to investigate and manually correct.

What mapping would have prevented: the input inventory step captures all four input types. The AI configuration is built to handle all four; or to route the edge cases to a separate handling process before they reach the AI.


Failure mode 2: The invisible decision problem

The person running the workflow makes several judgment calls that are so routine they have become invisible.

The AI is not given these decision rules because they were never documented. It produces outputs that fail in the specific situations where the judgment calls matter.

Example: a client follow-up email workflow. The account manager makes four invisible decisions on every email: how urgent to make the tone based on where the deal is; whether to copy a senior team member based on the client’s seniority; whether to include a specific next step or leave it open-ended; how much detail to include based on the client’s stated communication preference.

The AI workflow is configured without any of these decision rules. The outputs are generic because the AI does not know which version of the email to produce.

What mapping would have prevented: the decision points step surfaces all four decisions. The AI configuration includes the decision rules; or the workflow includes a “decision input” field where the account manager provides the relevant context before the AI runs.


Failure mode 3: The missing human checkpoint problem

The AI workflow is configured to run autonomously from input to output. The human review step is defined vaguely (“someone reviews before sending”). The review step is not built into the automation architecture.

As the automation scales, outputs go out without the intended review because the checkpoint was never formalised.

Example: a support ticket response workflow. The intended process: AI drafts the response, support lead reviews, support lead sends. In practice, the automation is configured to draft and queue for sending. During a busy period, the support lead does not process the queue. Responses go out without review. Two contain factually incorrect information.

What mapping would have prevented: the human checkpoints step explicitly documents: who reviews, what they are checking for, what the review timeout is, and what “approved” looks like in the automation. The checkpoint is built into the automation as a required step; not an assumed one.


The five-component workflow map: what to document for each workflow

Component 1: Trigger

What to document: what starts this workflow?

Trigger type documentation:

TRIGGER TYPE: event / schedule / manual

If event: what is the event, what system generates it, how is it identified?
If schedule: what is the cadence; what data is available at that time?
If manual: who initiates it, in what situations, what information do they have
           when they decide to run it?

Why it matters: the trigger determines whether the workflow can be automated (event or schedule triggers) or requires human initiation (manual triggers). Misidentifying the trigger produces an automation that either never fires or fires at the wrong time.


Component 2: Inputs

What to document: everything the workflow requires to produce its output; not just the obvious primary input, but all the inputs including the contextual information the person running the workflow uses without always recognising as an input.

Input typeDescription
Primary inputThe data or document the workflow processes
Context inputsAdditional information the runner uses (client account history, deal stage, a prior decision, the relevant policy)
Variable inputsThe inputs that change between runs (the specific client, the specific invoice, the specific project)
Stable inputsThe inputs that stay constant across runs (company voice, pricing policy, standard format)

Why it matters: stable inputs go into the context pack or workflow template. Variable inputs are loaded at runtime. Context inputs become either part of the context pack or a required field that the runner provides before the AI executes.

Unmapped inputs produce AI outputs that are missing information the runner had but never provided.


Component 3: Decision points

What to document: every moment in the workflow where the person running it makes a judgment call; even the ones that feel automatic.

How to surface invisible decisions:

Ask the person who runs the workflow: “Walk me through the last time you did this task. At each step, ask yourself: was there a moment where you could have done it differently based on the situation?”

This question surfaces the invisible decisions that feel like “just how it’s done.”

Decision point documentation format:

DECISION POINT: [Name the moment]
QUESTION BEING ANSWERED: [What the runner is deciding]
OPTIONS AVAILABLE: [What the possible responses are]
WHAT DETERMINES THE CHOICE: [The factors that drive the decision]
AI HANDLING: AI decides using rule / AI flags for human / Human decides before AI runs

Example:

DECISION POINT: Tone calibration for collections email
QUESTION BEING ANSWERED: How urgent should the collections communication be?
OPTIONS AVAILABLE: Gentle reminder / firm reminder / formal demand
WHAT DETERMINES THE CHOICE: Days overdue (0–15: gentle; 15–30: firm; 30+: formal);
  client relationship tier (long-term clients get one additional gentle reminder
  regardless of days overdue)
AI HANDLING: AI decides using the days-overdue rule; client tier exception flagged
             for human review

Component 4: Human checkpoints

What to document: every point in the workflow where a human must review, approve, or make a decision before the workflow proceeds.

Human checkpoint documentation:

CHECKPOINT NAME: [Name the checkpoint]
CHECKPOINT TYPE: Review / Approval / Decision
  Review: human reads output before it is used
  Approval: human must actively confirm
  Decision: human provides input the AI cannot provide
WHO: [Specific role — not "someone," not "the team"]
WHAT THEY ARE CHECKING: [The specific quality, accuracy, or judgment element]
WHAT HAPPENS IF NOT ACTIONED IN [X] HOURS: [The fallback — escalate, hold, notify]

Why explicitness matters: vague checkpoints get skipped during busy periods.

A checkpoint that says “finance lead reviews before payment” with no timeout and no fallback gets skipped when the finance lead is on holiday.

An explicit checkpoint with a 24-hour timeout and an escalation to the COO is a checkpoint that functions under real operating conditions.


Component 5: Expected outputs

What to document: what the workflow produces, in what format, and what a good output looks like.

Output elementWhat to document
Output typeDocument, email draft, data entry, notification, structured report
Format requirementsSpecific structure, required sections, length range
Quality barThe specific criteria that make an output acceptable for use vs. requiring rework
DestinationWhere the output goes (email draft folder, PM tool task, finance approval queue)
AudienceWho receives or reviews the output; and their expectations

The workflow mapping interview: how to extract the real workflow in 20 minutes

The fastest and most accurate workflow mapping method is a structured 20-minute conversation with the person who runs the task.

Process documentation, SOPs, and observation-based mapping all produce the official version. The interview produces the real version.

The 20-minute mapping interview structure

Opening (2 minutes):

“I want to understand exactly how [workflow name] actually runs; not the way it’s supposed to run, but the way you actually do it. I’ll ask you to walk me through the last time you did it, and I’ll ask some questions along the way.”


Trigger and setup (3 minutes):

  • “What starts this workflow? What just happened when you begin it?”
  • “What information do you have in front of you when you start?”
  • “Is there anything you check or review before you begin?”

Step-by-step walkthrough (8 minutes):

“Walk me through the last time you did this, step by step. I’m going to pause you at any point where you make a choice and ask you what drove the choice.”

At each step: “Was there a moment where you could have done that differently? What would make you do it differently?”

At each visible judgment call: “What information would change your answer here? What would make you go the other way?”


Edge cases (4 minutes):

  • “What is the most unusual situation you have encountered running this workflow? How did you handle it?”
  • “What breaks this workflow? What inputs does it not handle well?”
  • “What would make you not run this workflow and handle the situation differently?”

Output and checkpoint (3 minutes):

  • “Where does the output end up? Who sees it?”
  • “Do you review it before it goes anywhere? What are you looking for?”
  • “Who else might review it? What are they looking for?”

The documentation step:

After the interview, the mapper produces the five-component workflow map within 24 hours. They send it to the person interviewed for a 5-minute review: “Does this accurately describe how you actually run this task?”


The workflow inventory: before mapping individual workflows

Before mapping individual workflows in depth, produce a workflow inventory: a simple list of every recurring task in the business, with enough information to prioritise which to map first.

The workflow inventory format:

Workflow nameOwner roleFrequencyTime per runAI candidate?Priority
Invoice intake and matchingFinance lead30–50x/week8–12 minYesHigh
Client proposal draftAccount manager3–5x/week90–120 minYesHigh
Weekly pipeline summaryOps lead1x/week45–60 minYesHigh
Meeting action item extractionProject manager5–10x/week20–30 minYesMedium
Expense report codingFinance lead10–15x/week15–20 minYesMedium
Custom enterprise quoteFounder1–2x/month3–4 hoursPartialLow

The three columns that drive prioritisation:

  • Frequency × time per run = the total time cost per week; the impact of automating
  • AI candidate? = whether the task is sufficiently rule-based and document-based for AI to handle it well
  • Priority = the combination of time cost and AI candidacy

Who provides the inventory:

Ask every team member in a function-by-function sweep: “List every recurring task you do that you perform at least once per week, in order of the most time it costs you.”

The inventory is produced in a 30-minute team meeting or via a shared form. The founder adds any workflow they directly own.

From inventory to mapping:

Map the top five to eight workflows by priority first. Expand to lower-priority workflows after the high-priority ones are mapped, built, and running.


Common questions on workflow mapping

”What if the person who runs the workflow is too busy to be interviewed?”

Two alternatives:

  • Self-assessment form: send the five component questions as a written form and ask them to complete it asynchronously. Follow up with a 5-minute clarification call. Less accurate than the interview; still better than mapping without them.
  • Shadow mapping: observe one complete run of the workflow and document what you see; then send the map to the person for review. They will correct it; those corrections are the invisible decision points the shadow captured but did not understand.

”How do I map a workflow that is done differently by different people?”

Map both versions. Document the differences explicitly in the decision points component.

Then make a design decision: does the AI run the most common version, the best version, or does it ask the runner which version to use? This is a workflow design question that mapping surfaces; not one the mapper answers.

”Should I map workflows for the whole company or just the ones I plan to automate first?”

Map the top five to eight AI-priority workflows in depth first. Produce a shallow inventory (workflow name, owner, frequency, time per run) for the whole company.

The shallow inventory tells you where to focus. The deep map tells you how to build. Do both; the shallow inventory in one 30-minute team session, the deep maps in individual 20-minute interviews over two weeks.

”What if the workflow is changing: should I map the current version or the intended version?”

Map the current version. Build the AI on the current version. Update the map when the workflow changes.

Building AI on an intended workflow that does not yet exist produces an AI that is wrong on day one. Building it on the current workflow produces an AI that is right on day one and can be updated when the workflow changes.

”How detailed does the workflow map need to be?”

The completeness test: give the workflow map to someone who has never run the workflow and ask them to configure an AI to handle it.

If they can do it without asking you any questions; the map is complete. If they need to ask questions; the answers to those questions are the gaps in the map.

”Can I map workflows myself or does it require an outside facilitator?”

You can map workflows yourself using the 20-minute interview structure above.

The risk of self-mapping: founders tend to map the official version of their own workflows; not the real version. The invisible decisions are harder to surface when you are the person who made them invisible through repetition.

For workflows the founder owns: ask someone else to interview you using the structured questions above. The interview format surfaces what solo reflection often misses.


Want the workflow mapping done before any AI is built: so every workflow is configured from the ground truth?

Workflow mapping before AI configuration is the fifteen minutes per workflow that prevents three months of debugging.

The five-component map; trigger, inputs, decision points, human checkpoints, expected outputs; produces the specification the AI is built from.

The structured mapping interview extracts the real workflow from the person who runs it; including the invisible decisions that configuration without mapping consistently misses.

The AI built from a complete map is more accurate, more maintainable, and more adaptable when the workflow changes than one built from assumption.

Path one: run the workflow inventory this week. Ask every team member to list their highest-frequency, highest-time-cost recurring tasks in a 30-minute meeting. That inventory tells you exactly which five workflows to map first. Map them using the 20-minute interview structure above before any AI is configured.

Path two: bring in a partner. If you want the workflow audit and mapping sprint run correctly before any AI is built; as part of the first two weeks of a structured engagement; that is the Phase 1 work Phos AI Labs does. We have run 400+ AI engagements. Clients include Zapier, Coca-Cola, Medtronic, Dataiku, and American Express. Thirty minutes, no deck. Start here.

The fastest way to know whether we're the right fit, is a conversation.

STEP 1/2 · ABOUT YOU