Industry Deep Dives·May 8, 2026·9 min read·By Rodrigo Ortiz

AI for Consulting Firms: The 40% of Billable Hours You're Losing

AI for consulting firms recovers 40% of billable hours lost to admin and rework. Where it works first and how to roll it out without breaking utilization.

Most consulting and professional services firms are running at 60% utilization and calling it healthy. They are not. The other 40% is not idle time — it is buried inside billable engagements as research, formatting, admin, and rework that nobody pays for. The senior who is supposedly building the deck is rebuilding the deck for the third time. The associate who should be analyzing the data is reformatting the spreadsheet someone else built last month.

AI for consulting firms is the lever that flips that ratio. Done well, it does not replace the consultant — it deletes the work the consultant should not have been doing in the first place.

According to Harvard Business Review's analysis of how people actually use generative AI, the dominant real-world applications — ideation, summarization, editing, research, drafting, technical assistance — map almost one-for-one to what a consultant spends their week doing. For a firm whose business model is selling those exact hours, the implication is uncomfortable: the entire industry is sitting on a productivity overhang it has not yet priced in.

Where the 40% actually goes

If you ask a partner where the time leakage in their firm comes from, they will name the wrong things. The honest answer is rarely the dramatic one. It is not heroic late nights or surge weeks. It is small, daily, structural friction that compounds across every engagement on the books.

The pattern across the consulting and advisory firms we have worked with is consistent. The 40% breaks down roughly into:

Re-research. Two associates working different engagements unknowingly research the same topic three months apart. The institutional memory does not exist outside of someone's laptop.
Document plumbing. Reformatting decks. Cleaning up spreadsheets a client emailed in PDF. Rebuilding the same engagement letter for the fifth time this quarter.
Status reporting. Weekly updates compiled by hand. The same numbers extracted from the same systems. Then a partner re-formats them for the steerco.
Knowledge transfer rework. A senior leaves, takes the engagement context with them, and the next person bills three weeks getting up to speed on a project that should have onboarded in three days.

The non-obvious point. The 40% is invisible because it is billed. Clients pay for the rebuilt deck and the redone research the same way they pay for original analysis. The leakage shows up as margin compression and partner burnout, not as hours written off.

That last part is what makes this problem stubborn. The leakage does not appear on the time-tracking dashboard as red. It appears on the P&L as a slow erosion of effective rate per hour — billable rate stays flat, but the value-per-hour the client actually receives goes down. Eventually clients notice, and rate negotiations start going the wrong direction.

If your effective rate per hour is flat or declining while billing rates are rising, the 40% is the reason — and it is invisible to the time tracker.

Where AI for consulting firms creates the biggest leverage

The wrong starting point is "let's deploy AI across the firm." The right starting point is to find the two or three workflows where the 40% concentrates and replace those first. AI for professional services firms sees the fastest payback in three specific places:

Internal knowledge retrieval. Every firm has a wiki, a SharePoint, and a network of laptops where the actual answers live. AI knowledge automation indexes all of it and returns answers with citations in seconds. The associate who would have spent two hours hunting for the prior engagement on a similar topic spends ninety seconds and gets back to billable work.
Document drafting and review. First-pass drafts of memos, engagement letters, board reports, and client decks. The model produces 70% of the document. The consultant edits and validates. The work is not skipped — it is shifted from typing to thinking, which is what the client is paying for in the first place.
Status and reporting compilation. Weekly client updates, internal pipeline reports, and steerco packs. Automated reporting AI assembles the numbers, the commentary, and the visualizations from the source systems. The partner reviews and signs off. Twenty minutes instead of three hours.

The pattern across all three is the same. AI does the structured information work; the consultant does the judgment work. The firm gets to bill the same hours but at a much higher value-per-hour, and the team stops burning out on the parts of the job nobody went to business school for.

This is not theoretical. The same shape shows up in adjacent industries. We have written about it in the context of AI for law firms and the knowledge management problem — both essentially the same operational story with different domain wrappers.

Start with knowledge retrieval, document drafting, and status reporting — the three workflows where the 40% concentrates in almost every firm.

Three rollout patterns that stick (and one that always fails)

The pattern that fails: a firm-wide GenAI license, an all-hands training session, and an expectation that consultants will figure out the rest. Three months later, usage has collapsed to a small group of early adopters and nobody can show what the investment bought.

The patterns that work share a structural property. They start narrow, they tie AI to a specific deliverable, and they make the new way faster than the old way for the people doing the work — not in theory, in measurable minutes saved per output.

Engagement-anchored rollout. Pick one practice or service line. Build a knowledge index of its prior engagements. Train the team on retrieval and drafting against that specific knowledge base. Measure the time-to-first-draft on the next five engagements. This typically shows a 50-60% reduction in research and drafting time within the first month.
Reporting-first rollout. Pick the most labor-intensive recurring report — usually a weekly client status pack or a monthly internal pipeline review. Automate it end-to-end. Time savings are immediate, visible, and easy to socialize. This is the rollout pattern that wins firm-wide credibility fastest.
Partner-anchored rollout. Identify two or three partners who are willing to use AI as a working tool, not an experiment. Build the workflow with them. Their visible productivity gains create internal demand from the rest of the firm. Top-down mandates fail; partner-pulled rollouts spread.

The trap. Buying enterprise GenAI seats and calling it the rollout. Tools without workflows produce no measurable change in utilization or margin. The seat costs are real. The productivity gains are not.

The 40% is invisible because it is billed. The leakage shows up as margin compression, not as hours written off.

The link to ROI matters here. Calculating AI ROI for a consulting firm is not about cost savings on tools — it is about hours redirected from non-billable rework to billable analysis, plus the increased capacity that allows the firm to take on more work without proportional hiring.

Anchor every rollout to a specific deliverable, a specific practice area, or a specific partner — and measure time-to-deliverable, not tool adoption.

What 12 months of operational AI looks like at a 50-person firm

Take a 50-consultant boutique advisory firm doing $30M in revenue with a 22% net margin. Before AI, the firm bills 1,500 hours per consultant per year at an effective rate of around $400/hour. Utilization sits at 62%. Partners spend 35-40% of their week on non-client work — drafting, reviewing, internal reporting, and chasing context across systems.

Twelve months in, the picture looks different. Knowledge retrieval is sub-second across every prior engagement. First-draft documents are AI-generated and partner-edited. The weekly client status pack is auto-assembled and reviewed in twenty minutes. The same 50 consultants now bill closer to 1,650 hours each — not because anyone is working longer, but because more of the working day is on chargeable activity. Partner non-client time drops to 22%.

The shape is consistent with what enterprise AI research has started to find. Deloitte's State of Generative AI in the Enterprise survey reports that organizations capturing real value from generative AI are the ones that integrate it into actual workflows rather than treating it as a horizontal tool — exactly the pattern that distinguishes firms getting margin gains from firms paying for licenses.

The firm in our example has not hired. Revenue is up roughly 12%, mostly from existing clients buying more depth because the firm can deliver faster. Net margin is up around four percentage points because the cost base barely moved. None of this required replacing a single consultant. It required removing the friction that was sitting in between them and their clients.

The point is not the specific numbers — every firm's mix is different. The shape is what travels: AI does not change what the firm sells. It changes how much of the firm's capacity actually goes into selling it.

The 12-month outcome is not headcount reduction — it is the same team doing 10-15% more billable work, at higher margins, without burning out.

What to do this quarter

If you run a consulting, advisory, or professional services firm and you suspect the 40% is real in your operation, three concrete moves for the next 90 days:

Audit the 40%. Pick five recent engagements and walk through where time actually went. The category breakdown — research, drafting, formatting, status, rework — will tell you which workflow to automate first.
Pick one rollout pattern. Engagement-anchored, reporting-first, or partner-anchored. Do not run all three at once. Resource the one that maps to where your leakage concentrates.
Define the metric in advance. Time-to-first-draft, hours-per-status-pack, or non-billable hours per partner per week. Whatever it is, baseline it before the rollout. Without a number, the project drifts and the firm forgets it ran.

Audit the leakage, pick one rollout pattern, and define the metric before you start — that sequence is what separates the firms that compound from the ones that buy seats.

The opportunity is not abstract. The work that AI is good at is precisely the work that is currently eating consultant time without generating proportional client value. The firms moving on this in 2026 will quietly out-compete the ones that wait, not by having a better strategy practice but by having a more efficient operating model under the hood. If you want to map what AI for consulting firms would unlock in your specific operation, talk to a Groath growth expert and we will walk through your engagement mix, your time leakage, and the realistic 12-month payoff.

AI for Insurance Brokerages: How Firms Are Cutting Compliance Costs and Onboarding 3x Faster

Industry Deep Dives·April 1, 2026

Restaurant Margins Are 3–5%. Here's How AI Stops the Leaks

Industry Deep Dives·March 28, 2026

AI for E-Commerce Brands: Personalization, Support, and Abandoned Cart Recovery

See Industry Deep Dives →Also: Why Your CRM Is a Graveyard (And How AI Brings It Back to Life) →

AI for Consulting Firms: The 40% of Billable Hours You're Losing

Where the 40% actually goes

Where AI for consulting firms creates the biggest leverage

Three rollout patterns that stick (and one that always fails)

What 12 months of operational AI looks like at a 50-person firm

What to do this quarter

Related reading

AI for Insurance Brokerages: How Firms Are Cutting Compliance Costs and Onboarding 3x Faster

Restaurant Margins Are 3–5%. Here's How AI Stops the Leaks

AI for E-Commerce Brands: Personalization, Support, and Abandoned Cart Recovery