Principle
The forward-deployed (post-sales) motion, not go-to-market (pre-sales), is what builds AI-era defensibility
Defensibility is in the forward-deployed post-sales motion that captures tacit knowledge, not in pre-sales go-to-market.
Foody distinguishes go-to-market (pre-sales, easily neutralized when a customer realizes Claude can copy the product) from forward-deployed (post-sales, training agents on company-specific tacit knowledge). OpenAI and Anthropic invest heavily in the forward-deployed motion for this reason — echoing Sequoia's "services are the new software."
Move your differentiation downstream of the sale: capture and operationalize the customer's tacit knowledge.
“if you have a great forward deployed motion where you're going deep with a customer, you're training the agents based on all of this tacit knowledge within the company so that it understands how to perform effectively, that feels incredibly differentiated and hard to recreate.”Brendan Foody
Principle
Tacit knowledge in employees' heads is the durable human contribution; models will clean and structure data themselves
Don't bet human roles on data-cleaning; bet them on codifying the tacit knowledge models can't infer.
Foody pushes back on the "data cleaner will be a top job" thesis: models will structure data as reasoning scales, but the tacit context in employees' heads must be codified by humans to train agents. He found this firsthand trying to automate Mercor's own workflows.
The durable human job is surfacing what was never written down, not tidying what already exists.
“they'll be able to clean the data themselves fairly effectively as reasoning capabilities go up. The thing that humans will need to contribute to is all of the tat knowledge within the organization that isn't written down”Brendan Foody
Principle
The model is the product — defensibility lives upstream of the foundation models, not in the software layer on top
Value and durable margin accrue to infrastructure upstream of the models, not the application software downstream.
Foody's tweet thesis: the next 12 months are dramatically better for infrastructure companies upstream of Anthropic/OpenAI than for application-layer companies downstream. Claude Cowork adding medical/legal is "not a far leap" once it did software engineering, so software-layer defensibility erodes while data and compute moats compound.
In the post-AI stack, bet on the layer the model can't cheaply absorb — data, compute, network effects — not the software it can clone.
“building defensibility in the software layer on top of the models is going to be incredibly difficult. Whereas on the other side of things in the infrastructure side, it feels like there are meaningful moats that are getting built.”Brendan Foody
Principle
Revenue is real, not GMV, when you own the full value chain beyond the marketplace take
Owning the full delivery chain — not just the marketplace match — makes the top line revenue, not GMV.
Mercor books 30-40% gross margin because customers buy finished tasks (e.g. $1,000 for a task that delivers model improvement), with experts as one input. Foody contrasts this with Uber-style marketplaces where the driver network IS the end product.
Revenue quality is a function of how much of the value chain you own beyond the match.
“the experts are actually only one part of the broader value chain that we deliver to customers... that's not the end product in the same way as some of those marketplace businesses.”Brendan Foody
Principle
Quality is the X-factor that creates pricing power, because data value follows a power law
When value is power-law distributed, quality (not volume or price) is the dimension that earns pricing power.
Mercor's vertical integration exists to control this: downstream quality checks inform upstream decisions about which experts to onboard to produce the highest-value data. Quality differentiation compounds into pricing power because labs care most about the top-value tasks.
In a power-law value distribution, compete on quality of the rare high-value units, not on price or breadth.
“out of a data set of 10,000 tasks, the top 2000 tasks will create majority of the value... quality is the X factor that becomes dramatically more valuable than any other dimension.”Brendan Foody
Principle
Network effects are the only durable application-layer moat once software can be cloned end-to-end
The litmus test for whether an app-layer company survives the model is whether it has genuine network effects.
Foody names Salesforce (third-party integration marketplace), Slack (Slack Connect) and Cart as cases where the network effect, not the software, is the moat. Without it, that is the litmus test that determines whether a company becomes worthless or gains dramatic value.
Strip the software from any app-layer company; if no network remains, the model will eat it.
“the companies that have network effects will be able to in some ways generate more value because they can iterate 10 times faster... The companies that don't have network effects are going to struggle very significantly because then there's not really a defensible moat in the pure software”Brendan Foody
Principle
The lump-of-labor fallacy: productivity gains create more jobs, not fewer, because problems are unlimited
Automation reallocates labor to previously-unaddressable problems; it does not reduce total work.
Foody invokes 250 years of history (agricultural, industrial, computer revolutions) where feared mass displacement gave way to more jobs than ever, because society never runs out of problems — climate, cancer, space. He concedes the transition speed is the real risk, not the end state.
Net job loss fears assume fixed work; the historical record says freed labor finds new problems.
“over the last 250 years we've increased productivity by 25 x equivalent to automating about 96% of someone's job... yet 250 years later there's more jobs than ever before And it's because we have no shortage of problems to solve as a society”Brendan Foody
Principle
Aggregation and horizontal capability beat deep vertical specialization when data shapes are cross-applicable
When data shapes transfer across verticals, the aggregator with a referral flywheel beats the niche specialist.
Mercor's 5M+ talent network can refer friends to find the marginal doctor cheaply, and its cross-applicable tooling flexes across law, medicine, etc. Foody concedes niche vendors retain some value but labs prefer one horizontally-capable partner over a hundred specialists.
If your tooling transfers across verticals, aggregate; the buyer's coordination cost is your wedge.
“the kind of data shapes that we would build for a lawyer are often very similar to the kinds of data shapes that we would build for a doctor... the labs tend to prefer partnering with a very horizontally capable vendor that is able to flex across all of the different verticals”Brendan Foody
Principle
Pricing optimizes for the decade-long market structure, not the next six months — leave no oxygen for competitors
Price for the decade-long competitive structure, not near-term margin extraction — high margins invite competition.
Foody concedes Mercor could raise prices ~30% with little demand impact and has demand to double overnight, but deliberately holds back: pricing is set to win the market over a decade and to avoid leaving oxygen for competitors.
Treat your margin as a competitive-entry signal; sometimes the right price is below what the market would bear.
“pricing is not merely a question of optimizing for the next six months. It's optimizing for a structure that wins the market over the next decade... make sure that we're not leaving oxygen in the market because high margins invite competition.”Brendan Foody