You don't have a data problem. You have a question problem.
Most ML projects fail before a single model is trained. Not because the data was bad or the algorithm was wrong — because nobody stopped to check whether they were solving the right problem.
ML EngineeringSeptember 202512 min read
A few years into doing this work, a stakeholder walked into a project kickoff with a deck already built. Slide three had the solution on it: a recommendation engine. The business problem — buried somewhere in the appendix — was that sales conversion from their product catalogue had dropped 18% over two quarters. The recommendation engine was going to fix it.
Nobody in the room asked why.
We spent six weeks on data pipelines, embedding models, and similarity scoring before someone finally pulled the CRM data and noticed that the drop was concentrated in a single customer segment — mid-market accounts that had recently been reassigned to a new sales team. The problem wasn't product discovery. It was onboarding. A recommendation engine was not going to move that number by a single percentage point, and we had just spent six weeks proving it the expensive way.
That experience changed how I start every ML project. Not with the data. Not with the model. With a much more uncomfortable question: is the thing we are being asked to build actually connected to the outcome we are being asked to improve?
The solution arrives before the problem is understood
This is the most common failure mode in applied ML, and it is so pervasive that most teams have stopped noticing it. A business stakeholder reads about a technology — a large language model, a churn prediction system, a computer vision pipeline — and arrives at the project with a solution in mind. The ML team's job, as it gets handed to them, is to build it.
The business problem is treated as the context for the build, not the thing the build is supposed to solve. And those are not the same thing.
What makes this particularly difficult is that the stakeholder is not being irrational. They have read the case studies. They have seen what these systems can do at other companies. The technology is real, the capability is real, and the desire to apply it is understandable. The failure is not in the enthusiasm. It is in the skipped step — the step where you ask whether the causal chain between the proposed solution and the stated problem actually holds.
The question is never "can we build this?" The question is "if we build this perfectly, does the number we care about actually move?"
When you cannot answer that question with confidence before the project starts, you are not doing machine learning. You are doing expensive prototyping in the direction of a hypothesis that nobody has verified.
Why the business problem and the ML problem are not the same thing
Every ML project involves two distinct translations, and both of them can go wrong independently. The first is translating a business outcome into a well-posed problem. The second is translating that well-posed problem into an ML task. Most teams are reasonably careful about the second translation. Almost no teams are careful enough about the first.
A business outcome is something like: reduce customer churn, increase operational throughput, catch defective units before they ship, shorten the time it takes a new employee to become productive. These are real, measurable outcomes with financial weight behind them. They are also not ML problems. They are business problems, and the path from business problem to ML problem requires a causal model — an explicit claim about which mechanism, if modified, would produce the desired outcome.
That causal model is almost never made explicit. It is assumed, usually by the person who proposed the solution, and then the assumption gets carried forward into the project like a piece of undocumented debt. Everything built on top of it is technically sound. The foundation is a guess.
The Two Translations — Where ML Projects Actually Break
What teams skip
Business outcome → well-posed problem
The causal claim. If we intervene on X, does Y move? This is a business question, not an ML question — and it must be answered before any data is touched.
What is the actual mechanism driving the outcome?
Does the proposed intervention sit on that mechanism?
Is there evidence — even weak evidence — that this link exists?
Who in the business can falsify this assumption?
What teams focus on
Well-posed problem → ML task
The technical translation. Classification or regression? Supervised or unsupervised? What is the label? What is the feature space? Teams are thorough here — on top of a foundation they never verified.
Feature engineering and availability at inference time
Evaluation metric selection and baseline definition
The uncomfortable truth is that the second translation — the one teams are careful about — is largely recoverable. A wrong task framing can be corrected in the third week of a project. A wrong causal assumption, discovered in the third week, usually means starting over.
What a bad question looks like
After enough projects, the signs of a poorly-framed question become recognisable before any data is pulled. These are not technical red flags. They show up in the language used in the first meeting, in the way the problem gets handed over, in what happens when you ask why.
Question framing failures — what they sound like and what they produce
The solution is named before the problem
When a stakeholder opens with "we need an LLM" or "we want to build a recommendation engine," the question has already been pre-answered. The ML team is being asked to justify a conclusion rather than reach one. The actual problem — if it surfaces at all — gets retrofitted to the solution someone already decided on.
The metric and the outcome are not the same thing
A team is asked to maximise click-through rate. The business wants to increase revenue. These are not the same objective, and optimising one can actively harm the other. When the metric being handed to the ML team is a proxy — and proxies always are — the distance between the proxy and the actual outcome needs to be interrogated explicitly. It almost never is.
"More data" is the proposed solution to a bad model
When a model underperforms and the first instinct is to collect more data, it usually means the question being asked of the model is wrong. More data answers the question you are already asking — it does not tell you whether that is the right question. A model trained on ten million examples of the wrong problem is not better than a model trained on ten thousand examples of the right one.
The baseline is not defined
If you cannot describe what the business does today — without ML, without any model, with the simplest possible rule — then you do not understand the problem well enough to solve it. The baseline is not a benchmark for the model. It is the check on whether ML is the right tool at all. Sometimes a lookup table and two business rules outperform a fine-tuned transformer. The project should discover that early, not at deployment.
Nobody can say what "good" looks like
If you ask the stakeholder "how will you know, six months after deployment, whether this worked?" and they cannot answer — not vaguely, not with a feeling, but with a specific number that moves in a specific direction — the problem definition is not finished. You are being asked to build something with no agreed-upon definition of success. That is not an ML problem. It is an unresolved business conversation that has been handed to the wrong team.
None of these require a bad-faith stakeholder to produce. They are the natural output of organisations that have learned to be enthusiastic about ML faster than they have learned to think carefully about what they want it to do. The patterns are structural, not personal — which means they will repeat on every project until someone builds a different habit at the front of the process.
The question audit — before any data is touched
The fix is not a longer discovery phase. It is a specific set of questions, asked early, that force the causal assumptions into the open where they can be examined. These are not questions the ML team can answer alone — they require the people who own the business outcome to be in the room. That is the point. The moment these questions cannot be answered without a business owner present is exactly the moment you know you were about to build the wrong thing.