Data Marketing: How B2B Companies Turn Proprietary Data Into a Growth Engine
What Is Data Marketing?
In 2016, I was running marketing at a freight tech company most people couldn’t pick out of a lineup. No brand equity. No paid budget to speak of. What we did have was a lot of data about what freight actually costs around the world — data nobody else was publishing. So we published it.
That accident became a strategy. That strategy has a name: data marketing.
Here’s the thing about data marketing most definitions miss: it’s not about using data to optimize your campaigns. That’s data-driven marketing — a different animal. Data marketing means your data IS the campaign. The content asset. The PR hook. The credibility signal that brings people back. You’re not publishing opinions dressed up as insight. You’re publishing evidence.
The Freightos Baltic Index started as a Google Sheet that I manually updated with container shipping rates. Today it’s the global benchmark for ocean freight pricing, cited by the Wall Street Journal, the New York Times, BBC Radio, and dozens of industry publications. It didn’t start with a grand vision. It started with data nobody else bothered to make public.
Data marketing, in plain English: using data your company already owns — transactional, behavioral, market signals — as the primary content asset. Not as decoration. Not as a chart in a blog post. As the load-bearing structure that everything else rests on.
The keywords that matter here: data marketing strategy, B2B data marketing, and understanding what data marketing actually is versus the data-driven marketing that most people confuse it with.
Why Data Marketing Works (And Why Now)
Here’s the thing nobody says out loud: most B2B content is opinion dressed up as insight. Yours. Mine. Everyone’s.
The problem is, opinion is now free. ChatGPT can generate 2,000 words of plausible-sounding B2B marketing advice faster than you finish your coffee. It will cite research. It will use bullet points. It will sound exactly like a thought leader who has definitely done the thing they’re describing.
Data is different. Data is the one thing an LLM can’t fabricate without getting caught. That’s not a small distinction. That’s the entire game right now.
Three things converged to make this the right moment for data marketing specifically:
1. The AI Content Flood Commoditized Opinion. Every B2B company on the planet is now publishing AI-generated blog posts about the same topics with the same frameworks. The volume of content-shaped oatmeal has exploded. Opinion-based thought leadership is being produced at scale, which means it no longer differentiates. I wrote about this in detail in How to Win Attention When the Internet is Drowning in Slop — the thesis is simple: when everyone can create content, the only unfair advantage is content that nobody else can create.
2. Trust Collapse Makes Proprietary Data More Credible. Trust in media, brands, and thought leaders is declining fast. Audiences are increasingly skeptical of claims without evidence. Proprietary data — the kind that comes from your own transactions, your own users, your own market position — carries inherent credibility because it’s verifiable and unique. You can’t fake transaction data at scale.
3. AI Search Rewards Authoritative, Unique Data. The new AI search engines — ChatGPT, Perplexity, Claude, Gemini, AI Overviews — are trained to prioritize authoritative, unique sources. If you’re the only company publishing a particular data signal, you become the default citation. I’ve covered the mechanics of how to write content that LLMs trust, and proprietary data checks almost every box.
The “Data Moat” concept is straightforward: your proprietary data is the one thing a competitor cannot replicate or summarize. A blog post about “5 trends in fintech” can be rewritten by anyone. A monthly index of what 30,000 companies are actually spending on AI tools? That’s a moat.
The Three-Tier Data Marketing Framework
The first time I realized data could be a marketing engine, I thought it was a fluke. The second time, I started to see a pattern. After watching Redfin, Brex, Ramp, and Profound do it at scale — and doing it ourselves at Freightos for the better part of a decade — I’m convinced every data marketing play goes through exactly three tiers. Not because I made them up. Because I kept watching companies move through them in sequence.
Tier 1: The Number No One Else Has
This is where it starts. You identify a data signal that your company uniquely owns — transaction volumes, pricing trends, behavioral patterns, operational metrics — and you publish it. Not in a gated whitepaper. Not behind a form. You publish it openly, consistently, and in a format that’s easy to cite.
At Freightos, this was the container shipping rate index. For Redfin, it was housing tour request data. For Brex, it was aggregate corporate spend across AI vendors. The raw signal is often boring in isolation. The magic is in making it public.
Tier 2: The Number Becomes a Habit
This is where most companies stall. Tier 1 gets you attention. Tier 2 gets you dependency. The difference is cadence. When you publish data on a regular schedule — weekly, monthly, quarterly — your audience starts to expect it. Journalists save your email. Analysts bookmark your page. Your data becomes part of their workflow.
The Freightos Baltic Index went from a curiosity to a dependency when we started publishing weekly. Bloomberg built it into their terminal. Reuters referenced it in their shipping coverage. That didn’t happen because the data got better. It happened because the data got predictable.
Tier 3: The Data Becomes the Product
This is the end game. Your data asset becomes so valuable that it generates revenue directly — through API access, paid subscriptions, licensing deals, or as a standalone product. Not every company will reach Tier 3. But every company should know it’s there.
I’ve laid out the complete framework — with examples, anti-patterns, and the specific mechanics of each tier — in Marketing Alchemy: Turning Your Data Into Customers. If this page is the overview, that post is the playbook.
The Hard Signal Protocol: How to Find Your Data Asset
The most common objection I get is “we don’t have any interesting data.”
I’ve heard this from logistics companies sitting on a decade of shipment records. From SaaS companies with behavioral data on 100,000 users. From a fintech with transaction patterns across an entire industry vertical.
They all had data. They’d just never thought about it as a publishing asset.
The Hard Signal Protocol is a 120-minute workshop I built to fix exactly that — a structured way to audit what you actually own before concluding you have nothing worth publishing.
The framework has four steps:
- Asset Audit — Catalog every data source your company touches: transaction logs, customer behavior, market signals, operational metrics
- Context & Intent Map — Filter for signals that carry purchase intent. What does this data tell your buyer about their own problem?
- Vehicle Selection — Choose the right format: report, index, newsletter, API, benchmark
- Distribution Mapping — Build the media layer: PR, LinkedIn, email, partnerships, syndication
Read the full Hard Signal Protocol →
Data Marketing Teardowns: How Top Companies Do It
Theory is useful. Watching it actually work — in real companies, with real distribution constraints and real data assets — is more useful.
I’ve been running teardowns of data marketing engines for a while now. Not because these companies are perfect (Brex’s distribution has a gap a mile wide, and I’ll tell you exactly what it is) but because there’s something specific and transferable in each one. Here’s the short version of four I keep coming back to.
Redfin became the housing market’s narrator — not by having the most data, but by publishing theirs first. Four times faster than government sources. Their “nowcasting” advantage turned upstream signals (tour requests, bidding patterns) into the kind of data Bloomberg and the WSJ reference weekly.
Brex transformed 30,000+ companies’ expense reports into a monthly Benchmark — a cultural barometer for where AI and SaaS spending is actually flowing. The raw data is genuinely unremarkable. The editorial packaging is what makes it worth reading.
Ramp took “spend exhaust” — what 25,000+ companies’ credit cards say about where business is flowing — and built a monthly market intelligence report with its own brand. The framing of “spend exhaust” is beautiful positioning that reframes boring transaction data as signal.
Profound built a public AI search visibility index that makes their product’s value proposition immediately, viscerally obvious. The most persuasive thing you can show a prospect is their own problem, reflected back at them in data.
How to Build Your Data Marketing Engine (The Short Version)
Okay. If you’ve made it here, you’ve earned the checklist. But I’d strongly encourage you to read the full frameworks before treating this as a recipe — context changes how you apply each step.
Here’s the short version:
- Audit what data you actually own. Transaction logs, search behavior, customer outcomes, operational metrics. You almost certainly have more than you think.
- Filter for signals with purchase intent. What does this data tell your buyer about their own problem? If the answer is nothing, keep looking.
- Establish a publication cadence. Frequency creates dependency. Monthly is a good starting point. Weekly is better if your data refreshes that fast.
- Choose your distribution vehicle. Report, index, newsletter, API — the format depends on your audience’s consumption habits and your operational capacity.
- Build a media layer. PR, LinkedIn, email, partnerships. Great data that never reaches anyone is the most common failure mode.
Common Data Marketing Mistakes
I’ve made most of these. Consider this a partial confession.
Relying on averages instead of directional signals. Averages are boring. Trends are interesting. “The average shipping cost is $2,400” is a fact. “Shipping costs dropped 14% month-over-month — the steepest decline since 2019” is a story.
Waiting for perfect data before publishing. The first version of the Freightos Baltic Index had gaps you could drive a container ship through. We published it anyway. Perfection is the enemy of cadence, and cadence is what builds dependency.
Using data as proof of product quality. This is the most common mistake. Data marketing isn’t about proving your product works. It’s about providing market intelligence that your buyers find independently valuable. The moment your data report reads like a sales pitch, you’ve lost.
No distribution strategy. I’ve seen teams build beautiful data assets and then publish them on a blog that gets 200 visits a month. The data doesn’t travel on its own. You need a media layer — PR, executive amplification, email, partnerships — or your data asset is a tree falling in an empty forest.
Publishing once and calling it a strategy. A single data report is a campaign. A recurring data publication is a strategy. The difference is dependency. Your audience should notice when you don’t publish.
Data Marketing Resources
Everything I’ve written about data marketing, organized by type:
Frameworks
- Marketing Alchemy: Turning Your Data Into Customers — The complete three-tier framework
- The Hard Signal Protocol — 120-minute workshop for finding your data asset
Teardowns
- Ramp’s Data Marketing Teardown
- Redfin’s Data Marketing Teardown
- Brex’s Data Marketing Teardown
- Profound’s Data Marketing Teardown
Related
If any of this sparked something — good. The teardowns are where it gets specific: four companies, four data engines, and what you can actually steal from each one.
Or if you’d rather just talk through what your data marketing engine could look like, book 45 minutes. I promise not to pitch anything. I’m not a consultant. I’m just a CMO who’s done this enough times to have opinions.