How to Build a Lead Scoring Model That Actually Predicts Revenue
A lead scoring model is a systematic method of ranking prospects by their likelihood to become paying customers. It combines demographic, firmographic, behavioral, and intent signals into numerical scores that prioritize sales follow-up. Effective lead scoring separates the 5-15% of leads that are genuinely sales-ready from the 85-95% that need more nurturing or don't fit at all. Without it, sales teams waste time on leads that were never going to close.
Most B2B companies that implement lead scoring end up with a system that nobody trusts. Sales ignores it. Marketing defends it. The scores don't correlate with actual revenue. This guide covers how to build a scoring model that actually works, starting with your own closed-won data rather than arbitrary point assignments.
Why Most Lead Scoring Models Fail
The default approach to lead scoring looks like this: someone in marketing or RevOps sits in a conference room and assigns point values based on gut instinct.
- Downloaded a whitepaper? +10 points.
- VP title? +15 points.
- Company has 500+ employees? +20 points.
- Visited the pricing page? +25 points.
The problem: these numbers are made up. Nobody validated whether whitepaper downloads actually correlate with closed-won deals in your specific business. And in most cases, they don't.
The Five Failure Modes
- Arbitrary point values. The most common failure. Points are assigned based on intuition rather than data. A "pricing page visit" gets 25 points because it feels important, but analysis might show that demo request form views are 3x more predictive of conversion.
- Activity bias. Marketing-heavy scoring systems reward engagement volume over quality. A researcher who downloads 10 resources and attends 3 webinars scores higher than a decision-maker who visits the pricing page once and fills out a demo form. The second lead is 5x more likely to close.
- No decay logic. A lead who visited your pricing page 8 months ago still has those 25 points. Their situation has changed. They may have already bought from a competitor. Without decay, your scoring model is a museum of outdated signals.
- Threshold confusion. "MQL" means a lead hit 50 points. But what does 50 points actually mean? Is it the same as a lead with 80 points? Without validated thresholds tied to conversion rates, the score is just a number.
- Set-and-forget. The model is built once and never updated. But your product changes, your ICP evolves, and market conditions shift. A scoring model needs quarterly review at minimum.
The Data-Driven Approach: Start With Closed-Won Analysis
Instead of guessing, start with what you know: which leads actually became customers.
Step 1: Export Your Historical Data
Pull the following from your CRM (HubSpot, Salesforce, or whatever you use):
- All closed-won deals from the past 12-18 months (minimum 50 deals, ideally 200+)
- All closed-lost deals from the same period
- All leads that never converted to opportunity
For each record, you need:
- Company firmographics (industry, employee count, revenue, location)
- Contact demographics (title, seniority, department)
- Behavioral data (page visits, content downloads, email engagement)
- Source data (how they entered your funnel)
- Deal data (deal size, sales cycle length, win/loss reason)
Step 2: Identify Patterns in Closed-Won Deals
Now analyze. You're looking for attributes that are statistically overrepresented in closed-won deals compared to closed-lost or no-opportunity leads.
Firmographic patterns to check:
| Attribute | Question to Answer |
|---|---|
| Industry | Which 3-5 industries produce the most revenue? |
| Employee count | What's the sweet spot range? |
| Revenue range | Is there a minimum company size for viable deals? |
| Geography | Do certain regions convert better? |
| Technology | Do companies using specific tools convert more? |
| Growth stage | Do high-growth companies close faster? |
Demographic patterns to check:
| Attribute | Question to Answer |
|---|---|
| Job title | Which titles appear most in closed-won? |
| Seniority level | Do directors close more than VPs, or vice versa? |
| Department | Which departments are your real buyers? |
| Title keywords | Are there title patterns ("ops," "growth," "revenue")? |
Behavioral patterns to check:
| Attribute | Question to Answer |
|---|---|
| First touch source | Which channels produce the highest-converting leads? |
| Content consumed | Which specific pages or resources correlate with closing? |
| Engagement recency | How recent was their last engagement before converting? |
| Engagement velocity | How quickly did they engage with multiple touchpoints? |
| Specific page visits | Do pricing page, case study, or integration page visits predict conversion? |
Step 3: Calculate Conversion Lift
For each attribute, calculate the conversion lift - how much more likely a lead with that attribute is to become a customer compared to the baseline.
Formula:
```
Conversion Lift = (Conversion rate with attribute) / (Overall baseline conversion rate)
```
Example:
If your overall lead-to-customer conversion rate is 2%, and leads from the SaaS industry convert at 6%, SaaS industry has a 3x lift.
Here's what a real conversion lift analysis might look like:
| Attribute | Segment | Conversion Rate | Lift vs. Baseline |
|---|---|---|---|
| Industry | SaaS | 6.2% | 3.1x |
| Industry | Fintech | 4.8% | 2.4x |
| Industry | Healthcare | 1.1% | 0.55x |
| Employee Count | 50-200 | 5.1% | 2.55x |
| Employee Count | 201-1000 | 4.3% | 2.15x |
| Employee Count | 1-49 | 0.8% | 0.4x |
| Title | VP/Director | 7.2% | 3.6x |
| Title | Manager | 3.1% | 1.55x |
| Title | Individual Contributor | 0.9% | 0.45x |
| Source | Demo Request | 18.5% | 9.25x |
| Source | Content Download | 1.8% | 0.9x |
| Source | Webinar | 2.4% | 1.2x |
| Behavior | Pricing Page Visit | 8.3% | 4.15x |
| Behavior | Case Study View | 5.7% | 2.85x |
| Behavior | 3+ Page Views in 7 Days | 6.1% | 3.05x |
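The lift calculation itself is simple enough to run on a raw CRM export. Here is a minimal sketch in plain Python; the record structure and field names (`"converted"`, `"industry"`) are assumptions for illustration, not a real CRM export format:

```python
def conversion_lift(leads, attribute, value):
    """Lift = segment conversion rate / overall baseline conversion rate.

    `leads` is a list of dicts with a boolean "converted" field plus
    whatever attributes you exported (field names here are illustrative).
    """
    baseline = sum(l["converted"] for l in leads) / len(leads)
    segment = [l for l in leads if l.get(attribute) == value]
    if not segment or baseline == 0:
        return None  # not enough data to compute a meaningful lift
    segment_rate = sum(l["converted"] for l in segment) / len(segment)
    return segment_rate / baseline


# Toy example: 100 leads, 3% baseline; SaaS segment converts at 6%
leads = (
    [{"industry": "SaaS", "converted": True}] * 3
    + [{"industry": "SaaS", "converted": False}] * 47
    + [{"industry": "Other", "converted": False}] * 50
)
saas_lift = conversion_lift(leads, "industry", "SaaS")  # 6% / 3% = 2.0x
```

Run this per attribute-value pair and sort by lift to produce a table like the one above.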
Step 4: Assign Points Based on Lift
Now assign points proportional to the conversion lift. This ensures your point values reflect actual predictive power, not gut feeling.
Recommended approach:
- Attributes with 3x+ lift: 15-25 points
- Attributes with 2-3x lift: 10-15 points
- Attributes with 1-2x lift: 5-10 points
- Attributes with <1x lift: 0 points (or negative points)
- Strong negative signals: -10 to -20 points
The exact scale doesn't matter as much as the relative proportions. What matters is that a 3x lift attribute gets roughly 3x the points of a 1x lift attribute.
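One way to encode those bands is a simple mapping function. The specific point values below are the rounded midpoints of the recommended ranges, a choice for illustration rather than a rule:

```python
def points_from_lift(lift, strong_negative=False):
    """Map a conversion lift to score points using rounded midpoints
    of the recommended bands above. Exact values are illustrative."""
    if strong_negative:
        return -15          # midpoint of the -10 to -20 range
    if lift is None or lift < 1.0:
        return 0            # no lift, no points (or flag as negative)
    if lift >= 3.0:
        return 20           # 15-25 point band
    if lift >= 2.0:
        return 12           # 10-15 point band
    return 7                # 5-10 point band
```

So the 3.1x SaaS lift from the table would earn 20 points, while the 0.9x content-download lift earns none.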
The Three Scoring Dimensions
A complete lead scoring model evaluates three independent dimensions. Each should be scored separately and then combined.
Dimension 1: Fit Score (Firmographic + Demographic)
The fit score answers: "Is this the type of company and person we sell to?"
This is relatively static - it doesn't change based on behavior. It's set when the lead enters your system and updated only when firmographic data changes.
Components:
- Industry alignment (0-25 points)
- Company size match (0-20 points)
- Revenue range (0-15 points)
- Geography (0-10 points)
- Technology fit (0-15 points)
- Title/seniority match (0-25 points)
- Department alignment (0-15 points)
Total possible fit score: ~125 points
Classify into tiers:
- A (Ideal fit): 90-125 points
- B (Good fit): 60-89 points
- C (Marginal fit): 30-59 points
- D (Poor fit): 0-29 points
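A sketch of the fit score and tiering, assuming hypothetical component names; the per-component caps mirror the ranges listed above:

```python
# Hypothetical component names; caps match the point ranges above.
FIT_COMPONENT_CAPS = {
    "industry": 25, "company_size": 20, "revenue_range": 15,
    "geography": 10, "technology": 15, "title_seniority": 25,
    "department": 15,
}

def fit_score(component_points):
    """Sum component points, capping each at its listed maximum."""
    return sum(
        min(component_points.get(name, 0), cap)
        for name, cap in FIT_COMPONENT_CAPS.items()
    )

def fit_tier(score):
    if score >= 90: return "A"  # Ideal fit
    if score >= 60: return "B"  # Good fit
    if score >= 30: return "C"  # Marginal fit
    return "D"                  # Poor fit
```

For example, a lead scoring 25 on industry, 25 on title/seniority, and 20 on company size lands at 70, a B-tier fit.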
Dimension 2: Engagement Score (Behavioral)
The engagement score answers: "Is this lead actively interested right now?"
This changes constantly and is where decay logic becomes critical.
Components:
- High-intent page visits: pricing, demo, comparison pages (15-25 points each)
- Content engagement: blog, resource downloads (5-10 points each)
- Email engagement: opens and clicks (3-5 points each)
- Event attendance: webinars, demos (10-20 points each)
- Form submissions: demo requests, contact forms (25-40 points each)
- Return visits: coming back after absence (10-15 points)
Decay rules:
- Page visit points decay 50% after 14 days, 100% after 30 days
- Content download points decay 50% after 30 days, 100% after 60 days
- Form submission points decay 50% after 30 days, 100% after 90 days
- Email engagement points decay 50% after 7 days, 100% after 21 days
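The decay rules above amount to a two-step function per event type: full points inside the first window, half until expiry, then zero. A minimal sketch (event type names are illustrative):

```python
from datetime import date

# (half-points days, expiry days) per event type, from the rules above
DECAY_WINDOWS = {
    "page_visit": (14, 30),
    "content_download": (30, 60),
    "form_submission": (30, 90),
    "email_engagement": (7, 21),
}

def decayed_points(event_type, base_points, event_date, today):
    """Step decay: 100% of points, then 50%, then 0 as the event ages."""
    half_days, expiry_days = DECAY_WINDOWS[event_type]
    age = (today - event_date).days
    if age >= expiry_days:
        return 0
    if age >= half_days:
        return base_points * 0.5
    return base_points
```

A pricing page visit worth 25 points is worth 12.5 two weeks later and nothing after a month, which matches the intuition that stale signals shouldn't keep a lead above threshold.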
Dimension 3: Intent Score (Third-Party Signals)
The intent score answers: "Is this company actively researching solutions like ours?"
This uses data from outside your own properties:
- Bombora/G2 intent data: Company is researching your category (20-30 points)
- Job posting signals: Hiring for roles that suggest they need your solution (15-25 points)
- Technographic changes: Adopted or dropped a related tool (10-20 points)
- Review site activity: Visited G2/Capterra listings in your category (15-25 points)
- Competitor engagement: Engaging with competitor content (10-15 points)
Implementing Lead Scoring in HubSpot
HubSpot supports both manual and predictive lead scoring. Here's how to implement the data-driven model we've described.
Using HubSpot Score Properties
- Create a custom score property for each dimension:
  - Fit Score (Score property type)
  - Engagement Score (Score property type)
  - Intent Score (Score property type)
  - Combined Lead Score (Calculated property)
- Configure Fit Score rules in the score property settings:
  - Add positive rules for each firmographic/demographic attribute
  - Set point values based on your conversion lift analysis
  - Add negative rules for disqualifying attributes (competitors, students, etc.)
- Configure Engagement Score rules using behavioral triggers:
  - Page view rules (use URL contains for key pages)
  - Form submission rules
  - Email click rules
  - Meeting booked rules
- Set up decay using workflows:
  - Create a workflow that triggers on a schedule (weekly)
  - Use branching logic to check when the last engagement occurred
  - Reduce engagement score by a set amount based on inactivity
HubSpot Predictive Lead Scoring
HubSpot's built-in predictive scoring (available in Enterprise) uses machine learning on your historical data. It's worth enabling as a complement to your manual model, not a replacement.
When HubSpot predictive scoring works well:
- You have 500+ closed-won and closed-lost deals
- Your CRM data is clean and consistently filled
- Your sales process is standardized
When it falls short:
- Small sample sizes (under 100 deals)
- Inconsistent CRM hygiene
- Complex, multi-touch sales processes where the CRM doesn't capture the full picture
Using Workflows for Lead Routing
Once scoring is in place, build routing workflows:
```
IF Fit Score >= 90 AND Engagement Score >= 50:
    → Route to AE immediately (Sales-Ready)

IF Fit Score >= 90 AND Engagement Score 25-49:
    → Route to SDR for outreach (High-Fit, Warming)

IF Fit Score >= 60 AND Engagement Score >= 75:
    → Route to SDR (Good-Fit, High-Interest)

IF Fit Score >= 60 AND Engagement Score < 25:
    → Add to nurture sequence (Good-Fit, Not Ready)

IF Fit Score < 60:
    → Marketing nurture only (regardless of engagement)
```
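If you ever need the same branching outside HubSpot (for instance, in a middleware layer), the logic translates directly. Destination labels here are illustrative, and the final fallback covers the good-fit, mid-engagement case the branches above leave unspecified:

```python
def route(fit_score, engagement_score):
    """Mirror of the routing branches above. Labels are illustrative."""
    if fit_score >= 90 and engagement_score >= 50:
        return "AE"                  # Sales-Ready
    if fit_score >= 90 and 25 <= engagement_score <= 49:
        return "SDR outreach"        # High-Fit, Warming
    if fit_score >= 60 and engagement_score >= 75:
        return "SDR"                 # Good-Fit, High-Interest
    if fit_score >= 60 and engagement_score < 25:
        return "Nurture sequence"    # Good-Fit, Not Ready
    if fit_score < 60:
        return "Marketing nurture"   # regardless of engagement
    return "Nurture sequence"        # good fit, mid engagement (assumed default)
```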
Setting Thresholds: The MQL Debate
The term "MQL" (Marketing Qualified Lead) has become loaded. Many organizations are moving away from MQL as a concept and toward "hand-raise" and "signal-based" qualification. Regardless of what you call it, you need thresholds that determine when a lead gets human attention.
How to Set Initial Thresholds
- Score your historical closed-won deals retroactively. Apply your scoring model to all deals from the past 12 months and see what score they had at the point they were first contacted by sales.
- Find the natural breakpoints. Typically, you'll see a clear separation between leads that closed and leads that didn't when plotted by score.
- Set your threshold where the conversion rate meaningfully exceeds baseline. If leads scoring above 70 convert at 8% and leads below 70 convert at 1.5%, 70 is a reasonable initial threshold.
- Validate with your sales team. Show them 20 leads above and 20 leads below the threshold. Ask: "Would you want to work these?" If sales agrees the above-threshold leads are worth their time, you're in the right zone.
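Finding the natural breakpoint is easiest if you bucket historical leads into score bands and compare conversion rates. A minimal sketch, assuming your back-scored data is a list of (score, converted) pairs:

```python
def conversion_by_band(scored_leads, band_size=10):
    """Group (score, converted) pairs into score bands and return the
    conversion rate per band, so the breakpoint is visible at a glance."""
    bands = {}
    for score, converted in scored_leads:
        band = (score // band_size) * band_size
        total, wins = bands.get(band, (0, 0))
        bands[band] = (total + 1, wins + int(bool(converted)))
    return {band: wins / total for band, (total, wins) in sorted(bands.items())}


# Toy data: 70-79 band converts at 8%, 40-49 band at 2%
data = [(75, True)] * 2 + [(75, False)] * 23 + [(45, True)] + [(45, False)] * 49
rates = conversion_by_band(data)
```

If one band's rate jumps well above the baseline, that band's lower edge is a reasonable initial threshold.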
Threshold Calibration
Your first thresholds will be wrong. Plan to adjust:
- Month 1-2: Set thresholds, track which scored leads actually convert
- Month 3: First calibration - adjust thresholds based on actual conversion data
- Quarterly: Review and adjust based on the most recent quarter's data
- Annually: Full model rebuild incorporating new conversion lift analysis
Iteration and Maintenance
A lead scoring model is not a project. It's a system that requires ongoing maintenance.
Monthly Reviews
- Check conversion rates by score band (are high-scoring leads actually converting better?)
- Review the volume of leads at each threshold (too many MQLs means the threshold is too low)
- Look at sales feedback on lead quality
Quarterly Deep Dives
- Re-run conversion lift analysis on the latest data
- Adjust point values for attributes whose predictive power has changed
- Add new attributes that have emerged as predictive
- Remove attributes that no longer correlate with conversion
- Adjust decay rates based on actual engagement-to-conversion timing
Annual Model Rebuild
- Full reanalysis from scratch
- Your ICP may have shifted
- New products or pricing changes affect which leads convert
- Market conditions change which signals matter
Advanced: Predictive Lead Scoring With Machine Learning
For companies with enough data (500+ closed-won deals), machine learning models can outperform manual scoring by discovering non-obvious patterns and interactions between variables.
When to Consider ML-Based Scoring
- You have 500+ historical deals with complete data
- Your manual scoring model has plateaued in accuracy
- You have engineering resources to build and maintain a model
- Your sales cycle is complex with many influencing variables
Common ML Approaches
- Logistic Regression - The simplest and often most effective approach. Outputs a probability of conversion for each lead. Easy to interpret.
- Random Forest - Handles non-linear relationships and variable interactions. More accurate but harder to explain.
- Gradient Boosting (XGBoost/LightGBM) - Highest accuracy in most cases. But it's a black box, which can create organizational resistance.
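To make the logistic regression option concrete, here is a deliberately tiny from-scratch sketch (stochastic gradient descent on log-loss, no ML library). In practice you would use scikit-learn or similar; the feature flags below are invented for illustration:

```python
import math

def train_logistic(X, y, lr=0.1, epochs=500):
    """Minimal logistic regression via stochastic gradient descent.
    Illustrative only; use a proper ML library in production."""
    weights = [0.0] * len(X[0])
    bias = 0.0
    for _ in range(epochs):
        for features, label in zip(X, y):
            z = sum(w * x for w, x in zip(weights, features)) + bias
            prob = 1 / (1 + math.exp(-z))
            error = prob - label  # gradient of log-loss w.r.t. z
            weights = [w - lr * error * x for w, x in zip(weights, features)]
            bias -= lr * error
    return weights, bias

def predict_proba(weights, bias, features):
    """Probability of conversion for one lead's feature vector."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 / (1 + math.exp(-z))


# Invented binary features: [visited_pricing_page, vp_or_above]
X = [[1, 1], [1, 0], [0, 1], [0, 0], [1, 1], [0, 0]]
y = [1, 1, 0, 0, 1, 0]
w, b = train_logistic(X, y)
```

The learned weights are directly inspectable, which is exactly why logistic regression is the easiest model to defend to a skeptical sales team.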
The Practical Path
Most B2B companies don't need custom ML models. The combination of:
- Data-driven manual scoring (as described in this guide)
- HubSpot/Salesforce Einstein predictive scoring
- Regular calibration and iteration
...will get you 80-90% of the accuracy of a custom ML model at 10% of the cost and complexity.
Lead Scoring Model Scorecard
Use this scorecard to evaluate your current model or plan a new one:
| Dimension | Question | Score (1-5) |
|---|---|---|
| Data Foundation | Are point values based on conversion lift analysis? | |
| Fit Scoring | Do you score firmographic and demographic fit separately? | |
| Engagement Scoring | Do you track behavioral engagement with decay? | |
| Intent Signals | Do you incorporate third-party intent data? | |
| Thresholds | Are MQL/SQL thresholds validated against conversion data? | |
| Routing | Do scores trigger automated routing workflows? | |
| Sales Alignment | Does your sales team trust and use the scores? | |
| Iteration | Do you review and adjust the model at least quarterly? | |
| Coverage | Are you scoring all leads, not just those with complete data? | |
| Negative Scoring | Do you deduct points for disqualifying signals? | |
- 35-50: Strong model. Focus on optimization.
- 20-34: Functional model with gaps. Address weakest areas.
- Below 20: Model needs a fundamental rebuild using this guide.
FAQ
How many data points do I need to build a reliable lead scoring model?
You need a minimum of 50 closed-won deals and 50 closed-lost deals to identify meaningful patterns, though 200+ of each is ideal. Below 50 deals, your sample size is too small for conversion lift analysis to be statistically reliable. If you're a newer company with limited deal history, start with a simpler model using just 3-5 key attributes and plan to rebuild once you have more data.
Should I use HubSpot's built-in predictive lead scoring or build my own?
Use both. HubSpot's predictive scoring (Enterprise tier) is a useful baseline that updates automatically as your data grows. But it's a black box - you can't see or adjust the weights. Build your own manual model alongside it so you understand what drives the scores, can explain them to sales, and can adjust for factors the algorithm might miss. When the two models disagree on a lead, investigate why.
How often should I recalibrate my lead scoring model?
Monthly light reviews (check conversion rates by score band), quarterly deep dives (re-run conversion lift analysis, adjust points), and annual full rebuilds. The biggest mistake is building a model once and leaving it for a year. Markets shift, your product changes, and yesterday's strong signal becomes noise. At GTME, we recalibrate client scoring models quarterly and see an average 15-25% improvement in MQL-to-SQL conversion rate after each recalibration.
What's the difference between lead scoring and lead grading?
Lead scoring typically combines fit and engagement into a single number. Lead grading separates them - usually a letter grade for fit (A-D) and a number for engagement. Grading is actually the better approach because it lets you distinguish between "perfect fit, not engaged yet" (A3) and "poor fit, very engaged" (D1). Both need sales attention, but for very different reasons. We recommend the grading approach (separate fit and engagement dimensions) as described in this guide.
How do I get my sales team to actually use lead scores?
Three tactics that work: First, involve sales in building the model - show them the conversion lift data and get their input on which attributes matter. Second, prove accuracy by back-testing on recent deals they remember ("this deal that closed last month would have scored 95 in the new model"). Third, start by using scores for routing rather than qualification - route highest-scoring leads to top performers, which creates natural buy-in when those reps start closing more.