MicroBee API Team
Reading Time :
10 Minute
Live in-play betting now represents 60–70% of total sportsbook handle globally. In this environment, the speed at which odds updates reach the operator’s platform directly determines financial exposure, player experience, and competitive positioning.
When a goal is scored in a football match, the odds must update within milliseconds across every connected operator. Any delay creates a window where the displayed odds do not reflect reality — a window that costs operators money through accepted bets at stale prices and costs players through frustrated experiences with suspended or voided bets.
This guide explains the technical architecture behind low-latency sports data delivery, provides benchmarks for evaluating data feed providers, and details the specific engineering decisions that separate sub-second feeds from multi-second ones.
Understanding Latency in Live Betting
Latency in sports data feeds has three components, each contributing to the total delay between a real-world event and the odds update reaching the player’s screen:
1. Event Detection Latency — The time between a real-world event occurring (a goal, a point, a foul) and the data provider registering it. This depends on the provider’s scouting network (in-stadium scouts vs broadcast feeds) and can range from 1–8 seconds depending on the method.
2. Processing and Odds Calculation Latency — The time to recalculate odds based on the new event. Advanced providers use pre-computed probability trees that update in under 10ms. Less sophisticated systems recalculate from scratch, taking 100–500ms.
3. Delivery Latency — The time to transmit the updated odds from the provider’s servers to the operator’s platform. This is the component most directly affected by API architecture decisions.
Total end-to-end latency (event to player screen) typically ranges from 2–12 seconds across the industry. The delivery component — the part controlled by API architecture — can be reduced to under 50ms with proper engineering.
Architecture Decisions That Determine Latency
WebSocket vs REST Polling
The most significant architectural decision is the connection model. REST API polling (the traditional approach) requires the operator’s system to repeatedly request updates at fixed intervals. Even at aggressive polling rates (every 200ms), this introduces up to 200ms of inherent delay plus round-trip network time.
WebSocket connections maintain a persistent, bidirectional channel between provider and operator. Updates are pushed the instant they are available, with zero polling overhead. The latency floor drops from 200ms+ (REST) to the pure network transit time (typically 5–30ms within the same continent).
MicroBee’s data feeds use WebSocket delivery for all live in-play data, ensuring operators receive updates at the physical minimum latency their network connection allows.
Edge Node Distribution
Global operators need data delivered quickly to servers in multiple regions. A centralised data centre in London serving an operator in Singapore adds 150–200ms of network latency per update. Edge node distribution replicates the odds calculation engine to regional points of presence, reducing transit time to 10–40ms for operators in any major market.
Delta Compression
Transmitting a complete market snapshot for every update wastes bandwidth and increases parsing time. Delta compression sends only the changed fields (e.g., “match 12345, market home_win, odds 1.85 → 1.72”) rather than the full market object. This reduces message size by 80–90% and cuts parsing time proportionally.
Latency Benchmarks: What to Expect
Metric | Industry Average | Good | MicroBee Target |
Odds delivery (WebSocket) | 100–300ms | 50–100ms | < 50ms |
Odds delivery (REST poll) | 500ms–2s | 200–500ms | N/A (WebSocket only) |
Market suspension time | 2–5 seconds | < 2 seconds | < 1 second |
Settlement processing | 5–30 seconds | 2–5 seconds | < 2 seconds |
Concurrent live events | 5,000–10,000 | 15,000–25,000 | 30,000+ |
Message throughput | 10K msg/sec | 50K msg/sec | 100K+ msg/sec |
The Financial Impact of Latency
Latency is not just a technical metric — it has direct financial consequences:
• Arbitrage exposure — When odds are stale by even 500ms, sophisticated bettors can identify and exploit the price discrepancy before the market corrects. On high-volume football matches, this can cost operators $5,000–$20,000 per match in adverse selections.
• Market suspension frequency — Operators with slow data feeds must suspend markets more aggressively during fast-moving events, reducing the window for players to bet. Each unnecessary suspension represents lost handle (and therefore lost revenue).
• Player experience and churn — Players who consistently experience delayed odds, suspended markets, or voided bets will migrate to competitors with faster platforms. In live betting, perceived speed is a primary driver of platform preference.
• Regulatory risk — Some jurisdictions require operators to demonstrate that displayed odds reflect current market conditions. Persistent latency issues can trigger regulatory scrutiny and potential licence conditions.
How to Test a Provider’s Latency Claims
Provider marketing materials universally claim “real-time” or “sub-second” delivery. To verify these claims, operators should conduct independent testing:
• Request a sandbox with WebSocket access and measure round-trip times during live events, not just test data.
• Test during peak load — Saturday afternoon football or major tennis tournaments — not Tuesday morning when nothing is live.
• Measure P95 and P99 latency, not just average. A provider with 50ms average but 2-second P99 will spike during the moments when speed matters most.
• Compare timestamp accuracy: log when the provider’s message says the odds changed versus when your system received it. Providers who embed event timestamps enable precise latency measurement.
• Run tests from your production server locations, not from your office on a fast connection.
Frequently Asked Questions
Question | Answer |
What latency is acceptable for live in-play betting? | Sub-second delivery latency is the minimum competitive standard. Total end-to-end latency (event to player screen) should target under 5 seconds. |
Does WebSocket require more server resources than REST? | WebSocket connections use slightly more memory per connection but dramatically less CPU and bandwidth due to eliminated polling overhead. The net resource impact is positive. |
How does latency affect pre-match betting? | Pre-match odds change slowly, so delivery latency is largely irrelevant. The focus for pre-match is data completeness and market depth, not speed. |
Can operators reduce latency beyond what the provider delivers? | Yes. Co-locating servers near the provider’s edge nodes, using direct peering connections, and optimising internal processing pipelines can shave additional milliseconds. |
What causes latency spikes during major events? | Traffic volume, insufficient server capacity at the provider, and network congestion during peak usage. Providers with auto-scaling infrastructure handle spikes better than those with fixed capacity. |

