Alice is impatient

TL;DR

An engineer from AWS discusses how human impatience influences perception of service latency, emphasizing the importance of tail latency in user experience. The explanation clarifies why users often feel services are slow even when metrics suggest otherwise.

An engineer at Amazon Web Services has publicly discussed how human impatience influences perceptions of service speed and recovery times, shedding light on the discrepancy between technical metrics and user experience. This explanation matters because it highlights the importance of understanding tail latency and how users experience outages or delays, which can differ significantly from traditional measurements.

The engineer, identified as Marc Brooker, explains that humans measure time in seconds and minutes, which leads to a skewed perception of latency and outage durations. He illustrates this with examples showing that while technical metrics like mean request time or mean time to recovery (MTTR) may appear low, users often experience much longer delays due to the heavy tail of latency distributions. Brooker discusses the inspection paradox, which causes users to experience longer waits because they are more likely to encounter longer requests or outages.

He uses a log-normal distribution model to demonstrate how median and 99th percentile latencies translate into much longer perceived times for users. For example, a median recovery time of 30 minutes could translate into a user experience of around 6 hours, emphasizing the significance of tail latency. Brooker notes that this discrepancy is often overlooked in service design, where trimmed means or average metrics can mask the impact of rare but long delays.

He further emphasizes that recovery times are especially critical because they cannot be hidden by timeout-and-retry mechanisms, making tail latency a vital factor for user satisfaction and trust in services. The discussion aims to improve understanding of how long tail events affect real-world experiences, beyond what traditional metrics reveal.

Implications of Human Perception on Service Metrics

This explanation is significant because it underscores why tail latency and long recovery times matter deeply to user experience and service reliability. Even if technical metrics suggest acceptable performance, users may still perceive services as slow or unreliable due to the heavy tail in latency distributions. Recognizing this helps service providers prioritize efforts to reduce long tail delays, ultimately improving user satisfaction and trust.

Java Performance Engineering: Profiling and Optimizing Enterprise Applications

As an affiliate, we earn on qualifying purchases.

Understanding the Impact of Tail Latency on User Experience

Marc Brooker’s insights build on established concepts like the inspection paradox, which explains why users tend to experience longer delays than average metrics suggest. This awareness has been discussed in technical circles, but Brooker’s explanation emphasizes the human perspective, highlighting the importance of considering tail latency in designing and measuring service performance. His discussion aligns with ongoing industry efforts to better understand and mitigate long tail delays, especially in cloud services and distributed systems.

“When you have a long request or a long outage, people experience it as much longer than the average suggests.”

— Marc Brooker

The Art of Application Performance Testing: Help for Programmers and Quality Assurance

Used Book in Good Condition

As an affiliate, we earn on qualifying purchases.

Unclear Extent of User Perception Variability

It remains unclear how widespread the impact of tail latency perception is across different types of services and user bases. Specific quantifications of user experience discrepancies in real-world settings are still being studied, and the exact thresholds at which delays become perceived as unacceptable vary among users.

Klein Tools VDV526-200 Cable Tester, LAN Scout Jr. 2 Ethernet Tester for CAT 5e, CAT 6/6A Cables with RJ45 Connections

VERSATILE CABLE TESTING: Cable tester for data (RJ45) terminated cables and patch cords, ensuring comprehensive testing capabilities

As an affiliate, we earn on qualifying purchases.

Next Steps for Service Performance Measurement

Industry experts are expected to increase focus on measuring and mitigating tail latency, especially in critical services. Future research may develop better models to predict user-perceived delays and improve service architectures to reduce long tail events, enhancing overall user satisfaction.

Amazon

server response time analyzers

As an affiliate, we earn on qualifying purchases.

Key Questions

Why do users perceive services as slower than metrics suggest?

Because human perception is heavily influenced by long tail delays, which are more likely to be experienced during longer requests or outages, skewing perceived speed.

What is tail latency, and why is it important?

Tail latency refers to the longer delays in a distribution of response times, often at the 99th percentile or higher. It is important because it heavily impacts user experience, even if average response times are low.

How can service providers reduce perceived delays?

By focusing on reducing tail latency through architectural improvements, better load balancing, and faster recovery mechanisms, they can improve perceived performance.

Does this explanation apply to all types of services?

While the principles are broadly applicable, the impact varies depending on the service and user expectations. Critical services with high reliability demands are most affected by tail latency.

What should service teams do next?

They should incorporate tail latency metrics into their performance monitoring and prioritize reducing long delays to improve user perception and satisfaction.

Source: Hacker News

Up next

Epoll vs. Io_uring in Linux

Author

Tech Trend Trove Team

Share article

Implications of Human Perception on Service Metrics

Java Performance Engineering: Profiling and Optimizing Enterprise Applications

Understanding the Impact of Tail Latency on User Experience

The Art of Application Performance Testing: Help for Programmers and Quality Assurance

Unclear Extent of User Perception Variability

Klein Tools VDV526-200 Cable Tester, LAN Scout Jr. 2 Ethernet Tester for CAT 5e, CAT 6/6A Cables with RJ45 Connections

Next Steps for Service Performance Measurement

server response time analyzers

Key Questions

Why do users perceive services as slower than metrics suggest?

What is tail latency, and why is it important?

How can service providers reduce perceived delays?

Does this explanation apply to all types of services?

What should service teams do next?

Ask HN: Will programmers write more efficient code during the memory shortage?

Elixir v1.20 released: now a gradually typed language

Anchor. The Schwarz Group model.

AMD pulls a bait-and-switch on Linux users with Vivado licensing changes

Best retro controllers 2026: perfect pads for your classic consoles and games

I Gave an AI a Civilization to Run. It Built a Nuke – Launching CivBench

FDA advisors unanimously vote to approve Moderna’s mRNA after agency drama

Cannes Ad Festival Puts OpenAI’s Projections in Spotlight

Alice is impatient

Up next

Author

Tech Trend Trove Team

Share article

Implications of Human Perception on Service Metrics

Java Performance Engineering: Profiling and Optimizing Enterprise Applications

Understanding the Impact of Tail Latency on User Experience

The Art of Application Performance Testing: Help for Programmers and Quality Assurance

Unclear Extent of User Perception Variability

Klein Tools VDV526-200 Cable Tester, LAN Scout Jr. 2 Ethernet Tester for CAT 5e, CAT 6/6A Cables with RJ45 Connections

Next Steps for Service Performance Measurement

server response time analyzers

Key Questions

Why do users perceive services as slower than metrics suggest?

What is tail latency, and why is it important?

How can service providers reduce perceived delays?

Does this explanation apply to all types of services?

What should service teams do next?

You May Also Like