I wish posts like this would explore the relative savings rather than the absolute. On its own, that saving doesn't tell me much; taken to the extreme, you could just not run the service at all and save all the time. That's a tongue-in-cheek example, but in context: is this saving a big deal, or is it just engineering looking for small efficiencies to justify their time?
I'm the author of the post. You raise a good point about relative savings. Based on last week's data, our change reduced the task time by 40ms from an average of 3440ms, and this task runs 11 million times daily. This translates to a saving of about 1% on compute.
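For anyone who wants to check the arithmetic, here's a quick back-of-envelope using only the numbers above (assuming the 40ms saving applies uniformly across all runs):

    # All inputs are from the figures above.
    saved_ms_per_run = 40
    avg_ms_per_run = 3_440
    runs_per_day = 11_000_000

    relative_saving = saved_ms_per_run / avg_ms_per_run          # ~1.16%
    saved_s_per_day = saved_ms_per_run * runs_per_day / 1_000    # 440,000 s
    saved_days_per_day = saved_s_per_day / 86_400                # ~5.1 compute-days/day

    print(f"{relative_saving:.2%}, {saved_s_per_day:,.0f} s/day, "
          f"{saved_days_per_day:.1f} compute-days per day")

So the ~1% relative figure and an absolute figure of roughly 5 days of compute saved per day both follow from the same three inputs.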
> This translates to a saving of about 1% on compute.
Does this translate to any tangible savings? I'm not sure what the Checkly backend looks like, but if tasks run on a cluster of hosts rather than being invoked per-task, it seems hard to realize savings. Even per-task, 40 ms can only be realized on a service like Lambda that bills per millisecond; ECS's minimum billing unit is 1 second, afaik.
I think that's a flawed analysis. If you're running FaaS, then sure, you can fail to see benefit from small improvements in duration (AWS Lambda changed its billing resolution from 100 ms to 1 ms a few years back; before that, the faster Go services didn't save much money despite being quicker). But if you're running thousands of requests and speeding them all up, you should be able to realize tangible compute savings whatever your platform.
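To make the billing-granularity point concrete, here's a toy sketch (the round-up model is a simplification of per-invocation billing; the before/after durations are the author's numbers from upthread):

    import math

    def billed_ms(duration_ms: float, granularity_ms: float) -> float:
        """Round one invocation's duration up to the billing granularity."""
        return math.ceil(duration_ms / granularity_ms) * granularity_ms

    before, after = 3440, 3400  # ms, per the author's numbers

    # Per-millisecond billing (Lambda since late 2020): the 40 ms is billable.
    print(billed_ms(before, 1) - billed_ms(after, 1))        # -> 40

    # Per-second billing: both durations round up to 4000 ms, so the
    # shaved 40 ms never reaches the bill for any single invocation.
    print(billed_ms(before, 1000) - billed_ms(after, 1000))  # -> 0

On a shared autoscaled fleet the mechanism is different: the saving isn't realized per invocation but as freed capacity across all runs, which lets the autoscaler hold fewer instances.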
Help me understand, then. If this is being done on an autoscaling cluster, I can see it, but if you're just running everything on an always-on box, for instance, it's less clear to me.
edit: Do you have an affiliation with the blog? I ask because you have submitted several articles from Checkly in the past.
Hey, Checkly founder here. We've changed our infra quite a bit over the last ~1 year, but it's still mostly ephemeral compute. We actually started on AWS Lambda; we're now on a mix of AWS EC2 and EKS, all autoscaled per region (we run 20+ regions).
It seems tiny, but in aggregate this will have an impact on our COGS. You're correct that if we had a fixed fleet of instances, the impact would not have been very interesting.
But still, for a couple of hours spent, this saves us a few thousand dollars per year.
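As a purely illustrative back-of-envelope (the on-demand rate and the one-full-vCPU-per-task assumption are placeholders, not our actual COGS):

    # Hypothetical pricing; only the 40 ms x 11M/day figures come from the post.
    saved_s_per_day = 0.040 * 11_000_000                  # 440,000 compute-seconds/day
    vcpu_hours_per_year = saved_s_per_day / 3_600 * 365   # ~44,600 vCPU-hours
    assumed_usd_per_vcpu_hour = 0.05                      # ballpark on-demand rate
    print(f"${vcpu_hours_per_year * assumed_usd_per_vcpu_hour:,.0f} per year")
    # -> roughly $2,200/year if each task pins a full vCPU; scale to taste

That lands in the low thousands of dollars, consistent with the ballpark above.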
The units seem off in any case: 40 ms saved across 11 million daily runs is about 440,000 seconds, i.e. roughly 5 days of compute saved per day, which is actually much more impressive.
If we think about the business impact, we don't usually think of compute expenditure per day, so you might reasonably say the fix saves around five years of compute annually. Looks better in your promotion packet, too.
I often ask myself the same question. We have some user-facing queries that slow the frontend down. I've fixed some of the slowness, but it's definitely not a priority. I wonder how much speed improvements correlate with increased revenue from happier customers.
Hey, I work at Checkly and asked my coworker (who wrote the post) to give some more background on this. I can assure you, we're busy and this was not done for some vanity prize!
Not a problem, but the OP was asking about the savings!
I, for example, like to dig deeper into insights like relative vs. absolute savings to learn the approaches other engineers take! It's all about which metrics we should care about.
(I'll put this service on my list to try someday; it looks fantastic indeed.)