When NOT to Go Serverless: 5 Use Cases Where It’s the Wrong Choice
Serverless is one of the most transformative paradigms in cloud computing. AWS Lambda, API Gateway, DynamoDB, and the broader serverless ecosystem let you build and scale applications without ever thinking about servers. It’s fast to develop, often cheap to run, and incredibly powerful.
But here’s the thing: serverless is not a silver bullet.
After years of building on AWS, I’ve seen teams adopt serverless for everything — and sometimes pay the price in performance, cost, complexity, or all three. The best architects don’t just know when to use a tool. They know when not to.
In this article, I’ll walk you through five real-world use cases where going serverless is the wrong choice, and what you should consider instead.
1. Long-Running, Compute-Heavy Workloads
The Problem
AWS Lambda has a hard timeout of 15 minutes. If your workload takes longer than that, Lambda simply isn’t an option. But even workloads that fit within 15 minutes can be a poor match.
Think about:
Video transcoding or rendering
Large-scale data transformations (ETL)
Machine learning model training
Complex PDF generation from massive datasets
These tasks are CPU-intensive and often run for extended periods. Shoehorning them into Lambda means you’ll spend your time fighting the platform instead of leveraging it.
Why Serverless Fails Here
15-minute max execution time is a hard ceiling you can’t negotiate.
CPU is allocated proportionally to memory in Lambda. To get meaningful compute power, you might need to allocate 4–10 GB of memory — even if you only need 512 MB of RAM. You’re paying for memory you don’t use just to get CPU.
No GPU access. If your workload benefits from GPU acceleration, Lambda is out of the question.
Orchestrating many short Lambda functions to chunk the work adds significant complexity and potential failure points.
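The memory/CPU coupling has a direct cost consequence. Here is a rough sketch using the ~$0.0000166667 per GB-second rate quoted later in this article; the 10-minute job duration and memory figures are illustrative assumptions, not measurements:

```python
# Rough cost sketch of Lambda's CPU/memory coupling. The per-GB-second
# rate matches the illustrative us-east-1 rate used later in this
# article; job duration and memory sizes are assumptions.

GB_SECOND_PRICE = 0.0000166667  # USD per GB-second

def lambda_compute_cost(memory_gb: float, duration_s: float) -> float:
    """Compute-only cost of a single Lambda invocation."""
    return memory_gb * duration_s * GB_SECOND_PRICE

# A CPU-bound job that only needs 512 MB of RAM...
cost_at_actual_need = lambda_compute_cost(0.5, 600)   # 10-minute run
# ...but must be allocated 10 GB just to get meaningful vCPU share.
cost_for_cpu = lambda_compute_cost(10, 600)

print(f"${cost_at_actual_need:.4f} vs ${cost_for_cpu:.4f} per run "
      f"({cost_for_cpu / cost_at_actual_need:.0f}x)")
```

You pay 20x more per run, entirely for memory the workload never touches.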
What to Use Instead
AWS Fargate (ECS) — Containerized long-running tasks without managing servers
EC2 Spot Instances — Cost-effective heavy compute (batch jobs, rendering)
AWS Batch — Managed batch processing at scale
SageMaker — ML model training specifically
Step Functions + Fargate — Orchestrated pipelines with long-running steps
A Practical Example
Let’s say you’re processing uploaded video files. Instead of Lambda:
S3 Upload Event → EventBridge → Fargate Task (ECS)
Fargate gives you configurable vCPU and memory, no time limits, and you only pay for the duration of the task. You still get the “no servers to manage” benefit without Lambda’s constraints.
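To make the pipeline concrete, here is a minimal boto3-style sketch of the parameters an ecs.run_task call would take for one upload. The cluster, task definition, subnet, and container names are placeholder assumptions; in production you would typically let EventBridge invoke the ECS RunTask target directly rather than write glue code.

```python
# Sketch of the Fargate side of: S3 Upload -> EventBridge -> ECS Task.
# All resource names below are placeholder assumptions.
import json

def build_run_task_params(bucket: str, key: str) -> dict:
    """Build the kwargs for ecs.run_task(**params) for one uploaded file."""
    return {
        "cluster": "video-processing",          # assumed cluster name
        "taskDefinition": "transcode-task",     # assumed task definition
        "launchType": "FARGATE",
        "networkConfiguration": {
            "awsvpcConfiguration": {
                "subnets": ["subnet-0123456789abcdef0"],  # placeholder
                "assignPublicIp": "DISABLED",
            }
        },
        # Hand the uploaded object's location to the container.
        "overrides": {
            "containerOverrides": [{
                "name": "transcoder",           # assumed container name
                "environment": [
                    {"name": "INPUT_BUCKET", "value": bucket},
                    {"name": "INPUT_KEY", "value": key},
                ],
            }]
        },
    }

params = build_run_task_params("uploads-bucket", "raw/movie.mp4")
print(json.dumps(params, indent=2))
```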
2. High-Throughput, Latency-Sensitive Applications
The Problem
You’re building an application that needs to handle thousands of requests per second with consistent, single-digit millisecond response times. Think:
Real-time bidding platforms
High-frequency trading APIs
Multiplayer game backends
Real-time fraud detection at the edge
Why Serverless Fails Here
The biggest villain here is the cold start.
When Lambda spins up a new execution environment, there’s an initialization delay.
Python, Node.js: 100–300 ms
Java, .NET: 500 ms – 2+ seconds
For most applications, cold starts are a minor nuisance. For latency-critical systems, they’re a dealbreaker.
Yes, Provisioned Concurrency exists, and SnapStart helps with Java. But:
Provisioned Concurrency is essentially pre-warming containers — which defeats the core serverless value proposition. You’re paying for idle compute.
You still can’t guarantee tail latency (p99/p99.9) the way you can with a warm, long-running process.
At high throughput, the API Gateway overhead (typically 10–30 ms) adds up and is hard to eliminate.
What to Use Instead
ECS/EKS on Fargate or EC2 — Long-running containers with persistent connections and predictable latency.
EC2 with Auto Scaling — Maximum control over the runtime, networking, and performance tuning.
ElastiCache (Redis) — Pair with containers for ultra-low-latency data access.
The Key Takeaway
If your SLA says “p99 latency under 10 ms”, serverless will make that extremely difficult and expensive to achieve. Use always-on compute with connection pooling and warm caches.
3. Applications with Complex or Stateful Workflows (That Need Persistent Connections)
The Problem
Serverless functions are stateless and ephemeral by design. Each invocation is independent. There’s no shared memory, no local disk that persists, and no guarantee that the same instance handles consecutive requests.
This becomes painful when you need:
WebSocket connections with complex server-side state
Long-lived database connection pools
In-memory caching across requests
Sticky sessions
Why Serverless Fails Here
Database connections are the classic pain point. Every Lambda invocation may open a new connection to your relational database. Under load, you can easily exhaust your database’s connection limit.
100 concurrent Lambda invocations = 100 database connections
1,000 concurrent invocations = 💥 database connection limit exceeded

RDS Proxy helps by pooling connections, but it:
Adds cost
Adds latency (~5–15 ms per query overhead)
Is another component to configure and monitor
Doesn’t eliminate the fundamental mismatch — it just papers over it
In-memory state is another problem. If your application logic depends on keeping data in memory across requests (e.g., a game server tracking player positions, a collaborative editing backend), Lambda can’t help you. You’d have to externalize all state to DynamoDB, ElastiCache, or S3 — adding latency and complexity for every operation.
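The standard partial mitigation is to open the connection at module scope, outside the handler, so warm invocations of the same execution environment reuse it. Here is a sketch of that pattern, using sqlite3 as a stand-in for a real RDS client; the handler shape is illustrative:

```python
# The classic Lambda connection-reuse pattern: module-scope init runs
# once per execution environment (at cold start), and every warm
# invocation reuses it. sqlite3 stands in for a real RDS client.
import sqlite3

# Runs once per execution environment, not once per invocation.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE IF NOT EXISTS hits (n INTEGER)")

def handler(event, context):
    # Reuses the module-level connection on warm invocations.
    conn.execute("INSERT INTO hits VALUES (1)")
    (count,) = conn.execute("SELECT COUNT(*) FROM hits").fetchone()
    return {"invocations_on_this_environment": count}

# Two "warm" invocations share one connection.
print(handler({}, None))
print(handler({}, None))
```

Note what this does not fix: N concurrent execution environments still mean N connections to the database, which is why the pattern only delays the exhaustion problem rather than solving it.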
What to Use Instead
ECS/Fargate with long-running containers that maintain connection pools and in-memory state.
EC2 for maximum control over memory, connections, and runtime behavior.
ElastiCache if you need shared in-memory state across instances.
AppSync for managed GraphQL with real-time subscriptions (a serverless option that does handle WebSockets well, but for a specific use case).
When It’s a Judgment Call
If you’re building a standard REST API with a relational database, serverless can work — especially with RDS Proxy and moderate traffic. But if your application is connection-heavy or state-heavy, you’ll spend more time fighting the architecture than building features.
4. Predictable, Always-On, Steady-State Workloads
The Problem
Serverless pricing is brilliant for spiky, unpredictable, or low-traffic workloads. You pay per invocation, per millisecond of compute. When traffic is zero, your bill is zero.
But what about workloads that run 24/7 at a consistent, high volume?
A backend API serving millions of requests per day, evenly distributed
A stream processor consuming from Kinesis around the clock
A WebSocket server with thousands of persistent connections
Why Serverless Fails Here
Let’s do some back-of-the-envelope math.
Scenario: An API handling 50 million requests/month, average execution time of 200 ms, with 512 MB memory allocated.
Lambda Cost:
Compute: 50M × 0.2s × 0.5 GB = 5,000,000 GB-seconds
Cost: 5,000,000 × $0.0000166667 = ~$83.33
Requests: 50M × $0.20/million = ~$10.00
API Gateway: 50M × $1.00/million = ~$50.00
--------
Total: ~$143/month

Fargate Equivalent (2 tasks, 0.5 vCPU, 1 GB each):
vCPU: 2 × 0.5 × $0.04048/hr × 730 hrs = ~$29.55
Memory: 2 × 1 GB × $0.004445/hr × 730 hrs = ~$6.49
ALB: ~$16.20 + LCU charges (~$5-10) = ~$25.00
--------
Total: ~$61/month

That’s a ~57% cost reduction with Fargate — and the gap widens as traffic increases.
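Here is the same back-of-the-envelope comparison as code, so you can plug in your own numbers. The rates are the illustrative us-east-1 prices used above; check current AWS pricing before making a real decision.

```python
# The article's cost comparison, parameterized. Rates are the same
# illustrative us-east-1 prices used in the text above.

def lambda_monthly(requests_m: float, avg_s: float, mem_gb: float) -> float:
    """Monthly cost: Lambda compute + request fees + API Gateway (HTTP API)."""
    gb_seconds = requests_m * 1e6 * avg_s * mem_gb
    compute = gb_seconds * 0.0000166667      # per GB-second
    req_fees = requests_m * 0.20             # per million requests
    apigw = requests_m * 1.00                # HTTP API, per million
    return compute + req_fees + apigw

def fargate_monthly(tasks: int, vcpu: float, mem_gb: float,
                    alb: float = 25.0) -> float:
    """Monthly cost: Fargate vCPU + memory + an ALB estimate."""
    hours = 730
    cpu_cost = tasks * vcpu * 0.04048 * hours
    mem_cost = tasks * mem_gb * 0.004445 * hours
    return cpu_cost + mem_cost + alb

print(f"Lambda:  ${lambda_monthly(50, 0.2, 0.5):.2f}/month")   # ~$143
print(f"Fargate: ${fargate_monthly(2, 0.5, 1.0):.2f}/month")   # ~$61
```

Rerun it with your own traffic profile: the crossover point is what matters, not these specific numbers.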
The Rule of Thumb
Ask yourself:
“Will my compute utilization be consistently above 20-30%?”
If yes, reserved/always-on compute (Fargate, EC2 Reserved Instances, or Savings Plans) will almost always be cheaper.
What to Use Instead
Fargate with Auto Scaling — Steady containerized workloads with occasional spikes
EC2 Reserved Instances / Savings Plans — Predictable, long-term workloads at the best price
EKS — When you need Kubernetes orchestration at scale
The Hybrid Approach
The best architectures often mix serverless and containers:
Use Lambda for event-driven, asynchronous tasks (S3 triggers, SNS handlers, cron jobs).
Use Fargate/EC2 for the high-throughput, steady-state API layer.
You get the best of both worlds.
5. Complex Local Development and Debugging Requirements
The Problem
This one is less about production and more about developer experience — which matters more than many teams admit.
Serverless applications are inherently distributed. A single user action might involve:
API Gateway → Lambda → DynamoDB → DynamoDB Stream → Lambda → SNS → Lambda → SQS → Lambda
Try debugging that locally.
Why Serverless Fails Here
Local emulation is imperfect. Tools like SAM CLI and LocalStack are helpful but don’t perfectly replicate IAM permissions, event source mappings, service integrations, or edge cases.
“Console log” debugging is still the primary debugging strategy for many Lambda developers. Attaching a real debugger with breakpoints is possible but cumbersome.
Integration testing is hard. You often need a deployed AWS environment to truly test your application, which slows down the feedback loop.
Distributed tracing adds overhead. You’ll need X-Ray or third-party tools to understand request flows across services.
Each developer needs their own cloud sandbox. Unlike a containerized app, where you can run "docker-compose up" and have a full environment locally, serverless often requires deploying to AWS — leading to shared-environment conflicts or the cost of per-developer AWS accounts.
When This Matters Most
Large teams where developer productivity at scale is critical
Complex, multi-service architectures with many interconnected functions
Regulated industries where you need to reproduce and debug production issues quickly
Teams new to serverless with existing expertise in containers or traditional architectures
What to Use Instead
If developer experience is a priority and your team is more productive with containers:
Docker Compose + ECS/Fargate — Develop locally with Docker, deploy to Fargate. Nearly identical environments.
EKS with Skaffold or Telepresence — Hot-reload Kubernetes development.
A Balanced View
To be fair, the serverless developer experience is improving rapidly. Tools like SST (Serverless Stack), AWS SAM Accelerate, and Lambda Powertools have made development significantly better. If you’re starting a greenfield project with a small team, serverless DX is often good enough.
But for large, complex systems — especially if your team has deep container expertise — forcing a serverless-first approach can meaningfully slow development velocity.
So... When SHOULD You Go Serverless?
After all that, let’s end on a positive note. Serverless is the right choice when:
Your traffic is spiky, unpredictable, or low-volume
You want to minimize operational overhead (no patching, no capacity planning)
You’re building event-driven architectures (S3 triggers, message processing, webhooks)
You need to ship fast with a small team
Your workloads are short-lived (seconds, not minutes)
You want automatic scaling from zero to massive without configuration
You’re building microservices, APIs, or background jobs with moderate latency requirements
The Bottom Line
The best cloud architects aren’t loyal to any single paradigm. They pick the right tool for the job.
Serverless is incredible — I use it extensively and recommend it often. But blindly applying it to every problem leads to over-engineered, expensive, and frustrating architectures.
Before going serverless, ask yourself:
Does my workload fit within Lambda’s execution limits?
Can I tolerate cold start latency?
Is my application fundamentally stateless?
Is my traffic pattern spiky or variable?
Can my team debug and develop effectively in a serverless model?
If the answer to any of these is “no”, consider containers, EC2, or a hybrid approach. There’s no shame in using a server. After all, serverless still runs on them.