Skip to main content

Command Palette

Search for a command to run...

Learning AWS : Cloud Foundations & Resilience by Design

Exploring the ‘why’ behind the cloud, not just the ‘what'

Published
3 min read
Learning AWS : Cloud Foundations & Resilience by Design

This is the first in a series where I’m learning cloud computing in public. I’m taking the AWS Skill Builder course and documenting my progress, not just what I’m learning, but how I’m thinking through it and what it might mean in the real world.

If you're new to the cloud or brushing up after a break, this is a beginner-first space, no jargon, just real concepts.


What I’ve Learned So Far

In this early phase, I focused on three core ideas that form the backbone of cloud computing:

  1. Global infrastructure

  2. Shared responsibility

  3. Designing for things to break (fault tolerance)

Let’s break these down with everyday examples.


1. Global Infrastructure

Cloud platforms aren’t one giant computer in a secret basement somewhere. Instead, they’re more like a network of hubs around the world.

Imagine a delivery company. They don’t ship every package from one country. They have distribution centers in different cities and countries, so your parcel gets to you faster and more reliably.

In cloud terms:

  • A “region” is like a country with several warehouses (data centers).

  • Each “availability zone” is a separate warehouse inside that country.

These warehouses are close enough to work together but far enough apart that if one catches fire (or has a power cut), the others keep delivering.

Takeaway:
If you're building something important, you shouldn't run it from just one building. Use more than one, so if one breaks, your thing still works.


2. Shared Responsibility

Using the cloud is like renting an apartment.
The landlord (cloud provider) handles the walls, plumbing, power, and security at the gate. But you are in charge of your room.

  • If you leave your door unlocked, that’s on you.

  • If you leave food out and get ants, that's also on you.

  • If the roof leaks, now that’s on the landlord.

This is the idea of shared responsibility.
The cloud gives you the space, but it’s up to you to keep it safe and in order.

Takeaway:
Just because your app or site is “in the cloud” doesn’t mean it’s automatically secure. You still have to manage who has access, how data is handled, and what happens inside your setup.


3. Designing for Failure (a.k.a. Fault Tolerance)

Things break. The question is: does everything fall apart when they do?

Imagine running a smoothie shop with just one blender.
If that blender breaks, you’re done for the day.

Now imagine having three blenders. If one stops, you still have two. You might be a bit slower, but you’re still serving smoothies.

That’s fault tolerance, setting things up so failure doesn’t mean shutdown.

Takeaway:
In the cloud, this means planning for failure before it happens. Build things so they can continue working even when a piece goes down.


Questions I'm Still Thinking About

Here are a few questions that came up for me; maybe they'll get you thinking too:

  • Can I launch a server in two regions at once, and would that even make sense?

  • What actually happens if a whole region goes down? Does my app just disappear?

  • If I have two servers in different zones but one database, what happens if the database’s zone goes offline or burns?

  • Where should I keep my data so it's safe and available?

The next topic I’ll be learning is Compute in the Cloud, so stay tuned.

If you're learning too, feel free to follow along or share what tripped you up in the beginning.

A
Anik Sikder10mo ago

Love how you’re breaking down core cloud concepts in such an accessible way! The delivery hub analogy for global infrastructure really clicked for me, it’s a perfect way to explain why multi-region and multi-AZ setups matter.

Shared responsibility is something so many beginners overlook, so highlighting the “landlord vs. tenant” comparison is gold. It’s easy to assume “cloud = secure,” but it’s really about partnership between provider and user.

And designing for failure - yes! Building with fault tolerance from day one is the difference between downtime nightmares and smooth user experiences.

Your questions are spot on and exactly what I’ve been thinking about too. Curious to see how you tackle “region down” scenarios in your next post! Keep these coming, learning in public like this is so valuable for everyone diving into cloud fundamentals. 🚀☁️

1
R

Wow, that's such a detailed feedback. Love that you are able to relate to the regions analogy. Thank you 😊

1

AWS Cloud Essentials

Part 4 of 4

This series breaks down AWS from the ground up—simple, practical, and thought-provoking. Follow as I learn to design cloud systems that don’t just work, but survive failure.

Start from the beginning

AWS Networking Simplified Guide

A Beginner’s Guide to VPC, CIDR, Route 53, and CloudFront