Projects

AWS Failure Case Lab

A Terraform-provisioned cloud environment for simulating real-world AWS infrastructure failures and practicing structured incident response across distributed systems. The lab reproduces production-style outages spanning the networking, IAM, compute, and storage layers, each requiring systematic investigation using logs, metrics, and command-line diagnostics.

Built to develop operational readiness through controlled fault injection, hypothesis-driven troubleshooting, and documented root cause analysis. Each scenario is intentionally introduced, detected through monitoring signals, resolved with validated remediation steps, and formalized in a postmortem-style incident report to mirror real SRE and DevOps workflows.

Architecture: Terraform (infrastructure as code + failure injection controls) • Amazon VPC (subnets, route tables, security groups) • EC2 (application layer) • RDS (database tier) • IAM (least-privilege access control) • S3 (object storage) • CloudWatch (logs, metrics, alarms, dashboards) • AWS CLI (live diagnostics and verification)
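
As an illustration of the hypothesis-driven troubleshooting described above, the sketch below tests one hypothesis for a networking-layer scenario: "the database security group no longer admits the application subnet." It works on a plain dictionary shaped like the JSON that `aws ec2 describe-security-groups` returns for a single group. The group IDs, CIDR ranges, and port 5432 are hypothetical values chosen for the example, not details of the lab itself, and the check only considers CIDR-based rules.

```python
import ipaddress


def allows_ingress(security_group: dict, port: int, source_ip: str,
                   protocol: str = "tcp") -> bool:
    """Return True if any CIDR-based ingress rule admits source_ip on port.

    Simplification: rules that reference other security groups
    (UserIdGroupPairs) and IPv6 ranges are ignored in this sketch.
    """
    src = ipaddress.ip_address(source_ip)
    for perm in security_group.get("IpPermissions", []):
        rule_protocol = perm.get("IpProtocol")
        if rule_protocol == "-1":
            # "-1" means all protocols and all ports.
            port_matches = True
        elif rule_protocol == protocol:
            port_matches = perm.get("FromPort", 0) <= port <= perm.get("ToPort", 65535)
        else:
            continue
        if not port_matches:
            continue
        for ip_range in perm.get("IpRanges", []):
            network = ipaddress.ip_network(ip_range["CidrIp"], strict=False)
            if src in network:
                return True
    return False


# Hypothetical "before" and "after fault injection" states of one group.
healthy_group = {
    "GroupId": "sg-db-example",
    "IpPermissions": [
        {
            "IpProtocol": "tcp",
            "FromPort": 5432,
            "ToPort": 5432,
            "IpRanges": [{"CidrIp": "10.0.1.0/24"}],
        }
    ],
}
broken_group = {"GroupId": "sg-db-example", "IpPermissions": []}

if __name__ == "__main__":
    for label, group in (("healthy", healthy_group), ("broken", broken_group)):
        verdict = allows_ingress(group, 5432, "10.0.1.15")
        print(f"{label}: app host can reach database port -> {verdict}")
```

A check like this confirms or rules out a single hypothesis quickly; the remediation and the written root cause still come from comparing the live state against the Terraform-defined state.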

Kubernetes-Based Scalable Application Platform

A containerized backend application deployed to a local Kubernetes cluster to simulate the migration from a single-server deployment to a scalable, self-healing environment. The project demonstrates replica management, rolling updates, liveness and readiness probes, and horizontal pod autoscaling to support high availability and controlled, zero-downtime deployments.

Built to showcase practical Kubernetes workload management, configuration externalization using ConfigMaps, and operational resilience within a local cluster setup.

Architecture: Docker (containerization) • Kubernetes (Deployment, Service, HPA, PodDisruptionBudget) • ConfigMaps (configuration management) • kubectl (cluster operations) • kind (local cluster)
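
To make the horizontal pod autoscaling behavior concrete, here is a minimal sketch of the replica calculation documented for the Kubernetes HorizontalPodAutoscaler: desired replicas = ceil(current replicas × current metric / target metric), skipped when the ratio is within a tolerance of 1.0 (0.1 by default) and clamped to the configured bounds. The function name, the replica bounds, and the sample numbers are illustrative assumptions, and the real controller also handles details this sketch omits, such as unready pods and stabilization windows.

```python
import math


def desired_replicas(current_replicas: int, current_metric: float,
                     target_metric: float, min_replicas: int = 1,
                     max_replicas: int = 10, tolerance: float = 0.1) -> int:
    """Approximate the HPA scaling decision for a single metric."""
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    ratio = current_metric / target_metric
    if abs(ratio - 1.0) <= tolerance:
        # Close enough to the target: keep the current replica count.
        desired = current_replicas
    else:
        desired = math.ceil(current_replicas * ratio)
    # Clamp to the bounds configured on the HPA object.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # 3 pods averaging 90% CPU against a 50% target: scale out.
    print(desired_replicas(3, 90, 50))
    # 4 pods averaging 25% CPU against a 50% target: scale in.
    print(desired_replicas(4, 25, 50))
```

The tolerance band is what keeps the replica count from flapping when load hovers near the target.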

Serverless Security Log Detection and Alerting Pipeline

A fully serverless, event-driven monitoring pipeline designed to ingest, process, and analyze security and application logs in near real time. AWS Lambda handles log preprocessing and detection logic, watching for error conditions and suspicious activity and triggering automated alerts through Amazon SNS.

Built to support operational visibility, incident response workflows, and cloud-native monitoring best practices.

Architecture: Amazon S3 (log storage) • Kinesis Data Firehose (streaming ingestion) • AWS Lambda (log preprocessing + detection logic) • Amazon SNS (alerting) • Amazon CloudWatch (monitoring) • GitHub Actions + Terraform (CI/CD + IaC)
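
The sketch below shows one way the Lambda stage could look, assuming the function is attached to Firehose as a data-transformation Lambda (the project may wire it differently, for example through S3 event notifications). It uses the Firehose transformation contract: each incoming record carries a `recordId` and base64-encoded `data`, and the response returns every `recordId` with a `result` status. The marker strings, the `ALERT_TOPIC_ARN` environment variable, and the function names are hypothetical.

```python
import base64

# Hypothetical detection markers; a real rule set would be more specific.
SUSPICIOUS_MARKERS = ("ERROR", "AccessDenied", "UnauthorizedOperation")


def detect(message: str) -> bool:
    """Flag a log line that contains any of the configured markers."""
    return any(marker in message for marker in SUSPICIOUS_MARKERS)


def process_records(event: dict, publish) -> dict:
    """Scan Firehose records, alert on matches, and pass data through unchanged.

    `publish` is any callable that accepts the alert text, which keeps the
    detection logic testable without AWS credentials.
    """
    output = []
    for record in event["records"]:
        line = base64.b64decode(record["data"]).decode("utf-8")
        if detect(line):
            publish(line.strip())
        output.append({
            "recordId": record["recordId"],
            "result": "Ok",          # deliver the record to its destination
            "data": record["data"],  # unchanged, still base64-encoded
        })
    return {"records": output}


def lambda_handler(event, context):
    """Lambda entry point: wires process_records to an SNS topic."""
    import os
    import boto3  # available in the Lambda Python runtime

    sns = boto3.client("sns")
    topic_arn = os.environ["ALERT_TOPIC_ARN"]  # hypothetical variable name
    return process_records(
        event,
        lambda message: sns.publish(TopicArn=topic_arn, Message=message),
    )
```

Separating `process_records` from the handler means the detection rules can be exercised locally with a list's `append` method standing in for SNS.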