DevOps & Site Reliability Engineer focused on building reliable, scalable, and secure cloud-native platforms across AWS, GCP, and Azure.
- Current role:
Software Engineer 2 @ DeepSource - Experience:
4.8+ yearsin DevOps/SRE - Focus:
Kubernetes,Observability,IaC,CI/CD,Distributed Systems
- Designing and operating production-grade cloud infrastructure
- Building end-to-end observability with OpenTelemetry, Prometheus, Grafana, Tempo, and Jaeger
- Automating platform operations using Python, Go, Terraform, and GitOps workflows
- Improving reliability with SLO-driven engineering, incident reduction, and disaster recovery planning
- Maintained
99.9%uptime across30+production Kubernetes clusters - Reduced infrastructure/cloud cost by up to
30%through autoscaling and rightsizing - Improved deployment speed by
35%with optimized CI/CD and GitOps pipelines - Reduced MTTR by
35%with unified observability and faster issue detection - Improved failover readiness with
50%faster recovery for critical components
Python Go Bash Kubernetes Docker Helm Terraform OpenTofu
GitHub Actions Jenkins ArgoCD GitOps Prometheus Grafana OpenTelemetry Tempo
FluentBit Jaeger Istio KEDA Karpenter AWS GCP Azure
- Built an end-to-end observability pipeline for AI and static code analysis workloads.
- Drove infra cost optimization with dynamic autoscaling and workload rightsizing.
- Established Terraform-based IaC across multiple environments.
- Led SRE initiatives across highly available production Kubernetes clusters.
- Built automations and Kubernetes controllers in Python/Go.
- Implemented service mesh, canary/blue-green delivery, and observability platform upgrades.
- Operated distributed systems including Redis, Kafka, PostgreSQL, MongoDB, and ZooKeeper.
- Managed AWS EKS workloads for reliability and performance.
University of Delhi
Bachelor of Computer Science (2018 β 2021)
GPA: 8.2
- Email:
solanki7feb@gmail.com - LinkedIn: linkedin.com/in/rupin-solanki-437648203
- GitHub: github.com/rupinsolanki07



