DevOps Daily with Fexingo: CI/CD, Kubernetes, and Modern Software Operations cover art

DevOps Daily with Fexingo: CI/CD, Kubernetes, and Modern Software Operations

DevOps Daily with Fexingo: CI/CD, Kubernetes, and Modern Software Operations

By: Fexingo
Listen for free

Lucas and Luna dissect the daily realities of DevOps, from CI/CD pipeline design to Kubernetes cluster management and the human systems that keep software running. Each episode grounds abstract principles in real incidents—a failed deployment at a major retailer, a postmortem from a cloud outage, a configuration drift disaster—and traces the operational decisions that turned them around. Lucas brings the technical precision of a working engineer, while Luna pushes on the team dynamics, cost trade-offs, and organizational bottlenecks that separate resilient operations from fragile ones. They discuss monitoring strategies, incident response playbooks, infrastructure-as-code trade-offs, and the cultural friction between development velocity and operational stability—always with concrete examples, never with buzzwords. This is the show for engineers, SREs, and platform leads who want to hear two seasoned practitioners argue through the hard choices: when to rewrite vs. patch, how much observability is enough, and how to keep a multi-cloud deployment from becoming a management nightmare. By the end, you'll carry away a sharpened question about your own stack and a new way to think about reliability. #DevOps #CICD #Kubernetes #SiteReliabilityEngineering #PipelineAutomation #InfrastructureAsCode #IncidentResponse #Monitoring #Observability #CloudOperations #ContainerOrchestration #Postmortem #DeploymentStrategy #Technology #FexingoBusiness #BusinessPodcast #SoftwareEngineering #PlatformEngineering Keep every episode free: buymeacoffee.com/fexingo© 2026 Fexingo. All rights reserved. Economics
Episodes
  • How Kubernetes ResourceQuotas Cause Silent Pod Evictions
    Jun 29 2026
    In this episode of DevOps Daily with Fexingo, Lucas and Luna dive into a subtle but destructive Kubernetes behavior: how ResourceQuotas can silently evict pods when namespace limits are reached, even when the cluster has ample capacity. They walk through a real incident at a mid-sized e-commerce company where a single namespace's quota misconfiguration caused cascading evictions across 12 microservices during a flash sale. Lucas explains the mechanism—how the kube-apiserver rejects pod creations when quota is exceeded, and how the scheduler's lack of awareness leads to orphaned pods. Luna raises the issue of observability gaps, noting that standard dashboards often miss quota-related denials. They discuss mitigation strategies: setting explicit deny messages, monitoring quota metrics via Prometheus, and using admission webhooks for early warnings. The episode delivers a concrete lesson for any team running multi-tenant clusters. A brief donation segment highlights listener support for the ad-free show. #Kubernetes #ResourceQuotas #PodEvictions #DevOps #CloudNative #K8sTroubleshooting #ClusterManagement #AdmissionControl #NamespaceQuotas #Observability #Prometheus #IncidentResponse #Ecommerce #FexingoBusiness #BusinessPodcast #Technology #DevOpsDaily #Infrastructure Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    8 mins
  • How Kubernetes Vertical Pod Autoscaler Misallocates Memory
    Jun 28 2026
    Lucas and Luna dig into the Kubernetes Vertical Pod Autoscaler's recalculations that often leave memory over-provisioned and CPU under-provisioned. They examine a case study where a production e-commerce cluster saw 22% of VPA-recommended memory requests exceed actual usage by over 40%, while CPU recommendations lagged behind real demand by nearly 30%. The episode explains the recommender's sliding-window analysis, the percentile-based target (default 95th), and why spikes in Java garbage collection or Python memory fragmentation trick VPA into over-allocating. They contrast VPA with Horizontal Pod Autoscaler and discuss when to pin memory limits manually. Practical takeaway: set a custom memory target percentile via the VPA config's `targetMemoryPercentile` field, or use a sidecar that exposes real-time RSS metrics to tune recommendations. No fluff, just a concrete debugging path for anyone running VPA in production. #Kubernetes #VerticalPodAutoscaler #VPA #CloudNative #DevOps #PodAutoscaling #ResourceManagement #MemoryAllocation #CPUAllocation #JavaGC #PythonMemory #KubernetesBestPractices #ClusterOptimization #SRE #ProductionKubernetes #FexingoBusiness #BusinessPodcast #Technology Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    10 mins
  • How Kubernetes CPU Manager Pins Cause Node Drain Failures
    Jun 28 2026
    Kubernetes CPU Manager static policy seems like a performance win, pinning pods to specific CPU cores. But when you need to drain a node for maintenance, those pinned pods refuse to move—or worse, they crash on restart. In this episode, Lucas and Luna dissect the tension between CPU pinning and node lifecycle. They walk through a real scenario where a 32-core production node stalled a rolling update for 45 minutes because kubelet couldn't evict a CPU Manager pod cleanly. They explain the CPU Manager's topology-aware allocation, the eviction logic gap, and the workarounds: using descheduler with a custom strategy, setting `cpuManagerPolicy: none` for drain-sensitive workloads, and tweaking kubelet eviction thresholds. If your cluster has latency-sensitive apps pinned to cores, this episode will save you from a messy node drain. #Kubernetes #CPUManager #NodeDrain #Kubelet #StaticPolicy #TopologyManager #DevOps #ClusterLifecycle #PodEviction #Descheduler #LatencySensitive #CloudNative #Infrastructure #SRE #Technology #FexingoBusiness #BusinessPodcast #DevOpsDaily Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    12 mins
adbl_web_anon_alc_button_suppression_t1
No reviews yet