πŸš€ Douglas Mendes

Douglas Mendes

🌍 Brazil - Remotely πŸš€

Douglas Mendes - Caricatura

Cloud-Native Platform Engineering Associate & Kubernetes Certified

Site Reliability Engineer focused on building resilient, observable, and automated infrastructure. Experienced with cloud platforms, containerization, and infrastructure-as-code.

Interactive Terminal
douglas@portfolio:~$

πŸš€ Expertise in the Modern Ecosystem

Deep hands-on experience with industry-leading technologies for cloud-native infrastructure, containerization, and automation.

AWS

AWS

Cloud Infrastructure

Kubernetes

Kubernetes

Container Orchestration

Jenkins

Jenkins

CI/CD Automation

GitHub

GitHub

Workflow Automation

Terraform

Terraform

Infrastructure as Code

πŸ† Recent Achievements

CNPA Badge

CNCF Credential

2025

CNPA: Certified Cloud Native Platform Engineering Associate

Certified by the Cloud Native Computing Foundation (CNCF) for expertise in cloud-native platform engineering, Kubernetes, and modern containerized infrastructure practices.

πŸ”—Verify Credential

Focus Areas

🎯

Building reliability at ItaΓΊ

In Progress

βœ“

Implemented end-to-end observability using Datadog

Completed

βœ“

Strengthened engineering teams with SRE best practices

Completed

πŸ… Recognitions

Datadog

Observability Day 2025

Choice of the Day Award

Last year, I led the Datadog implementation for the Beyond Banking platform, with a strong focus on ItaΓΊ Shop 2.0. This work earned first place at Itaú’s Observability Day as the β€œChoice of the Day,” recognized for its impactful use cases and observability excellence.

πŸ’Ό Experience

Senior Site Reliability Engineer

🏒 Itaú Unibanco

02/2022 - Present

  • β€’Led SRE practices for Beyond Banking platforms and served as reliability reference for 15+ teams
  • β€’Architected and managed 10+ EKS clusters with 2k+ pods supporting 300+ microservices
  • β€’Led observability migration from Grafana to Datadog, enabling 300+ monitors and 50+ dashboards
  • β€’Implemented FinOps strategies reducing AWS cloud costs by $60K+ annually
  • β€’Implemented SLO/SLI and error budget practices, improving reliability from 99.5% to 99.9%

Senior Site Reliability Engineer

🏒 ZUP (acquired by Itaú Unibanco)

03/2021 - 02/2022

  • β€’Core SRE for ItaΓΊ Shop, a cloud-native marketplace embedded in ItaΓΊ's super-app
  • β€’Designed and bootstrapped AWS infrastructure using Terraform and Crossplane across 4 environments
  • β€’Managed 300+ AWS resources and owned Kubernetes environments with 1,000+ pods
  • β€’Exposed microservices securely via API Gateway handling 1M+ requests/day
  • β€’Optimized CI/CD with GitHub Actions and Jenkins, reducing lead time from 2h to 15min (87%)
  • β€’Established proactive monitoring with Prometheus, Grafana, and CloudWatch to detect 95%+ incidents before impact

DevOps Engineer

🏒 BMG Bank

08/2020 - 03/2021

  • β€’Supported platform initiatives in a regulated digital banking environment
  • β€’Built and maintained Azure DevOps pipelines enabling 15+ teams to deliver safely to production
  • β€’Established security gates with SonarQube, reducing production vulnerabilities by 25%
  • β€’Migrated legacy on-prem workflows to AWS/Azure, reducing deployment cycle time by 70%
  • β€’Enabled product squads to deploy secure and scalable cloud applications through automation standards

Site Reliability Engineer

🏒 Avenue Code

10/2017 - 08/2020

  • β€’Worked as SRE consultant for global e-commerce platforms with high traffic and strict availability requirements
  • β€’Designed incident automation including ticket creation and intelligent on-call routing
  • β€’Standardized incident management with pre-mortems and post-mortems for stronger root cause analysis
  • β€’Led observability tooling migrations (Datadog ↔ New Relic) and implemented PagerDuty workflows for 15+ teams
  • β€’Created runbooks for 20+ recurring failure scenarios, reducing average resolution time by 50%

βš™οΈ Skills & Expertise

Cloud & Platforms

☁️AWS (EKS, ECS, EC2, IAM, S3, Lambda, API Gateway)β˜…β˜…β˜…β˜…β˜…
☁️Azureβ˜…β˜…β˜†β˜†β˜†
☁️GCPβ˜…β˜…β˜†β˜†β˜†

Containers & Orchestration

β›΅Kubernetesβ˜…β˜…β˜…β˜…β˜…
πŸ‹Dockerβ˜…β˜…β˜…β˜…β˜…

Infrastructure as Code

πŸ—οΈTerraformβ˜…β˜…β˜…β˜…β˜…
πŸ—οΈCrossplaneβ˜…β˜…β˜…β˜…β˜†

CI/CD & Automation

πŸ€–Jenkinsβ˜…β˜…β˜…β˜…β˜†
πŸ”„GitHub Actionsβ˜…β˜…β˜…β˜…β˜†
πŸ”„Azure DevOpsβ˜…β˜…β˜…β˜†β˜†
πŸ’»Shellβ˜…β˜…β˜…β˜…β˜…
πŸ’»PowerShellβ˜…β˜…β˜…β˜†β˜†

Observability & Incident Management

πŸ“ŠDatadogβ˜…β˜…β˜…β˜…β˜…
πŸ“ˆPrometheusβ˜…β˜…β˜…β˜…β˜†
πŸ“ŠGrafanaβ˜…β˜…β˜…β˜…β˜†
πŸ“ˆNew Relicβ˜…β˜…β˜…β˜†β˜†
πŸ”Graylogβ˜…β˜…β˜…β˜†β˜†
🚨PagerDutyβ˜…β˜…β˜…β˜…β˜†
🎫ServiceNowβ˜…β˜…β˜…β˜†β˜†

Reliability & Operations

🎯SRE practicesβ˜…β˜…β˜…β˜…β˜…
πŸ“ŠSLAs/SLOs/SLIsβ˜…β˜…β˜…β˜…β˜…
πŸš‘Incident Responseβ˜…β˜…β˜…β˜…β˜†
πŸ“–Runbooksβ˜…β˜…β˜…β˜…β˜†
⚑Chaos Engineeringβ˜…β˜…β˜…β˜†β˜†
βš™οΈLoad Testingβ˜…β˜…β˜…β˜…β˜†
πŸ’°FinOpsβ˜…β˜…β˜…β˜†β˜†

Skill Radar

πŸŽ“ Education & Certifications

πŸ“š Degrees

Completed

Postgraduate Degree in Site Reliability Engineering

PUC Minas

Completed

Bachelor's Degree in Information Systems

PUC Minas

πŸ† Certifications

CNPA: Certified Cloud Native Platform Engineering Associate

2025

CNPA: Certified Cloud Native Platform Engineering Associate

Cloud-native platform engineering and modern infrastructure practices.

CNCF

AWS Solutions Architect, Associate

2022

AWS Solutions Architect, Associate

Design and deploy AWS infrastructure solutions.

AWS

AWS Cloud Practitioner

2022

AWS Cloud Practitioner

Foundational knowledge of AWS cloud services and best practices.

AWS

Certified Kubernetes Administrator (CKA)

2021

Certified Kubernetes Administrator (CKA)

Expert-level Kubernetes cluster management and orchestration.

CNCF

Google Cloud Associate Cloud Engineer

2020

Google Cloud Associate Cloud Engineer

Deploy and manage Google Cloud infrastructure and applications.

Google Cloud

Oracle Cloud Infrastructure Architect Associate

2020

Oracle Cloud Infrastructure Architect Associate

Design and implement Oracle Cloud Infrastructure solutions.

Oracle

πŸ“š What I'm Learning

Continuous improvement through structured learning and hands-on practice.

πŸ“ŠOverall Learning Progress60%
πŸ—οΈπŸŽ–οΈ Certification

AWS Solutions Architect Professional

Deep dive into AWS architecture, multi-account strategies, and enterprise-scale solutions

Progress65%
⏱️ Est. completion: Q2 2026

Key Topics

Multi-account AWS strategiesAdvanced VPC & networkingCost optimization+3 more
πŸŽ―πŸš€ Specialty

System Design & Architecture

Advanced patterns for scalable, resilient distributed systems

Progress45%
⏱️ Est. completion: Q3 2026

Key Topics

Load balancing strategiesDatabase sharding & scalingCache strategies (Redis, CDN)+3 more
βœ¨πŸ“š Framework

AWS Well-Architected Framework Deep Dive

Master the 6 pillars: Operational Excellence, Security, Reliability, Performance Efficiency, Cost Optimization, Sustainability

Progress70%
⏱️ Est. completion: Q1 2026

Key Topics

Operational excellence practicesSecurity best practicesReliability patterns+3 more

πŸ“¬ Get in Touch

Let's discuss your infrastructure challenges, reliability initiatives, or opportunities.