Sheigfred Bello

DevOps Engineer / Systems Engineer

Optimizing cloud performance, automating infrastructure, and ensuring reliability.

About Me

AWS cloud expert with proven success in cost optimization (15–30% savings), automation (IaC), and monitoring (Datadog, CloudWatch). Skilled in Linux/Windows admin, networking, and CI/CD. Experienced across on-prem and cloud with 100% production uptime.

100%
Production Uptime
30%
Cost Savings
9000+
Managed Desktops

Skills & Technologies

Cloud & AWS

💻EC2 🗄️RDS 📦S3 🌐VPC 🧭Route53 🧩CloudFormation 💰Cost Explorer

Systems & Infrastructure

🐧Linux 🪟Windows Server 💠VMware 🖥️Dell PowerEdge 🧱ESXi 🗃️QNAP

DevOps & Automation

🧰Ansible 🐳Docker ⚙️GitHub CI/CD 🐍Python

Monitoring & Networking

🐶Datadog 📈Zabbix 🏺Prometheus 📊Grafana 🌐TCP/IP 🐬MySQL/MariaDB

Experience

DevOps Engineer

Coreware 2025 – Present

AWS cost optimization, automated deployments, custom monitoring, 100% uptime.

System Engineer

Ubiquity 2023 – 2025

9,000+ desktops, 10-site server deployments, AWS solutions, 99.9% uptime.

IT Support

Ubiquity 2022 – 2023

IT support for 800+ users, asset lifecycle, team training.

IT Tech Support

Colegio de San Juan de Letran 2022

Campus-wide IT support, asset & network management.

Projects

Enterprise SSO Implementation

Implemented comprehensive single sign-on solution using LDAP, SAML, and OpenID protocols.

LDAP SAML OpenID

Windows 10→11 Migration

Large-scale enterprise migration from Windows 10 to Windows 11 across multiple sites.

Windows Migration Enterprise

Linux Server Deployment

Automated deployment and configuration of Linux servers across multiple environments.

Linux Automation Deployment

AWS Cost Optimization

Achieved 15-30% cost savings through strategic AWS resource optimization and monitoring.

AWS Cost Optimization Monitoring

Cross-Domain Authentication

Implemented secure cross-domain authentication system for multi-site enterprise environment.

Authentication Security Multi-Site

AWS CloudWatch Monitoring & Alerts

Designed and implemented end-to-end observability for Flask services using CloudWatch metrics, logs, and alarms. Created dashboards for latency, error rates, and infrastructure health. Alerts routed to Slack/Email with actionable context and runbooks.

AWS CloudWatch Alarms Dashboards Runbooks SNS

ECS Service Auto Scaling (Target Tracking)

Implemented ECS service auto scaling with target tracking on CPU/Memory and request-based policies via ALB metrics. Included cooldown tuning, min/max capacity guards, and safe deployments to keep p95 latency within SLO during traffic spikes.

AWS ECS Auto Scaling ALB Metrics SLO IaC

Datadog Monitoring & Alerting

Integrated Datadog APM, Logs, and Metrics for application and container visibility. Built dashboards (RPS, p95, error rate), SLOs, and composite monitors to reduce alert noise. Added log pipelines and tags for fast root-cause analysis.

Datadog APM Dashboards SLO/SLI Log Pipelines Alerting

Case Studies

🌐 How Users Access This Portfolio

Simple, reliable infrastructure powering your experience

💻

User's Device

You type sheigfredbello.com in your browser

🌍

DNS Lookup

Browser asks "Where is sheigfredbello.com?"

🧭

Route 53

AWS DNS service responds with server location

📍

A Record

Points to public IP: XX.XX.XX.XX

☁️

EC2 Instance

AWS virtual server receives your request

🔄

Nginx

Web server processes and forwards request

🐍

Flask App

Python application generates the page

🎨

HTML/CSS/JS

Browser renders this beautiful portfolio!

🛠️ Technology Stack

🧭 Route 53 DNS Management
☁️ EC2 Virtual Server
🔄 Nginx Web Server
🐍 Flask Python Framework

AWS CloudWatch Monitoring & Alerts

Problem: Flask services had no proactive monitoring; outages were detected late.

Actions: Built dashboards for latency/error rates, created alarms with Slack/Email routing, and wrote runbooks.

Results: MTTR dropped from ~40 min to 12 min, with clear alert context.

AWS CloudWatch Dashboards Alarms

ECS Service Auto Scaling

Problem: High traffic spikes caused 504 errors and service instability.

Actions: Implemented target tracking on CPU/memory & ALB request counts. Tuned cooldowns + min/max guardrails.

Results: Reduced error rate by 70% during peak events; kept p95 latency within SLO (≤300ms).

AWS ECS Auto Scaling IaC

Datadog Monitoring & Alerting

Problem: Alert fatigue and lack of visibility slowed incident response.

Actions: Deployed APM, built RPS/p95/error dashboards, SLOs, and composite monitors. Optimized log pipelines.

Results: Alert noise reduced by 60%, RCA time cut by 50%.

Datadog APM Dashboards SLO

Udemy Certifications

Windows Server 2019 Admin CompTIA Network+ Udemy Getting started with Cisco CCNA CompTIA A+

The Root Access Podcast

Get In Touch

📍 Makati City, Philippines