Infrastructure Engineer
Yuxi Global
Medellín, Medellin, Antioquia, Colombia
•Hace 10 horas
•Ninguna postulación
Sobre
- Company Description
- Veritas Automata is a technology consulting and software development company dedicated to delivering innovative solutions that drive business success. We combine expertise in automation, AI, and advanced technology to enhance operational efficiency and streamline complex processes. Our teams build modern, intelligent, and scalable solutions that empower clients across regulated industries, enterprise platforms, and next-generation AI ecosystems. We are committed to innovation, ownership, and delivering measurable outcomes for our clients and partners.
- Yuxi Global, powered by Veritas Automata, is a South America-based delivery and talent entity that supports Veritas Automata’s global delivery model. We specialize in providing comprehensive solutions, including turnkey enterprise-grade application development, managed development teams, staff augmentation, and strategic consulting via our Veritas Automata Services Team.
- Job Description
- We are looking for an Infrastructure Engineer (L5), who is a senior technical engineer responsible for architecting, implementing, and optimizing cloud and on-premises infrastructure that supports distributed, Kubernetes-based, and high-availability systems. With 5–8 years of experience, this role provides deep technical leadership across infrastructure automation, observability, networking, platform reliability, and DevSecOps enablement.
- This role partners closely with software engineering, platform engineering, SRE, security, and product teams to deliver stable, performant, and scalable infrastructure environments. The Infrastructure Engineer (L5) leads complex technical implementations, removes blockers, and ensures the organization’s infrastructure foundation can support growth, resilience, and innovation.
Core Responsibilities
- Design, deploy, and maintain Kubernetes clusters (K3s, RKE2, AKS, EKS, GKE) across cloud and hybrid environments.
- Implement infrastructure-as-code solutions using Terraform, Pulumi, Ansible, or equivalent automation tools.
- Engineer secure, scalable networking architectures including VPCs, subnets, VPNs, firewalls, service meshes, load balancers, and cross-region connectivity.
- Architect and maintain CI/CD pipelines, GitOps tooling, and automated delivery workflows using GitHub Actions, ArgoCD, Flux, or GitLab CI.
- Configure and operate observability platforms including Prometheus, Grafana, Loki, Tempo, OpenTelemetry, and Thanos for full-stack visibility.
- Collaborate with SRE and platform teams to improve reliability, reduce operational toil, and optimize performance and cost.
- Implement and maintain cloud security best practices including IAM, RBAC, secrets management, encryption, and compliance controls.
- Participate in on-call rotation, incident response, and root cause analysis for platform-related production issues.
- Develop and document runbooks, architecture diagrams, operational standards, and troubleshooting guides.
- Mentor junior engineers and contribute to capability-building around modern infrastructure practices.
- Qualifications
Required qualifications
- 5–8 years of experience in infrastructure engineering, DevOps, SRE, platform engineering, or cloud operations.
- Hands-on experience with Kubernetes cluster administration, operators, workloads, storage, and networking.
- Strong proficiency with infrastructure-as-code, cloud provisioning, and automated configuration management.
- Deep understanding of at least one major cloud platform (AWS, Azure, or GCP) including compute, networking, IAM, and managed services.
- Experience deploying and managing observability stacks including metrics, logs, traces, dashboards, and alerting.
- Familiarity with containers, networking, service meshes, ingress controllers, and distributed application architectures.
- Strong scripting abilities (Bash, Python, PowerShell) for automation and operational efficiency.
- Bachelor’s degree in Engineering, Computer Science, or equivalent practical experience.
- Advanced English level (written and spoken) to communicate effectively across global teams.
Preferred experience
- Experience supporting regulated or high-compliance environments such as healthcare, life sciences, financial services, or critical infrastructure.
- Exposure to edge computing, multi-cluster orchestration, zero-trust networking, and global failover routing.
- Experience with storage platforms like Longhorn, Ceph, EBS, Azure Disk, or GCP Persistent Disks.
- Background in SRE practices including SLO/SLI design, error budgets, and resilience engineering.
- Certifications in cloud technologies, Kubernetes (CKA/CKS/CKAD), or DevOps methodologies.
- Technical skills
Infrastructure Engineers at this level are expected to demonstrate proficiency with one or more tools or platforms in each of the following categories
- Cloud & Infrastructure Platforms
- AWS (EC2, EKS, VPC, IAM, RDS, CloudWatch)
- Azure (AKS, VNet, Load Balancer, AAD, Monitor)
- GCP (GKE, VPC, IAM, CloudOps)
- Kubernetes & Containerization
- Kubernetes (K3s, RKE2, AKS, EKS, GKE)
- Helm, Kustomize, Operators
- Container runtimes and image management
- Automation & IaC
- Terraform, Pulumi, Ansible, Helm
- GitHub Actions, GitLab CI, ArgoCD, Flux (GitOps)
- Observability & Reliability
- Prometheus, Grafana, Loki, Tempo, Thanos
- OpenTelemetry (metrics, logs, traces)
- Alerting, SLO dashboards, performance tuning
- Networking & Security
- DNS, firewalls, VPNs, load balancers, API gateways
- Zero-trust frameworks, RBAC/ABAC
- Secret management (Vault, KMS, SOPS)
- Soft Skills
- Strong analytical and troubleshooting skills with ability to diagnose issues across complex, distributed systems.
- Clear communication with ability to translate infrastructure concepts to both technical and non-technical stakeholders.
- Ownership mindset with focus on stability, reliability, and operational excellence.
- Ability to manage multiple concurrent priorities in fast-moving environments.
- Leadership presence and ability to mentor junior engineers.
- Organizational Competencies
- Remote Collaboration: Works effectively in distributed teams using asynchronous communication.
- Continuous Learning: Actively explores new models, frameworks, and safety techniques.
- Cultural Fit: Embodies Veritas Automata’s values of innovation, integrity, and ownership.
- Strategic Impact: Contributes reusable AI building blocks that accelerate future product delivery.
- Additional Information
- Workplace Conditions and Physical Expectations
- Prolonged periods of sitting at a desk and working on a computer.
- Must be able to lift 15 pounds at times.
- Must access and navigate each department at the organization’s facilities.
- Occasional travel to the client’s site may be required.
- English Level: C1




