Perm Position:
Las cualificaciones, habilidades y toda la experiencia relevante necesaria para este puesto se pueden encontrar en la descripción completa a continuación.
Senior Platform Optimization & Observability Engineer
Location:
Remote (EU)
Duration:
6 months +
Owns platform health design and the observability stack (e.G. ELK). Responsible for optimizing existing infrastructure, expanding the observability platform with APM and security capabilities, and migrating monitoring and security audit assets from Azure-native tooling to the new observability platform. Coordinates with the network engineer on DR optimization and firewall review.
Key Responsibilities
Optimize existing virtualization platforms (e.G. VMware, Hyper‑V, KVM‑based platforms such as Proxmox) – performance tuning, capacity planning, resource efficiency
Optimize storage performance, capacity, and cost efficiency in virtualized environments
Harden platform security – secure configuration, attack surface reduction
Review and optimize existing DR designs and recovery processes to reduce RTO/RPO
Review and optimize firewall rules – identify unused or risky rules, align with security best practices
Expand the observability platform with APM and security capabilities, including Sentinel replacement POC execution
Migrate queries, dashboards, and security audit reports from Log Analytics to the new observability platform
Optimize log collection, retention, and analysis – reduce volume, implement cost‑effective logging without losing visibility
Coordinate with the network engineer on DR automation and firewall review
Required Skills
Hands‑on experience optimizing virtualization platforms (e.G. VMware, Hyper‑V, KVM‑based platforms such as Proxmox)
Experience reviewing and optimizing storage performance and capacity, including latency, throughput, and cost efficiency in virtualized environments
Strong understanding of platform security best practices, system hardening, secure configuration, and reducing attack surface
Hands‑on operational expertise of leading observability platforms (e.G. ELK stack)
Experience with ELK APM and security modules including APM agent deployment, SIEM detection xhfqzwm rules (e.G. Elastic Security), agent fleet management, and security event correlation
Experience migrating queries, dashboards, and security audit reports from Log Analytics to the new observability platform including KQL query translation and compliance reporting migration
Experience with advanced monitoring solutions focused on platform stability, resilience, and early issue detection, including proactive alerting and capacity‑related signals
Experience with security compliance scanning tools (e.G. CIS benchmark tools)
Familiarity with compliance frameworks such as SOC 1, SOC 2, and C5
Experience integrating patch management platforms for reporting
#J-18808-Ljbffr