Senior Observability Engineer, Observability Platform
Imagine a future where complex infrastructure - from Kubernetes to GPU clusters - is fully observable, predictable, and continuously improving.
At Verda, we’re building a fully featured European cloud computing platform—and observability is at the core of making that platform reliable at scale. From distributed systems to AI workloads, we’re designing visibility into every layer of the stack.
We’re ambitious, curious, and pragmatic builders. We operate with low hierarchy, high ownership, and a strong bias for action. We’ve already achieved a lot, but we’re just getting started.
Now it’s your chance to join the ride. Join Verda while it’s still being built - not once it’s finished!
Your responsibilities
In this role, you will contribute to the architecture and evolution of our observability platform across a diverse, high-performance environment spanning Kubernetes, bare metal, virtual machines, GPU clusters, and network infrastructure.
You will help translate architectural decisions into practical implementations, ensuring adoption across teams and systems. A core part of your work will involve operating and improving a unified observability stack built on Grafana and the VictoriaMetrics ecosystem, while advancing proactive and AI-assisted monitoring capabilities.
You will implement observability for cloud-native applications and distributed systems, as well as support monitoring and optimization of AI/ML workloads and large-scale training runs. You will design dashboards, alerts, and workflows that improve system reliability, performance, and visibility across the platform.
You will also contribute to improving incident detection and response, participate in on-call rotations, and help define standards, telemetry pipelines, and best practices. Working closely with platform, infrastructure, network, and security teams, you will ensure observability is deeply integrated into how systems are designed and operated.
Your key competencies
Strong hands-on experience with the Grafana stack and modern observability tooling
Experience with VictoriaMetrics/VictoriaLogs or similar high-scale metrics and logging systems
Solid understanding of Kubernetes in production environments
Experience monitoring cloud-native applications and distributed systems (e.g., Node.js, Go, APIs)
Strong knowledge of OpenTelemetry, including instrumentation, traces, metrics, logs, and integration with observability backends
Solid Linux and infrastructure fundamentals
Experience with incident management and alerting systems (e.g., PagerDuty, Better Stack)
Familiarity with GitOps and infrastructure automation practices
Exposure to security observability (SecOps)
Experience integrating AI into workflows (automation, LLMs, or agent-based systems)
Ability to work across teams and drive adoption of observability practices
Nice to have
Experience with observability for GPU/AI infrastructure or ML workloads
Deeper experience in AIOps, LLMs, or agent-based observability systems
Background in network observability at scale (e.g., NetFlow, SNMP)
Experience working in hybrid environments (bare metal + VMs + Kubernetes)
Hands-on experience with application instrumentation using OpenTelemetry, and ability to guide engineers in implementing telemetry effectively
Why Verda
Cash + equity compensation along with various fringe benefits (e.g., healthcare, lunch, and wellbeing)
Profitable operations with rapid, sustained growth
31 nationalities across the company, 6 of them represented on the management team
A chance to shape observability for cutting-edge infrastructure, including large-scale AI and GPU workloads
Practicalities
Location: Helsinki, Finland (on-site)
Level: Senior / Mid-Senior
Employment type: Full-time and permanent
What's next
We’re building fast and this role needs the right person behind it. There’s no artificial deadline, but when we find who we’re looking for, we move.
If this sounds like your next move, apply now.
Please submit your application through our Careers page. We don’t accept applications sent by email.
Department: Research & Development
Role: Observability Engineer
Locations: Helsinki
Remote status: Hybrid
About Verda
Verda (formerly DataCrunch) is a technology company building the next generation of cloud infrastructure for AI – compute that's instant, on-demand and at scale. Headquartered in Helsinki, the company operates globally across Europe, the US and Asia. Verda employs over 100 people from nearly 30 nationalities and has raised over $200M in total funding from investors including Lifeline Ventures, byFounders, J12 Ventures, Skaala, Varma and Tesi, alongside leading financial institutions.