fbpx

Senior Site Reliability Engineer

Kubernetes | GitOps | Docker | Helm | AWS/GCP/Azure | Linux | Go | Python | Bash

Location: Serbia / Remote

Full Time

Application deadline: April, 30th

Description

We are looking for engineer who will be engaged in maintaining, securing and scaling cutting-edge geo-distributed cloud infrastructure software to meet users’ needs while keeping an ever-watchful eye on capacity and performance.

  • You will be responsible for building, maintaining, and scaling across multiple clouds our complex and data-intensive Kubernetes based digital edge fabric.
  • You will be writing / extending Kubernetes operators, Serverless like Knative etc.
  • You will automate the continuous deployment process using approaches like GitOps such that once engineering releases, the product walks itself through various stages to join the fleet on the clouds.
  • You will also act as a consultant to development on infrastructure, networking, scalability, monitoring, operational process, infrastructure efficiency and release process.
  • You will scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

Requirements

  • 5+ experience within the DevOps, CI-CD and similar.
  • You have extensive background in provisioning, scaling and operating cloud-based geo-distributed applications by monitoring availability and taking a holistic view of system health.
  • Deep understanding of Docker, Kubernetes, Helm and ways to orchestrate container systems. You have a good understanding & experience operating on either AWS/GCP/Azure using container native technologies.
  • Fluent in Linux systems. You like to automate your job, using Go, Python, Bash or the likes.
  • You enjoy troubleshooting in a distributed Kubernetes environment and are comfortable in tracing problems through applications, systems and networks.
  • You enjoy talking about stability, scalability and performance limits of web-service
  • Advanced knowledge of Python, Go or Shell scripting.
  • Experience in designing, automating securing and supporting big, fast data stacks on AWS/GCE in containers/Kubernetes.
  • Experience operating critical production systems at scale.
  • Excellent knowledge of English, written and spoken.

Benefits

  • Health insurance (Private)
  • Open vacation policy
  • 7 -1 – 7 working hours plus 1 hour for a pause which makes up the 8 hours total.
  • Training and courses
  • Knowledge transfer
  • Happy hour and events
  • Paternity and back to school
  • Training sessions

BlueGrid.io is a service-oriented IT company with a mission to become a global partner in creating the future. Through strategic partnerships, we help companies bring their ideas to life by creating products that are significant to the industries in which they operate. BlueGrid.io operates within the cybersecurity, cloud, fin-tech, and web3 industries.

Integrating Automated Testing into DevOps and Agile - Luxoft Meetup | Registrations

Nikola Crnogorac - Vicert | Registrations

Junior Corner

BECOME A SPEAKER

Become a speaker