Site Reliability Eng....

STAFIDE - The Hague - Netherlands

Site Reliability Engineer العربية

Site Reliability Engineer

STAFIDE

Posted on : 15-12-2022

Employer Active

The job posting is outdated and position may be filled

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Jobs by Experience

5+ years

Job Location

The Hague - Netherlands

Monthly Salary

Not Disclosed

Salary Not Disclosed

Posted on : 15-12-2022

Job Description

Basic requirement

At Sisar, were searching for a competent applicant who is persistent and self-driven. Along with teamwork, the candidate should have the capacity for independent work and be a good cultural fit for the company.

About us

We are a pioneering company dedicated to offering options that will revolutionize how businesses boost efficiency. Working with us is a terrific and comprehensive experience for all stakeholders because we offer a wide range of cutting-edge solutions and technology-driven services.

Our vision

We have been in the Netherlands for the last 7 years and are now expanding to the UK and India. Officially, were going global. We take great pride in hiring the brightest minds from around the world and having a multinational team.

Requirements

As an SRE you will:

Be on a PagerDuty rotation to respond to availability incidents and provide support for service engineers with customer incidents.
Use your on-call shift to prevent incidents from happening.
Run our infrastructure with Terraform and Kubernetes.
Use monitoring and alerting to alert on symptoms not outages.
Document every action so that your findings turn into repeatable actions (playbooks) and then into automation.
Improve the deployment process--we want to make it as boring as possible.
Design, build and maintain core infrastructure pieces that allow DSX to scale to support hundreds and then thousands of concurrent users.
Debug production issues across services and levels of the stack.
Plan the growth of the DSX infrastructure.

You may be a fit for this role if you:

Think about systems, and particularly edge cases and failure modes.
Know your way around Linux and the Unix .
Have strong programming skills--preferably Nodejs, but it could be Python, Go, .NET or even Ruby.
Have an urge to collaborate and communicate asynchronously.
Have an urge to document all the things so you dont need to learn the same thing twice.
Have an enthusiastic, go-for-it attitude. When you see something broken, you cant help but fix it.
Have an urge for delivering quickly and iterating fast.
Have experience with Nginx, Docker, Kubernetes, Terraform.
Have good experience with GitHub.
Coding infrastructure automation with GitHub Actions and Terraform.
Improving our Prometheus Monitoring or building new Metrics.
Helping to deploy new versions of DSX.
Helping to plan, prepare for, and execute the migration of DSX from virtual machines running on Azure to cloud-native container-based deployments with Kubernetes using Azure Kubernetes Service.

Skills:

General knowledge of 4 of the following areas of technical expertise with deep knowledge in 1 area:
Implement ""Infrastructure as Code"" using Terraform and GitHub CI/CD for automation.
Load balancing of the application including Proxies and CDN.
Kubernetes and containerising our system.
Administering a high-availability MSSQL cluster.
Monitoring and Metrics in Prometheus and Grafana, and their integrations with Slack/PagerDuty.
Logging infrastructure.
Backend storage management and scaling.
Disaster Recovery and High Availability strategy.
Contributing to code for services and automation.

Good To Have:

Provide emergency response either by being on-call or by reacting to symptoms according to monitoring and escalation when needed.
Propose ideas and solutions within the infrastructure team to reduce the workload by automation.
Plan, design and execute solutions within the team to reach specific, agreed-upon, goals.
Plan and execute configuration change operations both at the application and the infrastructure level.
Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation.
Azure Data Factory.
Azure DevOps.
Javascript.

Benefits

Travel allowance

An open culture where you can express your views

Excellent Work life balance

Visa sponsorship

A great group of like-minded colleagues

Relocation support

As an SRE you will: Be on a PagerDuty rotation to respond to availability incidents and provide support for service engineers with customer incidents. Use your on-call shift to prevent incidents from happening. Run our infrastructure with Terraform and Kubernetes. Use monitoring and alerting to alert on symptoms not outages. Document every action so that your findings turn into repeatable actions (playbooks) and then into automation. Improve the deployment process--we want to make it as boring as possible. Design, build and maintain core infrastructure pieces that allow DSX to scale to support hundreds and then thousands of concurrent users. Debug production issues across services and levels of the stack. Plan the growth of the DSX infrastructure. You may be a fit for this role if you: Think about systems, and particularly edge cases and failure modes. Know your way around Linux and the Unix Shell. Have strong programming skills--preferably Nodejs, but it could be Python, Go, .NET or even Ruby. Have an urge to collaborate and communicate asynchronously. Have an urge to document all the things so you don't need to learn the same thing twice. Have an enthusiastic, go-for-it attitude. When you see something broken, you can't help but fix it. Have an urge for delivering quickly and iterating fast. Have experience with Nginx, Docker, Kubernetes, Terraform. Have good experience with GitHub. Projects you could work on Coding infrastructure automation with GitHub Actions and Terraform. Improving our Prometheus Monitoring or building new Metrics. Helping to deploy new versions of DSX. Helping to plan, prepare for, and execute the migration of DSX from virtual machines running on Azure to cloud-native container-based deployments with Kubernetes using Azure Kubernetes Service. Skills: General knowledge of 4 of the following areas of technical expertise with deep knowledge in 1 area: Implement ""Infrastructure as Code"" using Terraform and GitHub CI/CD for automation. Load balancing of the application including Proxies and CDN. Kubernetes and containerising our system. Administering a high-availability MSSQL cluster. Monitoring and Metrics in Prometheus and Grafana, and their integrations with Slack/PagerDuty. Logging infrastructure. Backend storage management and scaling. Disaster Recovery and High Availability strategy. Contributing to code for services and automation. Good To Have: Provide emergency response either by being on-call or by reacting to symptoms according to monitoring and escalation when needed. Propose ideas and solutions within the infrastructure team to reduce the workload by automation. Plan, design and execute solutions within the team to reach specific, agreed-upon, goals. Plan and execute configuration change operations both at the application and the infrastructure level. Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation. Azure Data Factory. Azure DevOps. Javascript.

Employment Type

Full Time

Company Industry

Key Skills

Apply Now

About Company

STAFIDE

0-50 employees

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Free AI Resume Review

Get Hired 3x Faster with free, confidential review from Ai resume review service.

Order Now

Resume, LinkedIn, Cover Letter

Elevate your professional profile with expertly crafted documents including your resume, LinkedIn profile, cover letter.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Learn More

Reverse Recruiting

Never apply for a job again. We apply and track jobs for you to find your perfect match.

Site Reliability Engineer

STAFIDE

Job Description

Requirements

Benefits

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Category Specialist Maritime Assets

Category Specialist Maritime Assets

Systems Engineer Bij Voclarion In Amsterdam

Medior ICT Support Engineer Bij LIDL

Junior Sales Engineer At EAE In Oosterhout

VOIP Support Engineer At EVOLVE IP In Rotterdam

Data Engineers Roermond

ICT Security Trainee Zoetermeer Bij True Legends