SaaSOffice, an innovative software startup, is recruiting several enthusiastic Site Reliability Engineers (M/F) to join its teams. Are you looking for a technical challenge? Then you've come to the right place. We're looking for software engineers who combine skills in development and system operations to continue making our solutions more reliable. As a cornerstone of our growth, you'll help us maximize the availability, performance and efficiency of our platform services.
- Innovate by joining teams dedicated to our new products and contribute to the definition of innovative real estate solutions.
- Participate in the transformation of our SaaSOffice Cowork product, dedicated to flexible space management, by supporting its growth and scalability through an automated software infrastructure.
We work in a caring, agile, multi-cloud environment. We believe in the potential of self-organized teams tackling common challenges.
- Ensure the availability, reliability and performance of SaaSOffice's critical systems.
- Participate in the implementation of automation tools, to optimize CI/CD chains, thus improving the efficiency of deployments and updates.
- Implement proactive monitoring solutions to detect and resolve problems before they affect users.
- Collaborate with development teams to anticipate and support infrastructure growth, ensuring it remains scalable and resilient.
- Participate in the implementation of robust security measures and ensure that systems comply with industry standards.
- Identify and implement continuous improvements to optimize infrastructure and application performance.
- Provide advanced technical support in the event of major incidents, contributing to rapid problem resolution.
Knowledge
- In-depth understanding of the principles and best practices of distributed software architectures.
- In-depth knowledge of Azure services and features to orchestrate deployments and manage infrastructure.
- Solid knowledge of Kubernetes for efficient management and orchestration of containers in a production environment.
- Experience in using Datadog to monitor, diagnose and manage application and infrastructure performance.
- Advanced skills in using automation tools, especially Ansible, to optimize deployment and management processes.
Know-how :
- Ability to design, implement and maintain automated CI/CD chains to ensure continuous deployment.
- Practical experience in deploying and managing Docker containers to ensure application portability.
- Expertise in deploying and putting SaaS applications into production, ensuring availability and scalability.
- Ability to actively participate in the implementation of best practices to guarantee system stability and security.
- Ability to contribute to the continuous improvement of processes with a view to ramping up production and deployments, with particular attention to operational efficiency.
Soft skills :
- Ability to work autonomously and proactively to anticipate operational challenges and resolve them effectively.
- Strong collaborative orientation, ability to work closely with development, operational teams and other stakeholders.
- Problem-solving attitude, with a willingness to find innovative solutions to technical challenges.
- Ability to adapt to a fast-changing environment, especially in the context of a SaaS startup.
- Strong inclination to automate processes to improve operational efficiency and foster continuous innovation.
Graduated with a BAC+3 or BAC+5 in computer engineering. Fluency in English is essential.
Site Reliability Engineer (SRE)