Talent Insights are looking to discuss a new Site Reliability Engineer position working full time with a Technology organisation Melbourne based. They pride themselves on driving innovation and pushing the boundaries of technology. We are currently seeking a talented and proactive Site Reliability Engineer (SRE) to join our dynamic team.
If you have a passion for ensuring the reliability, availability, and performance of mission-critical systems, and possess hands-on experience with Google Cloud Platform (GCP or AWS), monitoring tools like Datadog, Splunk, SignalFX, or Dynatrace, along with expertise in Linux, networking, configuration management, and Infrastructure as Code, we want to hear from you.
Key Responsibilities:
- Design, implement, and maintain scalable and reliable infrastructure on Google Cloud Platform.
- Utilize monitoring tools such as Datadog, Splunk, SignalFX, or Dynatrace for proactive issue detection and resolution.
- Implement and manage monitoring solutions with Prometheus or OTEL Collectors.
- Work with Linux (Ubuntu) systems, ensuring optimal performance and reliability.
- Collaborate with network engineers and system engineers to enhance overall system architecture.
- Use configuration management tools like Salt Project, Chef, or Puppet to automate and manage system configurations.
- Develop and maintain Infrastructure as Code (IaC) using Terraform for streamlined deployments.
- Contribute to software engineering projects using Golang, Python, or Node to improve system reliability and performance.
- Create and maintain scripts in Bash for automation and routine tasks.
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Proven experience as a Site Reliability Engineer or similar role.
- Strong knowledge of Google Cloud Platform (GCP) services.
- Experience with monitoring tools such as Datadog, Splunk, SignalFX, or Dynatrace.
- Familiarity with Prometheus or OTEL Collectors for monitoring.
- Proficient in Linux (Ubuntu) system administration.
- Hands-on experience with network engineering and system engineering.
- Expertise in configuration management tools like Salt Project, Chef, or Puppet.
- Software engineering skills in Golang, Python, or Node.
- Scripting experience with Bash.
- Practical experience with Infrastructure as Code, using Terraform.
- Attractive salary and comprehensive benefits package.
- Opportunities for professional development and career growth.
- Collaborative and innovative work environment.
- Exposure to cutting-edge technologies and challenging projects.
- Engage with a diverse and talented team of professionals.
* Please note you will require full working rights for this position as we will be unable to provide sponsorship.
How to Apply:
Please submit your resume by clicking APPLY NOW or email **************@talentinsights.com.au with the subject line "Site Reliability Engineer” Application.