Senior Observability Engineer - TEKspikes LLC : Job Details

Senior Observability Engineer

TEKspikes LLC

Job Location : all cities,AK, USA

Posted on : 2025-08-05T07:47:38Z

Job Description :

Position: Senior Observability Engineer

Company: Tek Spikes

Overview:

As a Senior Observability Engineer at Tek Spikes, you will be responsible for leading the development and implementation of robust observability strategies across our infrastructure and services. You will leverage your expertise to ensure real-time visibility and insights into system performance, enabling proactive management and rapid resolution of issues. Your role will involve mentoring team members and collaborating with various stakeholders to drive improvements that enhance the reliability and efficiency of our systems.

Key Responsibilities:

  • Design and implement comprehensive observability frameworks, including metrics, logging, and tracing functionalities across applications and infrastructure.
  • Utilize and manage observability tools such as OpenTelemetry, Prometheus, Grafana, and ELK stack to provide actionable insights and performance monitoring.
  • Lead initiatives to optimize observability practices, ensuring best practices are consistently followed across teams.
  • Collaborate with software engineering, DevOps, and security teams to identify key performance indicators and monitoring requirements.
  • Conduct performance analysis and troubleshoot complex incidents by analyzing logs, metrics, and traces.
  • Mentor junior engineers and conduct knowledge-sharing sessions on observability tools and practices.
  • Research and introduce innovative observability solutions that enhance system diagnostic capabilities.

Requirements

Qualifications:

- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.

- 5+ years of experience in observability, monitoring, or cloud infrastructure roles.

- Expertise in observability tools and frameworks such as OpenTelemetry, Prometheus, Grafana, or Elastic Stack.

- Proficient in programming/scripting languages like Python, Go, or Java.

- Extensive experience with cloud platforms (AWS, Azure, GCP) including their respective monitoring solutions.

- Strong problem-solving skills with the ability to analyze complex systems and derive actionable insights.

- Excellent communication and collaboration skills, demonstrating the ability to work effectively in a team-oriented environment.

Apply Now!

Similar Jobs ( 0)