Site Reliability Engineer – Software CSG
Austin, Texas, United States Hardware
Summary
Posted: Oct 01, 2024
Role Number: 200570238
Do you love building elegant solutions to highly complex challenges? Do you intrinsically see the importance in every detail? As part of our Silicon Technologies group, you'll help design and manufacture our next-generation, high-performance, power-efficient processor, system-on-chip (SoC). You'll ensure Apple products and services can seamlessly and efficiently handle the tasks that make them beloved by millions! Joining this group means you'll be responsible for crafting and building the technology that fuels Apple's devices. Together, you and your team will enable our customers to do all the things they love with their devices. The Hardware Technology Compute and Storage Group is looking for a customer service oriented, self-driven, and motivated SRE to join our operations team with an emphasis on software automation. The ideal candidate should possess a strong background in programming, excellent communication skills, a sense of ownership, and a drive to produce their best work. They should also possess the ability to analyze and troubleshoot a broad spectrum of problems. You will join an existing team dedicated to supporting the geographically diverse silicon design teams within Apple!
Description
Support and improve the Hardware Technology engineering environment from design through deployment, including additional refinement and scale-up to support future growthSupport the day-to-day operations of the environment including monitoring, measuring, and troubleshooting infrastructure and servicesDrive and participate in automation efforts by identifying, owning, collaborating, and driving new or further automation to enhance the consistent stability of the environmentAchieve and maintain expected productivity levels with minimal supervisionAct as a mentor and interact with people of all levels of abilityContribute to a culture of curiosity, diversity, openness, collaboration, improvement, and resolutionParticipation in a regular ticket and on-call rotationsMinimum Qualifications
Experience in at least one of the following languages: Python, Ruby, Go, Rust, PerlExperience in designing and implementing RESTful services at scaleExperience with software design patterns in multiple languagesExperience with ticketing systems and on-call support dutiesMinimum requirement of a Bachelors degree and 10+ years of relevant industry experiencePreferred Qualifications
Demonstrated ability to participate in or lead cross-functional projects to successful completionExperience working in an operations team supporting high availability environmentsExperience and knowledge of Linux (RedHat/CentOS), containers, and KubernetesExperience with algorithms and data structures in multiple programming languages.Experience working with configuration management software (Puppet, Ansible, Chef, etc.)Experience working with logging tools at scale (Splunk, Elastic Stack, etc.) and with monitoring/graphing tools at scale like (Prometheus, Grafana, etc)Knowledge of and experience with common application protocols like DNS, LDAP, NFS, HTTPSKnowledge in administration, virtualization, diagnostic and performance troubleshooting/profiling.Experience with workflow/data automation pipelines#J-18808-Ljbffr