Join to apply for the Site Reliability Engineer (SRE) role at Rogo.
We're building AI thought partners to make people smarter and more creative, accelerating the creation and sharing of knowledge in financial services. We're ambitious and focused on building the biggest Financial AI company in the world. Our team is lean, smart, and rapidly growing out of our NYC office.
WHY JOIN ROGO?
- Exceptional traction: strong PMF with major investment banks, hedge funds, and private equity firms.
- World-class team: we prioritize talent density and work with highly driven individuals.
- Velocity: fast-paced environment that fosters rapid learning and new challenges.
- Frontier technology: developing cutting-edge AI systems and pushing research boundaries.
- Innovative Product: creating powerful tools that transform how knowledge is discovered, created, and shared.
Key Responsibilities
- Design, deploy, and maintain cloud infrastructure on AWS and/or Azure for high availability and resilience.
- Implement and manage monitoring solutions using Datadog to proactively address system issues.
- Manage Kubernetes clusters, utilizing Helm for deployment automation.
- Develop Infrastructure as Code (IaC) with tools like Terraform and automation scripts in Bash or Python.
- Collaborate with development and operations teams to promote DevOps practices and ensure seamless deployment.
- Troubleshoot and resolve complex system issues related to OS, networking, and databases in cloud environments.
- Maintain comprehensive documentation of configurations and procedures.
Qualifications
- Bachelor's degree in Computer Science, IT, or related field.
- 3-5 years of experience with AWS/Azure, including EC2, S3, VPC, Lambda.
- 2-3 years managing Kubernetes clusters and Helm.
- 2-3 years with Datadog or similar tools.
- 3-5 years Linux system administration and scripting experience.
- 2-3 years with Terraform or similar IaC tools.
Skills include proficiency in Bash and Python, networking fundamentals, CI/CD pipelines, cloud security, problem-solving, and effective communication.
Preferred Qualifications
- Experience with MLOps, PostgreSQL, Elasticsearch, vector databases, and observability tools.
- Certifications in AWS, Azure, or Kubernetes.
- Experience with GCP and distributed tracing tools.
Who You Are
- Thrives in fast-paced, startup environments.
- Ambitious and enjoys solving challenging problems.
- Curious about AI, technology, and finance.
- Autonomous, self-directed, and comfortable with ambiguity.
- Collaborative, organized, and thoughtful.
Seniority level
Employment type
Job function
- Engineering and Information Technology
Industries
#J-18808-Ljbffr