Senior Site Reliability Engineer - Moloco Commerce Media
Company: Moloco, Inc.
Location: Redwood City
Posted on: November 13, 2024
Job Description:
The Impact You'll Be Contributing to Moloco:
- Build a state-of-the-art ad serving infrastructure for our
commerce media platform
- Manage the infrastructure that serves real time ad decisions
based on our machine learning (ML) models and self manageable ad
campaigns
- Maintain and improve the CI/CD pipeline to deploy
infrastructure and code updates in live environments
- Develop infrastructure tools and processes that improve the
productivity of engineering teams
- Traditional SRE/Operational support areas such as tooling and
automation, monitoring, workflow management, maintaining and
improving data pipelines, etc.
The Opportunity:
- Customer Facing: Design, implement, and maintain highly
available infrastructure directly facing customer requests with
high levels of traffic
- Large-Scale Server: Design and implement large scale clusters
capable of handling a wide range of requests with automatic scaling
and resistance against cascading failures
- Deployment Automation: Design and implement deployment
pipelines tightly integrated with code development that can test,
monitor, and decide to refuse or accept new deployments
automatically
- End to End Infrastructure Management: Collaborate with SWEs to
develop end to end infrastructure solutions to minimize operating
cost without compromising on high availability and scalability
How Do I Know if the Role is Right For Me?
- Bachelor's Degree or above in Computer Science or equivalent
technical degree
- Hands-on experience working with GCP or other cloud platforms
(e.g. AWS, Azure)
- Practical, proven knowledge of a high-level language (e.g. Go,
Java, Python, etc.)
- 5+ years of experience in large-scale software development
environment
- Experience working with infrastructure-related software (e.g.
Kubernetes, Helm, Terraform, etc.)
- Experience developing infrastructure, configuration and
deployment scripting and automation for large scale / high
complexity services in a microservices environment
- Experience working with large-scale distributed
systems.
- Passionate about operational excellence and thrive in an
environment where you are able to provide extremely high levels of
customer support
- High level of verbal and written communication skills to
collaborate effectively not only within the team but also with
other infrastructure engineers across the organization
- Tenacious problem solver who takes ownership of issues from
end-to-end to full resolution
#J-18808-Ljbffr
Keywords: Moloco, Inc., Tracy , Senior Site Reliability Engineer - Moloco Commerce Media, Professions , Redwood City, California
Didn't find what you're looking for? Search again!
Loading more jobs...