Senior Site Reliability Engineer - Private Cloud Walnut Creek, California, United States
Company: Tbwa Chiat/Day Inc
Location: Walnut Creek
Posted on: November 13, 2024
Job Description:
Senior Site Reliability Engineer - Private CloudNetwork Optix
(Nx) is a powerhouse in video software development, driven by a
mission to empower the creation of intelligent video-based
solutions and products capable of converting video into actionable
data. Over a decade in the making, the Network Optix Enterprise
Video Platform helps innovative organizations rapidly and
affordably build world-class, custom-tailored, enterprise-scale
video products and solutions.We are looking for a seasoned Sr SRE
Engineer to join our team and lead efforts in deploying,
supporting, and optimizing NX Private Cloud across multiple
infrastructures, including on-premise, hybrid, and all major public
cloud environments such as AWS, Azure, GCP, and Alibaba Cloud. In
this role, you will work on developing solutions for deploying NX
Cloud services, managing CI/CD pipelines, and ensuring the
reliability and scalability of NX deployments. The ideal candidate
will excel at working across multiple cloud platforms and be
proficient in deploying and running Kubernetes infrastructure in
any environment.What you'll be doing
- Lead the deployment, scaling, and optimization of Kubernetes
infrastructure across any cloud or hybrid environment (including
AWS, Azure, GCP, Alibaba Cloud, and on-premise
infrastructures)
- Employ Kubernetes networking principles to ensure robust and
secure interactions within our infrastructure
- Design, implement, and optimize CI/CD pipelines (Jenkins,
GitLab) to automate build, test, and deployment processes across
various environments
- Leverage service tracing tools to monitor, analyze, and
optimize microservices performance and interactions
- Manage and maintain a diverse infrastructure (bare metal and
cloud environments) comprising hundreds of servers, multiple
Kubernetes clusters, and dozens of business-critical services
- Collaborate on the development and testing of NX Private Cloud,
ensuring it can be deployed seamlessly in customer environments,
whether on-premise or in a cloud-based infrastructure
- Rapidly respond to unexpected downtimes, perform root-cause
analysis, and ensure timely customer notifications
- Conduct post-mortem reviews and design preventative measures to
avoid future incidentsWhat we're looking for
- Ability to work seamlessly across multiple cloud platforms,
without reliance on any single provider
- Proven ability to design and implement Kubernetes networking
and microservices architectures
- Experience with service mesh technologies like Istio and
service tracing tools for enhanced observability
- Ability to design and maintain cloud-agnostic Helm charts for
Kubernetes environments
- Experience with CI/CD pipelines in hybrid or multi-cloud
environments, particularly with Jenkins, GitLab, Artifactory,
OpenSearch, and Graylog
- Experience in Ansible, Terraform, or other tools for
multi-cloud infrastructure automation.
- Hands-on experience with cross-platform infrastructure (Linux,
Windows)
- Commitment to continuous learning and staying updated with the
latest industry trends and best practicesWhat we offer
- Competitive compensation
- Paid time off
- Onsite work in our brand-new comfortable office
- Employer-sponsored health coverage
- Working with top industry experts in our international
teamHybrid or RemoteThe role is primarily designed as a hybrid,
with office locations in Portland, OR, Walnut Creek, CA, and
Burbank, CA. Ideally, the position includes some in-office
time.Base pay range: $175,000 - $200,000 USDNetwork Optix is an
equal opportunity employer committed to diversity and inclusion in
the workplace. We celebrate the diversity of our workforce, which
includes people of all cultural, national, racial, gender
identities, and those who have served in the military. We strive
for an environment where creativity and collaborative growth
thrive. If you have a disability or special need that requires
accommodation, please let us know.
#J-18808-Ljbffr
Keywords: Tbwa Chiat/Day Inc, Tracy , Senior Site Reliability Engineer - Private Cloud Walnut Creek, California, United States, Professions , Walnut Creek, California
Didn't find what you're looking for? Search again!
Loading more jobs...