Service Reliability Engineer
Company: Stanford University
Location: Redwood City
Posted on: November 13, 2024
Job Description:
Stanford's central IT organization is looking for an experienced
Service Reliability Engineer to join the Enterprise Technology team
to support the implementation, maintenance, and upkeep of
on-premise and cloud systems.This is a hybrid eligible
position.CORE DUTIES:
- Deploying and managing highly available hybrid systems on On
premise and cloud platforms like AWS and OCI, focusing on
Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS)
offerings.
- Manage installations, configurations, and upgrades;
troubleshoot outages and incidents.
- Implement Infrastructure as Code practices using tools like
Terraform to automate cloud infrastructure provisioning and
management. Improve operational efficiency by automating routine
application tasks using Python and shell scripting.
- Design, implement, and maintain CI/CD pipelines to streamline
application deployment processes, ensuring high-quality software
delivery.
- Deploy and manage containerized applications using Docker and
orchestrate them with Docker Compose or Kubernetes for scalability
and resilience.
- Lead efforts to modernize existing infrastructure and
applications by integrating new technologies and cloud-native
solutions.
- Actively participate in scaling, performance tuning, and
capacity planning of Enterprise Stack, including Single Sign-On and
SSL keystore management.
- Conduct application server hardening to enhance security
against potential threats.
- Provide technical support for complex issues by collaborating
with all stakeholders to assess current systems, recommend
improvements for enhanced performance and scalability. Ensure
effective communication regarding system status and
operations.
- Create and maintain comprehensive documentation for system
configurations, procedures, and best practices to ensure knowledge
transfer and compliance.
- Ensure robust monitoring processes are in place and compliance
with production security standards.EDUCATION &
EXPERIENCE:Bachelor's degree and eight years of relevant experience
or a combination of education and relevant experience.KNOWLEDGE,
SKILLS, AND ABILITIES:
- Experience with diverse middleware technologies on bare metal
and Docker containers.
- Experience with Infrastructure as Code like Terraform and
container orchestration utilities.
- Demonstrate Cloud Infrastructure experience with experience in
building full-stack infrastructure for enterprise-ready
applications.
- Demonstrated experience in the support requirements of large
data management systems including performance analysis and tuning
of high-volume, transaction systems.
- Experience with version control systems (Git, SVN) and CI/CD
tools.
- Proficiency in programming and scripting languages, especially
Python and Shell.
- Strong working knowledge of Linux-based systems.Other
Duties:Additional responsibilities may be assigned as needed.The
expected pay range for this position is $150,922-$155,000 per
annum.Stanford University provides pay ranges representing its good
faith estimate of what the university reasonably expects to pay for
a position. The pay offered to a selected candidate will be
determined based on factors such as (but not limited to) the scope
and responsibilities of the position, the qualifications of the
selected candidate, departmental budget availability, internal
equity, geographic location, and external market pay for comparable
jobs.At Stanford University, base pay represents only one aspect of
the comprehensive rewards package. The Cardinal at Work website
(https://cardinalatwork.stanford.edu/benefits-rewards) provides
detailed information on Stanford's extensive range of benefits and
rewards offered to employees. Specifics about the rewards package
for this position may be discussed during the hiring process.The
job duties listed are typical examples of work performed by
positions in this job classification and are not designed to
contain or be interpreted as a comprehensive inventory of all
duties, tasks, and responsibilities. Specific duties and
responsibilities may vary depending on department or program needs
without changing the general nature and scope of the job or level
of responsibility. Employees may also perform other duties as
assigned.Consistent with its obligations under the law, the
University will provide reasonable accommodations to applicants and
employees with disabilities. Applicants requiring a reasonable
accommodation for any part of the application or hiring process
should contact Stanford University Human Resources by submitting a
contact form.Stanford is an equal employment opportunity and
affirmative action employer. All qualified applicants will receive
consideration for employment without regard to race, color,
religion, sex, sexual orientation, gender identity, national
origin, disability, protected veteran status, or any other
characteristic protected by law.Business Affairs, Redwood City,
California, United States
#J-18808-Ljbffr
Keywords: Stanford University, Tracy , Service Reliability Engineer, Engineering , Redwood City, California
Didn't find what you're looking for? Search again!
Loading more jobs...