Senior Site Reliability Engineer, Infrastructure Foundations
New
W
Wikimedia FoundationNonprofit Internet Services
Please note that we are currently able to hire in the following: US States: Arizona, California, Colorado, Connecticut, District of Columbia*, Florida, Georgia, Idaho, Illinois, Indiana, Iowa, Maryland, Massachusetts, Michigan, Minnesota, Missouri, New Jersey, New Mexico, New York, North Carolina, Ohio, Oklahoma, Oregon, Pennsylvania, Puerto Rico*, Rhode Island, Tennessee, Texas, Utah, Vermont, Virginia, Washington, West Virginia, Wisconsin and Wyoming (*US Territory or Federal District) Countries: Brazil, Canada, Colombia, France, Germany, Ghana, India, Indonesia, Italy, Kenya*, Mexico, Morocco, Netherlands, Poland, Singapore*, South Africa, Spain, Switzerland and the United Kingdom.Full-TimeSenior
Salary113,082 - 175,725 USD per year
Apply NowOpens the employer's application page
Job Details
- Languages
- English
- Experience
- 6+ years
- Required Skills
- PythonKubernetesLinuxDevOps
Requirements
- 6+ years of experience in an SRE/Operations/DevOps role.
- Proficiency with Python and shell scripting (Bash).
- Experience with configuration management tools (Puppet, Ansible).
- Strong Linux system-level troubleshooting skills (Debian focus).
- Experience designing and managing infrastructure security.
- Background in technical incident response and post-incident review rituals.
- Proven history of automating tasks and processes.
- Strong verbal and written English communication skills.
- Ability to work independently in a globally distributed team.
Responsibilities
- Perform day-to-day operational/DevOps tasks on infrastructure including deployment, maintenance, and troubleshooting.
- Implement and utilize configuration management and deployment tools such as Puppet and Kubernetes.
- Lead continuous improvement by automating installation, configuration, and maintenance of services.
- Assist in the architectural design of new services to ensure scalability.
- Participate in a 24/7 on-call rotation for incident response and system diagnosis.
- Collaborate with a global, cross-functional team in an asynchronous environment.
- Mentor peers in technical and operational areas.
- Travel 1-2 times per year for in-person meetings.
View Full Description & ApplyYou'll be redirected to the employer's site