Reliability Engineer (SRE) - Application Performance Specialist
New
BrazilFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Languages
- English
- Required Skills
- AWSDockerNode.jsPostgreSQLPythonBashKubernetesMongoDBNest.jsLinux
Requirements
- Strong experience in Software Engineering, Site Reliability Engineering (SRE), or similar roles.
- Hands-on experience developing and optimizing Node.js applications, preferably with NestJS.
- Solid knowledge of Linux systems, command-line tools, and system troubleshooting.
- Experience with monitoring, logging, and incident response tools and practices.
- Ability to write automation and operational scripts in Python, Bash, or similar languages.
- Strong English communication skills for collaboration in international environments.
- Experience working with cloud environments such as AWS is a plus.
- Familiarity with PostgreSQL and MongoDB performance tuning and database optimization.
- Knowledge of containerization and orchestration tools such as Docker and Kubernetes is a plus.
- Strong analytical mindset with excellent problem-solving and incident management skills.
Responsibilities
- Design, develop, and maintain scalable and high-performance Node.js applications using frameworks such as NestJS, with PostgreSQL and MongoDB databases.
- Ensure system reliability, stability, and efficiency across application and infrastructure layers.
- Optimize application performance for scalability, responsiveness, and resource efficiency.
- Implement and manage monitoring, alerting, and observability systems to proactively detect and resolve issues.
- Conduct root cause analysis of production incidents and implement long-term preventive solutions.
- Collaborate with engineering teams to identify performance bottlenecks and improve system design.
- Develop and maintain technical documentation for system configurations, operations, and troubleshooting procedures.
View Full Description & ApplyYou'll be redirected to the employer's site