
Senior Platform Engineer

Posted about 2 months ago

💎 Seniority level: Senior

📍 Location: Anywhere

🔍 Industry: Software Development

🏢 Company: Timescale

👥 101-250

💰 $110,000,000 Series C about 3 years ago

Database, Computer, Information Services, Software

🗣️ Languages: English

Requirements:
  • Strong industry experience operating Postgres databases at scale
  • Established history of incident management in the context of database operations
  • Good software engineering and distributed systems fundamentals
  • Experience working with and debugging high-availability Postgres setups using Patroni or other industry standard HA tooling
  • Intimate knowledge of the Postgres ecosystem, including extensions (e.g., pg_stat_statements, postgres_fdw, plpython) and operational tools (e.g., pgbouncer, pgbackrest, pganalyze)
  • Deep experience with analyzing database performance, tuning Postgres parameters, and optimizing Postgres workloads.
Responsibilities:
  • Ensuring operational excellence, performance, and observability for our fleet of Postgres/Timescale instances
  • Diving deep into complex performance issues faced by our Cloud customers
  • Designing and implementing database related Cloud features, working across many layers of the stack to ensure a seamless cloud-first database experience
  • Instrumenting excellent database observability from the application to the kernel level

Related Jobs

  • Experience setting up and maintaining distributed environments; know-how to design, implement and operate services under functional, scalability or cost constraints
  • Experience working in a DevOps-focused culture, where the team you'll join supports feature teams in accelerating time to market through the tools and baselines it offers
  • Experience in AWS services and container-related technologies, especially EKS
  • Experience in automation and programming to keep Infrastructure as Code
  • Knowledge of Golang/Python
  • Engage in and improve the whole lifecycle of Services, from inception and design through deployment, operation and refinement
  • Maintain Services once they are live by measuring and monitoring availability, latency and overall system health
  • End-to-end responsibility for the Services that we own, including taking part in an on-call rotation and practicing sustainable incident response and blameless postmortems
  • Automate as much as humanly possible and always configure as Code
  • Advocate sensible, scalable, systems design and share responsibility within the organisation in diagnosing, resolving and preventing production issues through mechanisms like automation
  • Evolve Services by pushing for changes that improve reliability and velocity
  • Contribute to our Internal Tooling set that helps us improve our Operations processes, manage our Infrastructure, and scale our Systems
Posted about 3 hours ago

📍 United States

🧭 Full-Time

💸 150,000 - 195,000 USD per year

🔍 Software Development

🏢 Company: Second Dinner

  • Strong experience in operating and scaling backend services in AWS or other cloud infrastructures
  • Proficiency in programming languages such as .NET/C# or similar for building backend systems and automation tools
  • Experience integrating backend platform services with web applications
  • Experience with SQL and NoSQL databases for high-traffic, zero-downtime systems
  • Hands-on experience with monitoring, logging, and observability tools (e.g., Honeycomb, Prometheus, Grafana, ELK Stack)
  • Strong understanding of security best practices for cloud environments, including IAM, network security, and data encryption
  • Experience being in on-call rotation to operate services with live players and proven ability to troubleshoot live service issues and implement sustainable solutions
  • Effective communication skills to articulate technical concepts and collaborate with cross-functional teams
  • A proactive and collaborative mindset to contribute to team success
  • Partner with the game teams to implement and integrate platform services, such as authentication, accounts, payment processing, and compliance management across App Store, Google Play, and Steam
  • Develop and integrate platform services with a serverless backend architecture to ensure scalability, reliability, and efficiency
  • Enable platform service integrations with various systems, such as web shop and other external services, to provide consistent and secure player experiences
  • Ensure seamless functionality through rigorous testing and adherence to platform-specific requirements
  • Monitor, troubleshoot, and optimize backend systems to maintain high availability and performance
  • Enhance the security and reliability of cloud-based infrastructure, implementing best practices for cloud security
  • Provide technical guidance and contribute to operational efficiency and scalability improvements

AWS, Backend Development, SQL, Cloud Computing, Grafana, Prometheus, REST API, Serverless, NoSQL, Communication Skills, Compliance, Troubleshooting

Posted about 6 hours ago

🧭 Full-Time

🔍 Software Development

🏢 Company: Fieldguide

👥 101-250

💰 $30,000,000 Series B about 1 year ago

Artificial Intelligence (AI), Document Management

  • 5+ years of total experience in SRE, DevOps, platform, or infrastructure roles
  • Experience with high-uptime, globally distributed environments and IaC principles
  • Experience with cloud platforms, preferably AWS
  • Experience with container runtime architectures, preferably Kubernetes
  • Proficiency with at least one programming language, preferably TypeScript, Python, or Go
  • Experience leveraging network and security concepts to meet business compliance needs
  • Experience with relational database design and administration
  • Serve as a steward of our platform by designing, implementing, and testing our infrastructure to meet our application feature and scalability goals
  • Foster a learning culture within the team by conducting code reviews, root cause analysis, and continuous learning exercises
  • Serve as a subject matter expert on our infrastructure to other teams seeking to build their services upon it
  • Coordinate closely with the Application pillar in our Platform team as we tackle major initiatives like multi-region deployments and telemetry improvements
Posted 1 day ago

📍 United States

🧭 Full-Time

💸 145,000 - 185,000 USD per year

🔍 Software Development

🏢 Company: DomainTools

👥 11-50

Web Hosting, Security, Information Technology, Cyber Security

  • 7+ years of experience in Linux systems engineering roles supporting bare metal servers and virtualization/container platforms
  • 3+ years' Kubernetes administration experience on Red Hat OpenShift
  • Experience building and managing infrastructure in both public cloud and physical data center environments using IaC tools
  • 5+ years' experience with enterprise monitoring and logging solutions like Prometheus, ELK, or similar
  • Proven ability to automate the right things in the simplest way possible (scripts, config management tools, CI pipelines, RHOS Operators, etc.)
  • Solid understanding of networking fundamentals and storage technologies
  • Competency in at least one high-level programming language (e.g., Golang, Python)
  • Experience supporting customer-facing SaaS products
  • Translates high-level platform design into low-level technical design and is responsible for implementing, administering, supporting, and patching the corresponding platforms.
  • Installs, configures, and monitors applications and services in the OpenShift cluster.
  • Continually assesses technical components to recommend platform improvements, translating high-level design and RHOS best practices into low-level technical configuration.
  • Ensures the ongoing stability, availability, performance, and security compliance of the platform to meet customer SLAs; authors and executes test cases to validate
  • Collaborates with software delivery teams and architects to build and support self-service mechanisms, CI/CD pipelines, and k8s operators that simplify and accelerate service delivery, in accordance with DevOps and Agile frameworks
  • Maintains the catalog of services for the platform in collaboration with Engineering.
  • Instruments and optimizes application, system, and cluster performance.
  • Forecasts and plans capacity increases to ensure resource availability for engineering teams while meeting budget targets.
  • Helps build and implement Disaster Recovery / Business Continuity plan; conducts related testing of recovery procedures.
  • Helps determine Platform roadmap, manage projects and ticket-based work; ensures these are clearly communicated with stakeholders at all levels.
  • Provides thought leadership on DevOps and Platform Engineering-centric system and process design, giving constructive input to engineers and leaders on proposals and best practices.
  • Builds internal documentation and artifacts describing the mechanisms used for deployment, monitoring, and operators.
  • Leads by showing: mentors and helps develop engineers in a highly demonstrative and collaborative way
  • Participates in an on-call rotation with fellow team members

AWS, Python, Bash, Cloud Computing, Jenkins, Kafka, Kubernetes, Go, Prometheus, CI/CD, Linux, DevOps, Terraform, Networking, Ansible, Scripting, SaaS

Posted 3 days ago

🧭 Full-Time

🔍 Software Development

🏢 Company: WunderGraph, Inc.

  • Senior level of proven experience as a Platform/DevOps or SRE engineer
  • Proficiency in a programming language, preferably Go
  • Excellent understanding of the web, networking and distributed systems
  • Profound experience with at least one cloud provider (GCP, AWS, Azure), GCP preferred
  • Experience with Kubernetes, Docker, Terraform, Helm, Prometheus, Grafana, ELK, etc.
  • Familiar with meeting information security requirements (SOC 2, ISO 27001) in a cloud environment
  • Excellent communicator (important for a remote team) in English
  • Helping the team to implement, deploy and maintain the necessary infrastructure
  • Operating and orchestrating our cloud infrastructure (GCP, GKE)
  • Extending our custom telemetry pipeline to ingest and process billions of events
  • Defining adequate monitoring of our cloud services, with a sharp eye on security
  • Implementing and deploying measures to make our cloud even more robust, secure and scalable across the globe / on the edge
  • Automating as much as possible
  • Helping customers integrate Cosmo into their existing infrastructure
  • Efficient use of our cloud resources (traffic, load, provisioning)
  • Documenting and planning new solutions
Posted 5 days ago

📍 Poland

  • Vast experience with container orchestration platforms like Kubernetes
  • Experience using Terraform and popular CI/CD tools
  • Experience building scalable and secure production HA environments using AWS
  • Ability to develop tools or scripts in Bash or Go to automate work
NOT STATED

AWS, Bash, Kubernetes, Go, CI/CD, Terraform

Posted 6 days ago

  • Experience with container orchestration platforms like Kubernetes
  • Knowledge of Terraform and popular CI/CD tools
  • Experience with building scalable and secure production HA environments using AWS
  • Ability to develop tools or scripts in Bash or Go to automate work
NOT STATED
Posted 7 days ago

📍 Philippines

🧭 Full-Time

🔍 Software Development

🏢 Company: Dev Partners

👥 101-250

IT Management, Information Technology, Software

  • At least 5 years of experience in software development
  • Strong knowledge of software development best practices, especially for building APIs.
  • Hands-on experience with CRM tools.
  • Proficiency in Python, JavaScript and TypeScript
  • Designing, building, and maintaining integrations between internal tools, HRIS, CRM tools, no-code/low-code platforms, middleware, and other business systems.
  • Developing and enhancing API-driven workflows to automate processes and ensure seamless data exchange across platforms.
  • Ensuring integrations and APIs are secure, scalable, and well-documented following industry best practices.
  • Architecting and implementing cloud infrastructure solutions to support business operations, ensuring high availability and fault tolerance.
  • Collaborating with internal teams to design systems that scale effectively and meet performance benchmarks.
  • Using TypeScript, JavaScript, Python, or similar languages to develop platform-specific solutions and automation scripts.
  • Building, testing, and maintaining modular code to enable integration between various tools and environments.
  • Participating in code reviews to ensure high standards for quality and maintainability.
  • Partnering with stakeholders to define requirements, manage timelines, and ensure project deliverables align with business goals.
  • Providing technical guidance to cross-functional teams on large-scale platform projects and integrations.
  • Implementing monitoring, observability, and alerting tools to identify and resolve issues proactively.
  • Optimizing database and infrastructure performance to ensure seamless operations across integrated systems.
  • Identifying and implementing cost-saving strategies without compromising reliability or performance.
  • Leading incident response efforts, conducting postmortem analysis, and applying learnings to improve system reliability.
  • Collaborating with DevOps teams to improve CI/CD pipelines and enhance deployment processes.

Backend Development, Python, Software Development, Cloud Computing, JavaScript, TypeScript, API testing, CI/CD, RESTful APIs, CRM

Posted 10 days ago

📍 UK, Ireland, Israel, Estonia, Spain

🧭 Contract

🔍 Software Development

🏢 Company: DoiT

👥 501-1000

💰 $100,000,000 Series A over 5 years ago

Internet of Things, Big Data, Cloud Computing, Robotics, Analytics, Information Technology

  • 6+ years of proven experience in platform engineering, DevOps engineering, or related roles, with a strong track record of building and maintaining complex cloud infrastructure.
  • Strong hands-on experience with AWS/GCP, Kubernetes (EKS/GKE), and Terraform.
  • Demonstrated expertise in building and maintaining scalable, reliable, and secure cloud infrastructure, with a focus on automation and efficiency.
  • Strong coding skills in Go, TypeScript, or other relevant languages.
  • Proven experience with CI/CD tools, such as Argo CD, Atlantis, or similar technologies, and a deep understanding of CI/CD principles and best practices.
  • Understanding of networking concepts and protocols.
  • Extensive experience with monitoring and logging tools, such as Prometheus, Grafana, and the ELK stack, and a proven ability to use these tools to diagnose and resolve performance issues.
  • Knowledge of security best practices for cloud environments.
  • Excellent communication skills in English, both written and verbal.
  • Self-organized, goal-oriented, and self-motivated.
  • Ability to work effectively in a remote and distributed team environment.
  • Prior experience working specifically on platform engineering projects.
  • Function as an individual contributor within the team.
  • Architect, Design, and Implement Infrastructure as Code (IaC) using Terraform
  • Deploy, Manage, and Optimize Kubernetes Clusters on AWS (EKS) and GCP (GKE)
  • Develop and Maintain Sophisticated CI/CD Pipelines for Platform Components
  • Diagnose, Troubleshoot, and Resolve Platform-Related Issues
  • Drive Automation Initiatives to Streamline Operational Tasks and Enhance System Reliability
  • Act as a Strategic Partner to Development Teams, Understanding and Addressing Their Infrastructure Needs
  • Contribute to the Development of Internal Tools and Services to Enhance Platform Functionality
  • Implement and Enforce Rigorous Security Best Practices and Ensure Compliance with Industry Standards

AWS, GCP, Kubernetes, TypeScript, Go, Grafana, Prometheus, CI/CD, Linux, DevOps, Terraform, Scripting

Posted 10 days ago

📍 UK, Ireland, Israel, Estonia, Spain, other East Europe locations, Portugal

🧭 Contract

🔍 Software Development

🏢 Company: DoiT

👥 501-1000

💰 $100,000,000 Series A over 5 years ago

Internet of Things, Big Data, Cloud Computing, Robotics, Analytics, Information Technology

  • Strong hands-on experience with AWS/GCP, Kubernetes (EKS/GKE), and Terraform.
  • Demonstrated expertise in building and maintaining scalable, reliable, and secure cloud infrastructure, with a focus on automation and efficiency.
  • Strong coding skills in Go, TypeScript, or other relevant languages.
  • Proven experience with CI/CD tools, such as Argo CD, Atlantis, or similar technologies, and a deep understanding of CI/CD principles and best practices.
  • Understanding of networking concepts and protocols.
  • Extensive experience with monitoring and logging tools, such as Prometheus, Grafana, and the ELK stack, and a proven ability to use these tools to diagnose and resolve performance issues.
  • Knowledge of security best practices for cloud environments.
  • Excellent communication skills in English, both written and verbal.
  • Self-organized, goal-oriented, and self-motivated.
  • Ability to work effectively in a remote and distributed team environment.
  • Prior experience working specifically on platform engineering projects.
  • Function as an individual contributor within the team: actively collaborating with peers through thorough code reviews, providing constructive support and mentorship, and contributing to a unified technical direction for the platform.
  • Architect, Design, and Implement Infrastructure as Code (IaC) using Terraform
  • Deploy, Manage, and Optimize Kubernetes Clusters on AWS (EKS) and GCP (GKE)
  • Develop and Maintain Sophisticated CI/CD Pipelines for Platform Components
  • Diagnose, Troubleshoot, and Resolve Platform-Related Issues
  • Drive Automation Initiatives to Streamline Operational Tasks and Enhance System Reliability
  • Act as a Strategic Partner to Development Teams, Understanding and Addressing Their Infrastructure Needs
  • Contribute to the Development of Internal Tools and Services to Enhance Platform Functionality
  • Implement and Enforce Rigorous Security Best Practices and Ensure Compliance with Industry Standards

AWS, GCP, Kubernetes, TypeScript, Go, Grafana, Prometheus, CI/CD, RESTful APIs, Linux, DevOps, Terraform, Microservices

Posted 10 days ago