Apply

Platform Engineer

Posted 5 months agoViewed

View full description

💎 Seniority level: Senior, 5-6 years

📍 Location: US

💸 Salary: 125000.0 - 155000.0 USD per year

🔍 Industry: IT consulting and software development

🏢 Company: Capital Technology Group👥 51-100ConsultingInformation Technology

⏳ Experience: 5-6 years

🪄 Skills: AWSDockerSoftware DevelopmentSQLAgileGitJenkinsKubernetesCollaborationCI/CDAgile methodologiesLinuxDevOpsTerraform

Requirements:
  • Bachelors degree.
  • 5-6 years of experience.
  • Strong knowledge of Git.
  • Strong knowledge of Docker/Kubernetes.
  • Strong knowledge of Linux.
  • Strong knowledge of CI/CD pipelines.
  • Experience working within a cloud environment (e.g. AWS, GCP, Azure).
  • Working knowledge of databases and SQL.
  • Problem-solving attitude.
  • Must be able to get up-to-speed quickly on new projects and technologies.
  • Strong communication and problem-solving skills.
Responsibilities:
  • Manage cloud Infrastructure-as-Code.
  • Develop, automate, and maintain CI/CD pipelines.
  • Perform root cause analysis for production errors.
  • Investigate and resolve technical issues.
  • Develop automation scripts.
  • Design procedures for system troubleshooting and maintenance.
  • May mentor and supervise software developers.
Apply

Related Jobs

Apply

📍 United States

🧭 Full-Time

💸 150000.0 - 195000.0 USD per year

🔍 Software Development

🏢 Company: Second Dinner

  • Strong experience in operating and scaling backend services in AWS or other cloud infrastructures
  • Proficiency in programming languages such as .NET/C# or similar for building backend systems and automation tools
  • Experience integrating backend platform services with web applications
  • Experience with SQL and NoSQL databases for high-traffic, zero-downtime systems
  • Hands-on experience with monitoring, logging, and observability tools (e.g., Honeycomb, Prometheus, Grafana, ELK Stack)
  • Strong understanding of security best practices for cloud environments, including IAM, network security, and data encryption
  • Experience being in on-call rotation to operate services with live players and proven ability to troubleshoot live service issues and implement sustainable solutions
  • Effective communication skills to articulate technical concepts and collaborate with cross-functional teams
  • A proactive and collaborative mindset to contribute to team success
  • Partner with the game teams to implement and integrate platform services, such as authentication, accounts, payment processing, and compliance management across App Store, Google Play, and Steam
  • Develop and integrate platform services with a serverless backend architecture to ensure scalability, reliability, and efficiency
  • Enable platform service integrations with various systems, such as web shop and other external services, to provide consistent and secure player experiences
  • Ensure seamless functionality through rigorous testing and adherence to platform-specific requirements
  • Monitor, troubleshoot, and optimize backend systems to maintain high availability and performance
  • Enhance the security and reliability of cloud-based infrastructure, implementing best practices for cloud security
  • Provide technical guidance and contribute to operational efficiency and scalability improvements

AWSBackend DevelopmentSQLCloud ComputingGrafanaPrometheusREST APIServerlessNosqlCommunication SkillsComplianceTroubleshooting

Posted about 7 hours ago
Apply
Apply

📍 U.S., U.K., Finland, India, Singapore, Canada, and Ireland

🧭 Full-Time

🔍 Software Development

🏢 Company: AlphaSense👥 1001-5000💰 $650,000,000 Series F 10 months agoArtificial Intelligence (AI)Search EngineMarket ResearchSaaSMachine LearningAnalytics

  • Proven experience in managing cloud platform teams with a focus on Kubernetes.
  • Hands-on experience in developing, deploying and managing Kubernetes clusters, with CKA level understanding of the different domains involved (architecture, workloads etc.).
  • Experience with cluster lifecycle management solutions, such as Rancher, ClusterAPI, Anthos or similar.
  • Solid background on working with Cloud APIs, with at least AWS plus one of GCP or Azure.
  • Strong understanding of cloud-native technologies and principles.
  • Understanding and prior usage of FinOps framework, principles and tooling that will support a sustainable and optimised growth for our workloads
  • Demonstrated ability to lead and grow teams in a fast-paced environment.
  • Experience with operational processes and production support in cloud-based ecosystems.
  • Excellent communication and interpersonal skills, with an ability to tailor messages to diverse audiences.
  • A track record of making informed decisions quickly, without compromising on quality or trust.
  • Commitment to fostering an inclusive and multiculturally sensitive team environment where all voices are heard.
  • Strategic mindset with an ability to think ahead and anticipate problems before they escalate.
  • Ability to act as an ambassador for the team, advocating for their work and needs across the organization.
  • A passion for personal growth, seeking feedback, and leading by example in both managerial and technical expertise.
  • Drive the delivery of high-quality Kubernetes clusters, prioritizing simplicity and focusing on solving core problems.
  • Maintain and elevate operational processes for the management of Kubernetes environments, emphasizing scalability and reliability, considering clusters full lifecycle.
  • Oversee the proper configuration of cloud services across AWS, GCP, and Azure, and ensure the secure integration with Identity Management and Cost Tracking services.
  • Act as a technical thought leader, providing insights into the strategic development of Kubernetes based solutions.
  • Implement talent frameworks to build a team with the right mix of cloud platform expertise and experience.
  • Champion the growth and development of the team, encouraging a continuous learning environment and supporting its members through coaching and mentoring, while ensuring long-term growth and psychological safety.
  • Drive OKRs (Objectives and Key results) for the team for each quarter, while actively participating in the conversation defining mission and R&D quarter and yearly OKRs.
  • Foster collaboration and effective communication within the team and across organizational boundaries.
  • Uphold rigorous standards for production support of our Kubernetes clusters, including incident management and root cause analysis.
  • Oversee the team’s participation in architectural decisions, advocating for the right trade-offs and technical choices.
  • Actively engage in hiring and retention strategies, including defining or updating role responsibilities and performance expectations.
  • Encourage a team culture that embraces AlphaSense’s Diversity Commitments and promotes organizational health.
  • Guide the team in strategic planning, ensuring effective use of resources and alignment with company-wide initiatives.
  • Facilitate a culture of innovation where team members contribute to both "what to build" and "how to build".
  • Lead by example, establishing yourself as a trustworthy and empathetic leader with a commitment to personal and team growth.

AWSDockerLeadershipCloud ComputingGCPKubernetesAzureCI/CDRESTful APIsDevOpsTerraform

Posted about 16 hours ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 145000.0 - 185000.0 USD per year

🔍 Software Development

🏢 Company: DomainTools👥 11-50Web HostingSecurityInformation TechnologyCyber Security

  • 7+ years of experience in Linux systems engineering roles supporting bare metal servers and virtualization/container platforms
  • 3+ years’ Kubernetes administration experience on Red Hat OpenShift.
  • Experience building and managing infrastructure in both public cloud and physical data center environments using IaC tools
  • 5+ years’ experience with enterprise monitoring and logging solutions like Prometheus, ELK, or similar
  • Proven ability to automate the right things in the simplest way possible (scripts, config management tools, CI pipelines, RHOS Operators, etc.)
  • Solid understanding of networking fundamentals and storage technologies
  • Competency in at least one high level programming language (i.e., Golang, Python, etc.)
  • Experience supporting customer-facing SaaS products
  • translate high level platform design into low level technical design and are responsible for implementing, administering, supporting, and patching their corresponding platforms.
  • Installs, configures, and monitors applications and services in the OpenShift cluster.
  • Continually assesses technical components to recommend platform improvements, translating high-level design and RHOS best practices into low-level technical configuration.
  • Ensures the ongoing stability, availability, performance, and security compliance of the platform to meet customer SLAs; authors and executes test cases to validate
  • Collaborates with software delivery teams and architects to build and support self-service mechanisms, CI/CD pipelines, and k8s operators that simplify and accelerate service delivery, in accordance with DevOps and Agile frameworks
  • Maintains the catalog of services for the platform in collaboration with Engineering.
  • Instruments and optimizes application, system, and cluster performance.
  • Forecasts and plans capacity increases to ensure resource availability for engineering teams while meeting budget targets.
  • Helps build and implement Disaster Recovery / Business Continuity plan; conducts related testing of recovery procedures.
  • Helps determine Platform roadmap, manage projects and ticket-based work; ensures these are clearly communicated with stakeholders at all levels.
  • Provides thought leadership on DevOps and Platform Engineering-centric system and process design, giving constructive input to engineers and leaders on proposals and best practices.
  • Builds internal documentation and artifacts describing the mechanisms used for deployment, monitoring, and operators.
  • Leads by showing: mentors and helps develop engineers in a highly demonstrative and collaborative way
  • Participates in an on-call rotation with fellow team members

AWSPythonBashCloud ComputingJenkinsKafkaKubernetesGoPrometheusCI/CDLinuxDevOpsTerraformNetworkingAnsibleScriptingSaaS

Posted 3 days ago
Apply
Apply

📍 California, Colorado, Florida, Georgia, Hawaii, Illinois, Maryland, Massachusetts, Michigan, Minnesota, New Hampshire, New York, North Carolina, North Dakota, Oregon, Pennsylvania, Rhode Island, South Carolina, Texas, Utah, Vermont, Virginia, Washington, Washington D.C., and Wisconsin

🧭 Full-Time

💸 173676.0 - 210741.0 USD per year

🔍 Software Development

🏢 Company: ActBlue👥 51-100💰 $22,000,000 Series A over 14 years agoPoliticsNon ProfitEnterprise Software

  • An exploratory and tenacious mindset when taking on tasks that might have little to no precedent at the organization - our team is relatively new and much of our work is setting up new standards or delving into areas of code that haven’t been touched in a while
  • Some experience working on problems across a front-end ecosystem—we are looking for someone who is curious about how Webpack works or who wants to optimize an application’s bundle size
  • A willingness to tackle a diverse range of problems within our front-end ecosystem, and an ability to work autonomously on problems that cross team boundaries and touch multiple codebases
  • A natural tendency towards documentation and knowledge sharing over siloing
  • Knowledge or curiosity around Javascript library management, especially around internal component libraries
  • Fluency moving between and across technical systems and stacks – or at least a willingness to try! We are typically coding in Javascript, but our work brings us into contact with work areas from Docker to design systems
  • Excitement for your own and your teammates’ learning and growth - we are a small (but growing) team that works very closely together!
  • A track record of effective collaboration with other engineers to develop abstractions and patterns that make it easy to build reliable software.
  • An understanding of and a desire to co-create systems that help build psychological safety on the team: sharing learning with others, using peer review as an opportunity to celebrate and build others up, and a willingness to practice the duality of listening and leadership.
  • Write maintainable code that is adaptable to future design and roadmap decisions to help set the standard for software quality for our team and the organization at large.
  • Lead the process of architecting, refactoring, and improving our contribution forms and the myriad user flows that an ActBlue user might interface with.
  • Guide the design and execution of technical solutions that prioritize the highest impact opportunities while balancing effort, scope, and other trade-offs.
  • Partner with engineering managers to find sponsorship and growth opportunities for your colleagues.
  • Demonstrate technical leadership by writing documentation, establishing effective monitoring, and fostering clear and audience-oriented communication.
  • Coach and mentor other engineers on your team and create spaces for individuals to be engaged, valued, and heard.

DockerGraphQLNode.jsCypressFrontend DevelopmentJavascriptJestKubernetesReact.jsRuby on RailsTypeScriptYarnReactCI/CDRESTful APIsDevOpsSoftware Engineering

Posted 4 days ago
Apply
Apply

📍 California, Colorado, Florida, Georgia, Hawaii, Illinois, Maryland, Massachusetts, Michigan, Minnesota, New Hampshire, New York, North Carolina, North Dakota, Oregon, Pennsylvania, Rhode Island, South Carolina, Texas, Utah, Vermont, Virginia, Washington, Washington D.C., and Wisconsin

🧭 Full-Time

💸 173676.0 - 210741.0 USD per year

🔍 Software Development

  • Some experience working on problems across a front-end ecosystem—we are looking for someone who is curious about how Webpack works or who wants to optimize an application’s bundle size
  • A willingness to tackle a diverse range of problems within our front-end ecosystem, and an ability to work autonomously on problems that cross team boundaries and touch multiple codebases
  • A natural tendency towards documentation and knowledge sharing over siloing
  • Knowledge or curiosity around Javascript library management, especially around internal component libraries
  • Fluency moving between and across technical systems and stacks – or at least a willingness to try! We are typically coding in Javascript, but our work brings us into contact with work areas from Docker to design systems
  • Excitement for your own and your teammates’ learning and growth - we are a small (but growing) team that works very closely together!
  • A track record of effective collaboration with other engineers to develop abstractions and patterns that make it easy to build reliable software.
  • An understanding of and a desire to co-create systems that help build psychological safety on the team: sharing learning with others, using peer review as an opportunity to celebrate and build others up, and a willingness to practice the duality of listening and leadership.
  • Write maintainable code that is adaptable to future design and roadmap decisions to help set the standard for software quality for our team and the organization at large.
  • Lead the process of architecting, refactoring, and improving our contribution forms and the myriad user flows that an ActBlue user might interface with.
  • Guide the design and execution of technical solutions that prioritize the highest impact opportunities while balancing effort, scope, and other trade-offs.
  • Partner with engineering managers to find sponsorship and growth opportunities for your colleagues.
  • Demonstrate technical leadership by writing documentation, establishing effective monitoring, and fostering clear and audience-oriented communication.
  • Coach and mentor other engineers on your team and create spaces for individuals to be engaged, valued, and heard.
  • Receive support from your manager to grow as an individual and increase your impact on the success of your team and the progressive movement.

DockerGraphQLLeadershipNode.jsFrontend DevelopmentJavascriptKubernetesReact.jsRuby on RailsTypeScriptYarnReactCI/CDProblem SolvingRESTful APIsMentoringDevOpsDocumentationTeamworkSoftware Engineering

Posted 4 days ago
Apply
Apply

📍 US

🏢 Company: New Era Technology👥 1001-5000InternetInformation TechnologyAudio/Visual Equipment

  • Strong communication and teamwork skills
  • Excellent problem-solving skills, attention to detail, and a commitment to continuous improvement.
  • Foundational knowledge or experience in IT, networking, or systems administration.
  • Basic scripting skills in Python, Bash, or PowerShell.
  • Strong desire to learn cloud security, serverless computing, and DevSecOps practices.
  • Familiarity with secure software development and CI/CD principles (experience is a plus).
  • Design, develop, and deploy security services and solutions using Python and other scripting languages (e.g., Bash, PowerShell).
  • Implement and maintain CI/CD pipelines for secure automated deployment of services to cloud providers.
  • Apply DevSecOps principles to ensure the secure and efficient delivery of software, incorporating security best practices into the development lifecycle.
  • Perform continuous monitoring of deployed solutions, ensuring high availability, performance, and reliability.
  • Troubleshoot errors or other issues arising from automated workflows.
  • Contribute to the knowledgebase and documentation of automation, share knowledge, and help promote a culture of continuous learning.
  • Participate in the design, implementation, review, and improvement of security architecture and application architecture.

AWSPythonBashCloud ComputingCybersecurityCI/CDRESTful APIsLinuxDevOpsTerraformScripting

Posted 12 days ago
Apply
Apply
🔥 Data Platform Engineer
Posted 15 days ago

📍 United States

🧭 Full-Time

💸 130000.0 - 160000.0 USD per year

🔍 Data & Analytics

🏢 Company: GameChanger

  • 4+ years of experience in software development as a data or backend engineer.
  • Proficiency in Python and SQL for processing data.
  • Experience with some of the following technologies (or similar tools): Airflow, Docker, Kafka, AWS, Terraform, Snowflake, Git version control / GHA cicd
  • Stay informed about relevant technology trends and developments and contribute to technical design discussions.
  • Enjoy working with others and collaborating to solve problems efficiently.
  • Develop and maintain scalable and efficient data pipelines to support analytics needs across the organization, with a focus on dynamic and generalized solutions.
  • Improve the performance, observability, and reliability of existing data systems including our orchestrator and our custom ETL service.
  • Collaborate with other teams to understand data needs and implement solutions that improve efficiency.
  • Ensure system security and data privacy compliance.
  • Participate in code reviews, technical documentation, and knowledge-sharing initiatives.

AWSBackend DevelopmentDockerPythonSoftware DevelopmentSQLETLGitKafkaSnowflakeAirflowData engineeringCI/CDTerraform

Posted 15 days ago
Apply
Apply

📍 USA

🧭 Full-Time

🔍 Audit and advisory

🏢 Company: Fieldguide👥 101-250💰 $30,000,000 Series B about 1 year agoArtificial Intelligence (AI)Document Management

  • Experience with ML frameworks
  • Experience with cloud platforms, preferably AWS
  • Experience with container runtime architectures, preferably Kubernetes
  • Proficiency with at least one programming language, preferably Python or Typescript
  • Familiarity with CI/CD practices and tools
  • Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation
  • Strong understanding of distributed systems and microservices architecture
  • Ability to work in a fast-paced, changing startup environment
  • Design and implement infrastructure for ML model management, including training, deployment, and monitoring
  • Build and maintain platforms for running ML algorithms at scale
  • Develop systems for A/B testing, performance monitoring, and continuous model training
  • Create and manage ETL infrastructure to support ML workflows
  • Implement best practices for MLOps, including version control for models and datasets
  • Collaborate with ML Engineers to optimize model performance and resource utilization
  • Ensure the scalability, reliability, and security of ML systems
  • Stay current with the latest advancements in MLOps and cloud technologies
  • Contribute to the development of internal tools and frameworks to improve ML workflow efficiency
  • Be an essential technical contributor at a Series B-stage company as it scales

AWSDockerPythonCloud ComputingETLKubernetesMachine LearningCI/CDTerraformMicroservicesSoftware Engineering

Posted 22 days ago
Apply
Apply

📍 Worldwide

🧭 Full-Time

🔍 Software Development

🏢 Company: Xapo Bank👥 251-500AccountingBitcoinFinancial ServicesBanking

  • Strong Problem solving skills and a growth mindset, the ability to learn and use the best in class technology and tools to address our requirements.
  • A good technical knowledge of software and systems – able to dive into details with engineers and speak in plain language with stakeholders.
  • Stays abreast of current technology developments and has demonstrated the ability to retain competitive advantage by implementing relevant technologies in software products.
  • Experience building and shaping developer’s environment as code and using pipelines, solid expertise in GitHub and GitHub action is mandatory.
  • Ideally is comfortable in a variety of scripting and coding languages.
  • Strong knowledge and experience in designing, deploying, and administering complex cloud environments. Preferably AWS or GCP certified and has a good understanding of cloud architecture best practices.
  • Strong understanding of Docker and best practices.
  • Proactively work with stakeholders like Engineering Team, Product and Security, to identify and build a robust, self-serve and scalable framework of automated actions
  • Be part of a technical decision group, as your inputs will influence the Platform landscape and roadmap.
  • Deliver self-serve capabilities to our developers that allow them to provision, manage and operate their own infrastructure and services deployment..
  • Build high quality code into everything we do by following industry best practices and at the same time being able to understand what Xapo needs and make decisions according to it.
  • Provide coaching and mentoring to colleagues around how to build sustainable services and automated pipelines.
  • Keep security and compliance at the forefront of all you do.

AWSDockerCloud ComputingGCPKubernetesCI/CDRESTful APIsTerraformMicroservicesScripting

Posted 27 days ago
Apply
Apply

📍 Canada, United States

🧭 Full-Time

💸 120000.0 - 140000.0 CAD per year

🔍 Software Development

🏢 Company: Creative Market👥 11-50💰 $7,000,000 Series A over 7 years agoMarketplaceE-Commerce PlatformsGraphic Design

  • Relevant search experience.
  • Technical proficiency in Python.
  • Familiarity with our core search-related technologies or equivalent services.
  • Demonstrated success working with containerized infrastructure (eg. Docker).
  • The capability to apply your knowledge of AWS.
  • Experience using infrastructure as code tools.
  • A good understanding of Linux and relational databases (eg. MySQL).
  • Owning and maintaining our search and ranking engine. This is built using Elasticsearch, Python, Flask, MySQL, and Terraform. It will be your job to ensure scalability, reliability, and efficiency while addressing bugs and performance bottlenecks.
  • Collaborating with our Product and Engineering teams to help with implementing new search features, such as filters or query handling improvements.
  • Identifying opportunities for incremental ranking and relevance improvements.
  • Supporting infrastructure initiatives, ensuring reliability, optimizing CI/CD, scaling services, and revolving deployment and infrastructure-related issues.
  • Participating in the on-call rotation, and responding to infrastructure and search-related incidents.

AWSDockerPostgreSQLPythonSQLAmazon RDSElasticSearchFlaskMySQLAlgorithmsAmazon Web ServicesData StructuresRedisCI/CDLinuxDevOpsTerraformScripting

Posted 27 days ago
Apply