Apply

Cloud Platform Engineer

Posted about 2 months agoViewed

View full description

💎 Seniority level: Senior, 5+ years

📍 Location: United States

💸 Salary: 130000.0 - 150000.0 USD per year

🔍 Industry: Software Development

🏢 Company: AffiniPay👥 501-1000💰 Private 9 months agoFinancial ServicesPaymentsFinTech

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWSDockerPythonAWS EKSBashCloud ComputingElasticSearchGitJenkinsKafkaKubernetesMySQLCI/CDLinuxDevOpsTerraformScripting

Requirements:
  • 5+ years of professional technical experience in software engineering, AWS cloud operations or a related function.
  • 3+ years of experience within a DevOps or SRE role
  • Experience implementing and maintaining kubernetes specific CICD pipelines using CircleCI (preferred), ArgoCD (preferred), Jenkins, Github Actions, Gitlab, Azure Devops, etc.
  • Experience creating internal tooling that supports developer productivity
  • Experience designing and implementing automated development environments
  • Documented experience provisioning and managing infrastructure in a public cloud environment such as AWS (heavily preferred), Google Cloud Platform, or Azure
  • Experience with Linux container technologies, such as Docker, OCI, LXC and ability to administer, build and deploy images in an automated manner.
  • Solid understanding of how a Kubernetes Platform operates (service discovery, deployments, monitoring, scheduling, load balancing)
  • Proficiency in at least one, ideally more of the following programming or scripting languages: Ruby, Python, Java, Javascript (NodeJS), Bash
  • Experience utilizing Infrastructure as Code to provision and maintain infrastructure: Terraform (preferred), CloudFormation
  • Understanding of relational database systems such as MySQL (preferred) and PostgreSQL
Responsibilities:
  • Automate deployment, monitoring, and management for 100% AWS cloud based infrastructure
  • Develop, implement and own developer-focused internal tooling and automation as your product
  • Share in new feature design as the cloud SME supporting container/k8s first designs
  • Create and maintain automated CICD pipelines that streamline the release process and ensure high-quality code is delivered rapidly and continuously
  • Collaborate with developers to identify pain points and bottlenecks in the development process and design and implement solutions that improve the developer experience and increase overall productivity.
  • Put together comprehensive onboarding programs and training resources to seamlessly integrate new developers into the organization
  • Participate in an on-call rotation and provide emergency support outside of normally scheduled hours as needed
  • Evolve and extend observability tools and systems to determine and resolve performance and usability issues
Apply

Related Jobs

Apply

📍 USA

🧭 Full-Time

🔍 Software Development

🏢 Company: Sift👥 251-500💰 Secondary Market about 3 years agoFraud DetectionBig DataPredictive AnalyticsAnalyticsNetwork Security

  • 8+ years of experience as a Software Engineer focused on infrastructure/platform services or in a Site Reliability Engineering (SRE) role.
  • Strong programming skills in languages such as Java, Scala, or Python.
  • Experience designing and implementing distributed systems.
  • Experience building and managing cloud infrastructure on AWS or GCP.
  • Expertise in building infrastructure as code and automating provisioning processes using tools like CloudFormation or Terraform.
  • Proficiency in setting up and managing monitoring and alerting systems, both open-source and commercial.
  • Familiarity with Docker and container orchestration technologies like Kubernetes, GKE, or AWS ECS.
  • Strong experience troubleshooting and resolving production system issues, with a focus on building automated solutions to prevent future occurrences.
  • Proven expertise in automation and a solid understanding of configuration management tools.
  • Own the availability, performance, and scalability of Sift’s primary online storage systems and infrastructure
  • Design and build immutable infrastructure and fault-tolerant, multi-AZ/multi-region systems that are resilient and self-healing.
  • Design and Implement multi-region deployments, such as BigTable clusters spanning multiple regions, with strategies to ensure specific customers are routed to designated regions (e.g., sticky sessions at the regional level).
  • Solve complex problems that arise from our unique data volume and request rate which may involve digging deep into data store and messaging internals
  • Optimize local development and testing workflows to be fast, efficient, and seamless.
  • Design and implement services and libraries for components to interact with data stores, messaging layer and services platform
  • Develop tools for monitoring, detecting faults, and automatically repairing distributed systems
  • Provide design support to internal engineering teams for optimal usage of data stores, data growth planning, production workload optimization, messaging, caching and service platform
  • Participate in on-call support and incident response activities, providing 12/7 coverage for one calendar week approximately once every 3-4 weeks.

AWSDockerPythonSQLCloud ComputingGCPJavaJenkinsKafkaKubernetesRubyRuby on RailsSnowflakeAirflowAlgorithmsData engineeringData StructuresREST APISparkCI/CDProblem SolvingRESTful APIsLinuxDevOpsTerraformMicroservicesScalaData modelingScriptingData managementDebugging

Posted about 2 months ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: Collectors👥 1001-5000💰 $100,000,000 Private about 3 years agoConsumer ResearchConsumer SoftwareConsumer Applications

  • 5+ years of progressively responsible DevOps experience.
  • 4+ years of direct experience building end-to-end CI/CD pipelines using some of the popular platforms (i.e., Jenkins, GitHub Actions, CircleCI)
  • 4+ years of AWS cloud experience is mandatory (GCP is a plus). AWS experience should include Computing (EC2, Lambda, Containers), Networking (DNS, VPC, TGW), Storage (S3, EBS, EFS) and Database (RDS, DynamoDB).
  • 4+ years of experience in managing large-scale Infrastructure-as-Code stacks using Terraform (or CloudFormation)
  • Kubernetes and Docker experience is mandatory.
  • Experience with Observability tools is a plus (Datadog, Prometheus, New Relic)
  • Collaborate with cross-functional teams to deliver high-quality products and services.
  • Help development teams to build their Infrastructure through a self-serving platform.
  • Implement and maintain monitoring and alerting systems to promptly detect and respond to issues.
  • Troubleshoot and resolve issues related to infrastructure automation, ensuring minimal downtime and optimal system performance.
  • Develop and maintain infrastructure as code (IaC) using tools such as Terraform and Crossplane.
  • Architect & Build innovative automation projects to help reduce day-to-day toil.
  • Participate in capacity planning and disaster recovery exercises.
  • Implement and manage CI/CD pipelines to automate the build, test, and deployment processes, enabling rapid, secure, and reliable delivery of software updates;
  • Advocate for best practices in DevOps and reliability engineering across the organization.
  • Evaluate and integrate new technologies to enhance our infrastructure.
  • Be part of a weekly on-call rotation.

AWSDockerPythonSQLBashCloud ComputingGitJenkinsKubernetesCommunication SkillsAnalytical SkillsCI/CDRESTful APIsLinuxDevOpsTerraformNetworkingExcellent communication skillsAdaptabilityProblem-solving skillsTeamworkTroubleshootingJSON

Posted 5 months ago
Apply