Cloud Platform Engineer

Posted about 2 months ago

💎 Seniority level: Senior, 5+ years

📍 Location: United States

💸 Salary: 130,000 - 150,000 USD per year

🔍 Industry: Software Development

🏢 Company: AffiniPay 👥 501-1000 💰 Private 9 months ago · Financial Services, Payments, FinTech

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWS, Docker, Python, AWS EKS, Bash, Cloud Computing, Elasticsearch, Git, Jenkins, Kafka, Kubernetes, MySQL, CI/CD, Linux, DevOps, Terraform, Scripting

Requirements:

5+ years of professional technical experience in software engineering, AWS cloud operations or a related function.
3+ years of experience within a DevOps or SRE role.
Experience implementing and maintaining Kubernetes-specific CI/CD pipelines using CircleCI (preferred), ArgoCD (preferred), Jenkins, GitHub Actions, GitLab, Azure DevOps, etc.
Experience creating internal tooling that supports developer productivity
Experience designing and implementing automated development environments
Documented experience provisioning and managing infrastructure in a public cloud environment such as AWS (heavily preferred), Google Cloud Platform, or Azure
Experience with Linux container technologies such as Docker, OCI, and LXC, and the ability to administer, build, and deploy images in an automated manner.
Solid understanding of how a Kubernetes Platform operates (service discovery, deployments, monitoring, scheduling, load balancing)
Proficiency in at least one, ideally more, of the following programming or scripting languages: Ruby, Python, Java, JavaScript (Node.js), Bash
Experience utilizing Infrastructure as Code to provision and maintain infrastructure: Terraform (preferred), CloudFormation
Understanding of relational database systems such as MySQL (preferred) and PostgreSQL

Responsibilities:

Automate deployment, monitoring, and management for 100% AWS cloud-based infrastructure
Develop, implement and own developer-focused internal tooling and automation as your product
Share in new feature design as the cloud SME supporting container/k8s-first designs
Create and maintain automated CI/CD pipelines that streamline the release process and ensure high-quality code is delivered rapidly and continuously
Collaborate with developers to identify pain points and bottlenecks in the development process, then design and implement solutions that improve the developer experience and increase overall productivity
Put together comprehensive onboarding programs and training resources to seamlessly integrate new developers into the organization
Participate in an on-call rotation and provide emergency support outside of normally scheduled hours as needed
Evolve and extend observability tools and systems to determine and resolve performance and usability issues

Related Jobs

🔥 Staff Cloud Platform Engineer - Core Infra

Posted about 2 months ago

📍 USA

🧭 Full-Time

🔍 Software Development

🏢 Company: Sift 👥 251-500 💰 Secondary Market about 3 years ago · Fraud Detection, Big Data, Predictive Analytics, Analytics, Network Security

🔧 Requirements

8+ years of experience as a Software Engineer focused on infrastructure/platform services or in a Site Reliability Engineering (SRE) role.
Strong programming skills in languages such as Java, Scala, or Python.
Experience designing and implementing distributed systems.
Experience building and managing cloud infrastructure on AWS or GCP.
Expertise in building infrastructure as code and automating provisioning processes using tools like CloudFormation or Terraform.
Proficiency in setting up and managing monitoring and alerting systems, both open-source and commercial.
Familiarity with Docker and container orchestration technologies like Kubernetes, GKE, or AWS ECS.
Strong experience troubleshooting and resolving production system issues, with a focus on building automated solutions to prevent future occurrences.
Proven expertise in automation and a solid understanding of configuration management tools.

💡 Responsibilities

Own the availability, performance, and scalability of Sift’s primary online storage systems and infrastructure
Design and build immutable infrastructure and fault-tolerant, multi-AZ/multi-region systems that are resilient and self-healing.
Design and implement multi-region deployments, such as Bigtable clusters spanning multiple regions, with strategies to ensure specific customers are routed to designated regions (e.g., sticky sessions at the regional level).
Solve complex problems that arise from our unique data volume and request rate, which may involve digging deep into data store and messaging internals.
Optimize local development and testing workflows to be fast, efficient, and seamless.
Design and implement services and libraries for components to interact with data stores, messaging layer and services platform
Develop tools for monitoring, detecting faults, and automatically repairing distributed systems
Provide design support to internal engineering teams for optimal usage of data stores, data growth planning, production workload optimization, messaging, caching and service platform
Participate in on-call support and incident response activities, providing 12/7 coverage for one calendar week approximately once every 3-4 weeks.

🪄 Skills: AWS, Docker, Python, SQL, Cloud Computing, GCP, Java, Jenkins, Kafka, Kubernetes, Ruby, Ruby on Rails, Snowflake, Airflow, Algorithms, Data engineering, Data Structures, REST API, Spark, CI/CD, Problem Solving, RESTful APIs, Linux, DevOps, Terraform, Microservices, Scala, Data modeling, Scripting, Data management, Debugging

🔥 Senior Cloud Platform Engineer (Remote)

Posted 5 months ago

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: Collectors 👥 1001-5000 💰 $100,000,000 Private about 3 years ago · Consumer Research, Consumer Software, Consumer Applications

🔧 Requirements

5+ years of progressively responsible DevOps experience.
4+ years of direct experience building end-to-end CI/CD pipelines using popular platforms (e.g., Jenkins, GitHub Actions, CircleCI).
4+ years of AWS cloud experience is mandatory (GCP is a plus). AWS experience should include Computing (EC2, Lambda, Containers), Networking (DNS, VPC, TGW), Storage (S3, EBS, EFS) and Database (RDS, DynamoDB).
4+ years of experience in managing large-scale Infrastructure-as-Code stacks using Terraform (or CloudFormation)
Kubernetes and Docker experience is mandatory.
Experience with Observability tools is a plus (Datadog, Prometheus, New Relic)

💡 Responsibilities

Collaborate with cross-functional teams to deliver high-quality products and services.
Help development teams build their infrastructure through a self-service platform.
Implement and maintain monitoring and alerting systems to promptly detect and respond to issues.
Troubleshoot and resolve issues related to infrastructure automation, ensuring minimal downtime and optimal system performance.
Develop and maintain infrastructure as code (IaC) using tools such as Terraform and Crossplane.
Architect and build innovative automation projects to help reduce day-to-day toil.
Participate in capacity planning and disaster recovery exercises.
Implement and manage CI/CD pipelines to automate the build, test, and deployment processes, enabling rapid, secure, and reliable delivery of software updates.
Advocate for best practices in DevOps and reliability engineering across the organization.
Evaluate and integrate new technologies to enhance our infrastructure.
Be part of a weekly on-call rotation.

🪄 Skills: AWS, Docker, Python, SQL, Bash, Cloud Computing, Git, Jenkins, Kubernetes, Communication Skills, Analytical Skills, CI/CD, RESTful APIs, Linux, DevOps, Terraform, Networking, Excellent communication skills, Adaptability, Problem-solving skills, Teamwork, Troubleshooting, JSON
