RunPod, Inc.

RunPod, Inc. is a technology company focused on cloud solutions, currently looking to expand its team with positions such as Cloud Software Engineer and Technical Support Analyst (L2).

Related companies:

🏒 DigitalOcean
πŸ‘₯ 1001-5000πŸ’° $34,913,641 Post-IPO Equity over 3 years agoπŸ«‚ Last layoff about 2 years agoVirtualizationDevOpsWeb HostingCloud ComputingSaaS
Website LinkedIn Email Facebook Twitter

Jobs at this company:

Apply

πŸ“ United States, Canada, Europe

🧭 Full-Time

πŸ’Έ 90000.0 - 145000.0 USD per year

πŸ” AI and machine learning

  • 3+ years of experience in a broad IT Systems Administrator or similar role.
  • Hands-on expertise with networking, VPNs, and endpoint security administration.
  • Solid understanding of SSO solutions (e.g., Okta, Google Workspace) and identity/access management best practices.
  • Experience operating a distributed IT environment for a global remote workforce
  • Experience implementing zero trust frameworks with a distributed remote workforce
  • Experience leveraging Apple Business Manager and Windows InTune to perform automated OOBE deployments.
  • Experience with mobile device management (MDM) platforms (e.g., Hexnode, Jamf, Intune).
  • Experience supporting SOC 2 audits or similar compliance frameworks.
  • Proven track record in providing IT support and troubleshooting a range of user and system issues.
  • Strong organizational skills with the ability to multitask and prioritize in a fast-paced environment.
  • Excellent communication skills, comfortable collaborating across distributed teams.
  • Configure and maintain the corporate VPN, firewalls, and network equipment.
  • Oversee office and remote network setups.
  • Provision and de-provision accounts across all company systems.
  • Manage and integrate existing SSO solutions.
  • Centralize procurement, licensing, and distribution of software tools.
  • Coordinate with HR for hardware deployment.
  • Serve as the primary point of contact for technical support queries.
  • Establish and maintain helpdesk ticketing processes.
  • Lead SOC 2 IT initiatives.
  • Implement best practices around endpoint security, MFA, password policies, and data protection.
  • Develop and maintain IT policies, procedures, and user guides.
  • Create documentation for troubleshooting and knowledge-sharing.
  • Collaborate with leadership on IT roadmap planning.
  • Identify areas for automation and process improvement, implement solutions to optimize efficiency.

BashMicrosoft ExchangeAzureCI/CDLinuxDocumentationComplianceNetworkingTroubleshootingScripting

Posted 5 days ago
Apply
Apply
πŸ”₯ Forward Deployed Engineer
Posted about 1 month ago

πŸ“ United States, Canada, Europe

🧭 Full-Time

πŸ’Έ 90000.0 - 140000.0 USD per year

πŸ” AI and machine learning

  • Proficient in deploying and managing AI/ML models as service, with a strong understanding of training, fine-tuning, inference, and deployment.
  • 3+ years of software development experience in JavaScript, Go (Golang), or Python.
  • Strong problem-solving skills and ability to work in a collaborative environment.
  • Excellent communication skills and attention to detail.
  • Troubleshoot and resolve critical or complex technical issues escalated by customers
  • Utilize various tools and methods to identify root causes and deliver solutions
  • Communicate effectively with customers and internal teams
  • Participate in sales meetings with customers, explain RunPod’s specific technologies, provide architectural recommendations, and build proof-of-concept solutions
  • Assist the support team in troubleshooting and resolving escalated technical tickets
  • Contribute to product development and testing efforts by relaying feedback from customers and the support team to the engineering team, helping to shape product improvements.
  • Create and maintain technical documentation, such as knowledge base articles, FAQs, guides, and manuals, while also developing and delivering training sessions, webinars, and demos for customers, partners, and internal teams.

DockerPythonSoftware DevelopmentSQLCloud ComputingJavascriptMachine LearningGoCommunication SkillsCI/CDProblem SolvingRESTful APIsLinuxAccount ManagementTroubleshootingTechnical supportScriptingDebuggingCustomer support

Posted about 1 month ago
Apply
Apply
πŸ”₯ Senior Full-Stack Engineer
Posted about 2 months ago

πŸ“ United States, Canada, Europe

🧭 Full-Time

πŸ’Έ 160000.0 - 190000.0 USD per year

πŸ” Software Development

  • 10+ years of professional experience in full-stack development, with a strong emphasis on scalable PaaS platforms.
  • Deep expertise in Python (AI experience a plus).
  • Proficiency in TypeScript/JavaScript and Go Lang (strict type languages a plus).
  • Proficiency in developing frontend applications (React, Next.js a plus).
  • Proficiency in designing and maintaining high-performance APIs, microservices, and cloud integrations.
  • Experience with SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, DynamoDB, etc.), with a strong understanding of database design, indexing, and query optimization.
  • Hands-on experience with multi-region architectures, replication strategies, and designing for high availability.
  • Familiarity with event-driven patterns, message queues (Kafka, RabbitMQ, NATS, etc.), and pub/sub systems.
  • Experience with cloud platforms (AWS, GCP, or Azure) and best practices for serverless, containerization (Docker or Kubernetes), and cloud-native development.
  • Understanding of engineering team workflows, code maintainability, versioning, and CI/CD pipelines.
  • Knowledge of secure authentication, OAuth, JWT, and compliance frameworks.
  • Ability to clearly explain technical trade-offs, architecture decisions, and system designs to different stakeholders.
  • Build and optimize applications that span from the frontend (React, Next.js, Python, SDKs, CLIs) to the cloud backend (Go Lang and Typescript).
  • Design and optimize SQL (PostgreSQL, MySQL) and NoSQL databases, ensuring data integrity, efficiency, and scalability.
  • Implement industry best practices for scalable, secure, and reliable PaaS platforms.
  • Architect event-driven patterns using message queues, event buses, and eventual consistency models.
  • Design and maintain multi-region architectures that ensure data consistency, fault tolerance, and high availability.
  • Work with transactional integrity (ACID), eventual consistency, and CAP theorem trade-offs to optimize system performance.
  • Improve query performance, caching strategies, and cloud interactions to enhance scalability.
  • Implement best practices for code organization, modularity, and maintainability to support a growing engineering team.
  • Expand and standardize our tests in a TDD approach to increase our test coverage.
  • Work closely with Frontend, Cloud, and Infrastructure teams to ensure smooth communication between the UI, cloud services, and backend systems.
  • Advocate for secure coding practices, protecting customer data, and ensuring compliance with industry standards.

AWSDockerGraphQLPostgreSQLPythonSQLCloud ComputingFull Stack DevelopmentGCPKafkaKubernetesMongoDBMySQLRabbitmqReact.jsTypeScriptAlgorithmsAzureData StructuresGogRPCREST APIServerlessNext.jsNosqlCI/CDMicroservices

Posted about 2 months ago
Apply
Apply

πŸ“ United States, Canada, Europe

🧭 Full-Time

πŸ’Έ 131000.0 - 170000.0 USD per year

πŸ” Software Development

  • Expertise in testing cloud-scale distributed systems with a strong focus on reliability, performance, and scalability.
  • Strong programming skills in at least one language, preferably Python, Golang, or Typescript.
  • Hands-on experience in building test automation frameworks for complex microservices architectures.
  • Deep understanding of CI/CD pipelines, infrastructure as code (IaC), and automated deployment strategies.
  • Extensive experience with load testing tools (e.g., Locust, k6, JMeter) and observability platforms (e.g., Prometheus, Grafana, OpenTelemetry, Datadog).
  • Proven experience in testing containerized applications and Kubernetes-based environments.
  • Strong expertise in chaos engineering and fault injection frameworks (e.g., Chaos Mesh, Gremlin, LitmusChaos).
  • Knowledge of distributed tracing and debugging in cloud-native environments.
  • Design, develop, and maintain robust test automation frameworks for cloud-scale distributed systems.
  • Architect performance, load, and stress tests to validate system resilience under high traffic conditions.
  • Build fault-injection and chaos engineering strategies to assess the reliability of distributed services.
  • Develop and execute end-to-end integration, API, and system-level tests across microservices-based architectures.
  • Implement continuous testing pipelines within CI/CD workflows to accelerate deployment cycles.
  • Collaborate closely with development, SRE, and infrastructure teams to ensure quality best practices are embedded within the SDLC.
  • Analyze system logs, telemetry data, and observability metrics to identify and mitigate potential failures before they impact production.
  • Drive automation of security testing, API contract validation, and infrastructure testing.
  • Participate in on-call rotations to assist in diagnosing critical production issues related to system reliability and performance.

PythonKubernetesTypeScriptGrafanaPrometheusCI/CDMicroservices

Posted 2 months ago
Apply
Apply

πŸ“ US, Canada, Europe

πŸ” AI and machine learning

  • 5+ years of experience in user experience research or related roles.
  • Strong background in qualitative and quantitative research methods.
  • Ability to tie research findings to product outcomes and measure their impact.
  • Preferred familiarity with developer-focused products and comfort engaging with technical audiences.
  • Experience working cross-functionally with product, design, and engineering teams.
  • Comfortable using modern UX research tools.
  • Exceptional storytelling skills to convey research findings.
  • Thrives in a fast-paced, iterative environment.
  • Engage with hundreds of prospects, customers, and partners to understand their needs.
  • Conduct qualitative and quantitative research, including user interviews and surveys.
  • Translate research insights into actionable recommendations for product and engineering teams.
  • Maintain a detailed record of research insights and track their impact.
  • Partner closely with product, design, and engineering teams to prioritize user needs.
  • Establish ongoing mechanisms for user feedback collection.
  • Advocate for user-centered design principles across the organization.
  • Define and track success metrics for UX-related changes.

Data AnalysisFigmaBehavioral science

Posted 2 months ago
Apply
Apply

πŸ“ US, Canada, Europe

🧭 Full-Time

πŸ’Έ 160000.0 - 200000.0 USD per year

πŸ” AI and machine learning

  • Understand and take ownership of developers’ pain points.
  • 6+ years of software development experience in an internal tooling, developer tooling, or developer experience role.
  • Strong understanding of a major programming language and its ecosystem (Python, Go, or Rust preferred).
  • Experience creating images and deploying with Docker.
  • Experience working with Product teams to triage customer issues and create requirements for projects.
  • Identify developer pain points, workflow optimizations, and onboarding improvements, working with the Product team to design solutions.
  • Design and implement tooling, workflows, and sample projects to enhance RunPod's developer experience.
  • Participate in design and architectural discussions with the Product team to ensure developers are represented in new features and services.
  • Maintain open source repositories and documentation for tools, templates, and code samples.
  • Respond to issues and pull requests on open source repositories in a timely manner.

DockerPythonGitMachine LearningGoRust

Posted 2 months ago
Apply
Apply

πŸ“ USA, Canada, Europe

πŸ’Έ 120000 - 140000 USD per year

πŸ” AI and machine learning

  • 3+ years of fullstack engineering experience, particularly in front-end development.
  • Strong understanding of conversion rate optimization principles and user behavior.
  • Demonstrated experience in A/B testing and multivariate testing for web and mobile applications.
  • Proficiency with JavaScript frameworks (React, Next.js) and familiarity with analytics tools (e.g., Google Analytics, Hotjar).
  • Strong data-driven mindset and proactive problem-solving skills.
  • Lead CRO efforts for the website and sign-up flows.
  • Design, develop, and implement A/B tests and multivariate experiments.
  • Develop landing pages and dynamic content experiences to personalize user interactions.
  • Analyze data and report findings, using insights to inform iterations.
  • Collaborate with design and marketing to create high-converting user journeys.

Data AnalysisHTMLCSSJavascriptGoogle AnalyticsNext.jsReactA/B testing

Posted 4 months ago
Apply
Apply
πŸ”₯ Security Engineer
Posted 5 months ago

πŸ“ USA

🧭 Full-Time

πŸ’Έ 152000 - 175000 USD per year

πŸ” AI and cloud computing

  • Bachelor's degree in Computer Science, Cybersecurity, or a related field.
  • 5+ years of experience in information security roles, with a focus on cloud security.
  • Strong programming skills in at least one language (ideally, Python, Go, or C).
  • Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization.
  • Deep understanding of workload/network isolation techniques in multitenant cloud environments.
  • Experience securing and hardening cloud infrastructure, particularly in environments with untrusted workloads.
  • Familiarity with GPU architecture and security considerations in GPU cloud computing.
  • Strong background in network security, application security, and cloud-native security practices.
  • Experience with security testing tools and methodologies (e.g., OWASP, Burp Suite, static/dynamic analysis tools).
  • Familiarity with common cybersecurity frameworks (e.g., NIST, CIS Controls) and their application to cloud environments.
  • Excellent problem-solving skills and ability to think creatively about security challenges in cloud computing.
  • Successful completion of a background check.
  • Design and implement secure architectures for RunPod's multitenant GPU cloud platform, ensuring strong isolation between customer workloads.
  • Conduct thorough security assessments, including threat modeling, code reviews, and penetration testing of our cloud infrastructure and services.
  • Develop and implement security fixes and improvements in collaboration with software engineering teams.
  • Implement and manage security tools and systems (e.g., SIEM, WAF, EDR).
  • Create and maintain security documentation, including policies, procedures, and technical guidelines specific to GPU cloud security.
  • Provide security guidance and training to development teams to foster a security-first culture in cloud development.
  • Participate in incident response activities and contribute to post-incident analysis and improvements.
  • Collaborate with operations team to ensure adherence to relevant standards (e.g., SOC 2, ISO 27001, GDPR).

DockerPythonSoftware DevelopmentCloud ComputingCybersecurityKubernetesGoCollaborationProblem SolvingLinux

Posted 5 months ago
Apply
Apply

πŸ“ USA

🧭 Full-Time

πŸ’Έ $150,000 - $200,000 per year

πŸ” AI and machine learning

  • Bachelor’s degree in Computer Science, Computer Engineering or a related field, or equivalent experience.
  • 3+ years of professional experience in software development, with experience in Go.
  • Strong understanding of the Go programming language and its ecosystem.
  • Strong problem-solving skills and ability to work in a collaborative environment.
  • Excellent communication skills and attention to detail.
  • Successful completion of a background check.
  • Design, develop, and maintain cloud infrastructure software, primarily in Go.
  • Collaborate with managers and other engineers to define and implement product requirements.
  • Troubleshoot and optimize existing code to improve performance and reliability.
  • Participate in code reviews and contribute to the team's technical standards.
  • Contribute to architectural discussions and decisions.
  • Stay up-to-date with industry trends and emerging technologies.

Software DevelopmentGoCommunication Skills

Posted 6 months ago
Apply