RunPod, Inc.

RunPod, Inc. is a technology company focused on cloud solutions, currently looking to expand its team with positions such as Cloud Software Engineer and Technical Support Analyst (L2).

Related companies:

🏢 DigitalOcean
👥 1001-5000💰 $34,913,641 Post-IPO Equity over 3 years ago🫂 Last layoff about 2 years agoVirtualizationDevOpsWeb HostingCloud ComputingSaaS
Website LinkedIn Email Facebook Twitter

Jobs at this company:

Apply
🔥 Forward Deployed Engineer
Posted about 1 month ago

📍 United States, Canada, Europe

🧭 Full-Time

💸 90000.0 - 140000.0 USD per year

🔍 AI and machine learning

  • Proficient in deploying and managing AI/ML models as service, with a strong understanding of training, fine-tuning, inference, and deployment.
  • 3+ years of software development experience in JavaScript, Go (Golang), or Python.
  • Strong problem-solving skills and ability to work in a collaborative environment.
  • Excellent communication skills and attention to detail.
  • Troubleshoot and resolve critical or complex technical issues escalated by customers
  • Utilize various tools and methods to identify root causes and deliver solutions
  • Communicate effectively with customers and internal teams
  • Participate in sales meetings with customers, explain RunPod’s specific technologies, provide architectural recommendations, and build proof-of-concept solutions
  • Assist the support team in troubleshooting and resolving escalated technical tickets
  • Contribute to product development and testing efforts by relaying feedback from customers and the support team to the engineering team, helping to shape product improvements.
  • Create and maintain technical documentation, such as knowledge base articles, FAQs, guides, and manuals, while also developing and delivering training sessions, webinars, and demos for customers, partners, and internal teams.

DockerPythonSoftware DevelopmentSQLCloud ComputingJavascriptMachine LearningGoCommunication SkillsCI/CDProblem SolvingRESTful APIsLinuxAccount ManagementTroubleshootingTechnical supportScriptingDebuggingCustomer support

Posted about 1 month ago
Apply
Apply
🔥 Senior Full-Stack Engineer
Posted about 2 months ago

📍 United States

🧭 Full-Time

💸 160000.0 - 190000.0 USD per year

🔍 Software Development

  • 10+ years of professional experience in full-stack development, with a strong emphasis on scalable PaaS platforms.
  • Deep expertise in Python (AI experience a plus).
  • Proficiency in TypeScript/JavaScript and Go Lang (strict type languages a plus).
  • Proficiency in developing frontend applications (React, Next.js a plus).
  • API Development: Proficiency in designing and maintaining high-performance APIs, microservices, and cloud integrations.
  • Experience with SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, DynamoDB, etc.), with a strong understanding of database design, indexing, and query optimization.
  • Hands-on experience with multi-region architectures, replication strategies, and designing for high availability.
  • Familiarity with event-driven patterns, message queues (Kafka, RabbitMQ, NATS, etc.), and pub/sub systems.
  • Experience with cloud platforms (AWS, GCP, or Azure) and best practices for serverless, containerization (Docker or Kubernetes), and cloud-native development.
  • Understanding of engineering team workflows, code maintainability, versioning, and CI/CD pipelines.
  • Security & Compliance: Knowledge of secure authentication, OAuth, JWT, and compliance frameworks.
  • Ability to clearly explain technical trade-offs, architecture decisions, and system designs to different stakeholders.
  • Successful completion of a background check
  • Build and optimize applications that span from the frontend (React, Next.js, Python, SDKs, CLIs) to the cloud backend (Go Lang and Typescript).
  • Database Design & Architecture: Design and optimize SQL (PostgreSQL, MySQL) and NoSQL databases, ensuring data integrity, efficiency, and scalability.
  • PaaS Best Practices: Implement industry best practices for scalable, secure, and reliable PaaS platforms.
  • Event-Driven Systems: Architect event-driven patterns using message queues, event buses, and eventual consistency models.
  • Multi-Region Architecture: Design and maintain multi-region architectures that ensure data consistency, fault tolerance, and high availability.
  • ACID & Distributed Data Models: Work with transactional integrity (ACID), eventual consistency, and CAP theorem trade-offs to optimize system performance.
  • Performance Optimization: Improve query performance, caching strategies, and cloud interactions to enhance scalability.
  • Scaling Code for Large Teams: Implement best practices for code organization, modularity, and maintainability to support a growing engineering team.
  • Test-Driven Development: Expand and standardize our tests in a TDD approach to increase our test coverage.
  • Cross-Team Collaboration: Work closely with Frontend, Cloud, and Infrastructure teams to ensure smooth communication between the UI, cloud services, and backend systems.
  • Security & Compliance: Advocate for secure coding practices, protecting customer data, and ensuring compliance with industry standards.

AWSDockerGraphQLPostgreSQLPythonSQLCloud ComputingFull Stack DevelopmentGCPKafkaKubernetesMongoDBMySQLRabbitmqReact.jsTypeScriptAlgorithmsAzureData StructuresGogRPCREST APIServerlessNext.jsNosqlCI/CDMicroservices

Posted about 2 months ago
Apply
Apply

📍 United States, Canada, Europe

🧭 Full-Time

💸 131000.0 - 170000.0 USD per year

🔍 Software Development

  • Expertise in testing cloud-scale distributed systems with a strong focus on reliability, performance, and scalability.
  • Strong programming skills in at least one language, preferably Python, Golang, or Typescript.
  • Hands-on experience in building test automation frameworks for complex microservices architectures.
  • Deep understanding of CI/CD pipelines, infrastructure as code (IaC), and automated deployment strategies.
  • Extensive experience with load testing tools (e.g., Locust, k6, JMeter) and observability platforms (e.g., Prometheus, Grafana, OpenTelemetry, Datadog).
  • Proven experience in testing containerized applications and Kubernetes-based environments.
  • Strong expertise in chaos engineering and fault injection frameworks (e.g., Chaos Mesh, Gremlin, LitmusChaos).
  • Knowledge of distributed tracing and debugging in cloud-native environments.
  • Design, develop, and maintain robust test automation frameworks for cloud-scale distributed systems.
  • Architect performance, load, and stress tests to validate system resilience under high traffic conditions.
  • Build fault-injection and chaos engineering strategies to assess the reliability of distributed services.
  • Develop and execute end-to-end integration, API, and system-level tests across microservices-based architectures.
  • Implement continuous testing pipelines within CI/CD workflows to accelerate deployment cycles.
  • Collaborate closely with development, SRE, and infrastructure teams to ensure quality best practices are embedded within the SDLC.
  • Analyze system logs, telemetry data, and observability metrics to identify and mitigate potential failures before they impact production.
  • Drive automation of security testing, API contract validation, and infrastructure testing.
  • Participate in on-call rotations to assist in diagnosing critical production issues related to system reliability and performance.

PythonKubernetesTypeScriptGrafanaPrometheusCI/CDMicroservices

Posted about 2 months ago
Apply
Apply

📍 US, Canada, Europe

🔍 AI and machine learning

  • 5+ years of experience in user experience research or related roles.
  • Strong background in qualitative and quantitative research methods.
  • Ability to tie research findings to product outcomes and measure their impact.
  • Preferred familiarity with developer-focused products and comfort engaging with technical audiences.
  • Experience working cross-functionally with product, design, and engineering teams.
  • Comfortable using modern UX research tools.
  • Exceptional storytelling skills to convey research findings.
  • Thrives in a fast-paced, iterative environment.
  • Engage with hundreds of prospects, customers, and partners to understand their needs.
  • Conduct qualitative and quantitative research, including user interviews and surveys.
  • Translate research insights into actionable recommendations for product and engineering teams.
  • Maintain a detailed record of research insights and track their impact.
  • Partner closely with product, design, and engineering teams to prioritize user needs.
  • Establish ongoing mechanisms for user feedback collection.
  • Advocate for user-centered design principles across the organization.
  • Define and track success metrics for UX-related changes.

Data AnalysisFigmaBehavioral science

Posted about 2 months ago
Apply
Apply
🔥 Developer Experience Engineer
Posted about 2 months ago

📍 US, Canada, Europe

🧭 Full-Time

💸 160000.0 - 200000.0 USD per year

🔍 AI and machine learning

  • Understand and take ownership of developers’ pain points.
  • 6+ years of software development experience in an internal tooling, developer tooling, or developer experience role.
  • Strong understanding of a major programming language and its ecosystem (Python, Go, or Rust preferred).
  • Experience creating images and deploying with Docker.
  • Experience working with Product teams to triage customer issues and create requirements for projects.
  • Identify developer pain points, workflow optimizations, and onboarding improvements, working with the Product team to design solutions.
  • Design and implement tooling, workflows, and sample projects to enhance RunPod's developer experience.
  • Participate in design and architectural discussions with the Product team to ensure developers are represented in new features and services.
  • Maintain open source repositories and documentation for tools, templates, and code samples.
  • Respond to issues and pull requests on open source repositories in a timely manner.

DockerPythonGitMachine LearningGoRust

Posted about 2 months ago
Apply
Apply

📍 USA, Canada, Europe

💸 120000 - 140000 USD per year

🔍 AI and machine learning

  • 3+ years of fullstack engineering experience, particularly in front-end development.
  • Strong understanding of conversion rate optimization principles and user behavior.
  • Demonstrated experience in A/B testing and multivariate testing for web and mobile applications.
  • Proficiency with JavaScript frameworks (React, Next.js) and familiarity with analytics tools (e.g., Google Analytics, Hotjar).
  • Strong data-driven mindset and proactive problem-solving skills.
  • Lead CRO efforts for the website and sign-up flows.
  • Design, develop, and implement A/B tests and multivariate experiments.
  • Develop landing pages and dynamic content experiences to personalize user interactions.
  • Analyze data and report findings, using insights to inform iterations.
  • Collaborate with design and marketing to create high-converting user journeys.

Data AnalysisHTMLCSSJavascriptGoogle AnalyticsNext.jsReactA/B testing

Posted 4 months ago
Apply
Apply
🔥 Data Scientist
Posted 5 months ago

📍 US, Canada, Europe

🧭 Full-Time

💸 180000 - 210000 USD per year

🔍 AI and machine learning

  • Education: Bachelor’s or Master’s degree in Data Science, Statistics, Computer Science, or a related field.
  • Experience: 3+ years in a Data Scientist or similar role, ideally in a SaaS or IaaS setting.
  • Proficiency in Python or R for data analysis and modeling.
  • Strong SQL skills for data extraction and manipulation.
  • Experience with predictive modeling including lookalike modeling, churn prediction, and LTV estimation.
  • Familiarity with revenue forecasting methods with multiple data inputs.
  • Knowledge of machine learning libraries like scikit-learn, TensorFlow, and data visualization tools such as Mode, Tableau, or Power BI.
  • Experience with data warehousing platforms such as Snowflake, Athena, or Redshift.
  • Strong communication skills for presenting findings to non-technical stakeholders.
  • Ability to work cross-functionally with product, engineering, and sales teams.
  • A proactive problem-solving mindset.
  • Build lookalike models to predict user profiles and algorithms for Customer Lifetime Value (LTV).
  • Identify key behaviors associated with retention and create churn prediction models.
  • Translate data discoveries into actionable recommendations for sales, product, and marketing teams.
  • Collaborate with stakeholders to promote data-driven initiatives focusing on user retention and engagement.
  • Develop dynamic revenue forecasts, accounting for signup rates, conversion metrics, and churn.

PythonSQLData AnalysisMachine LearningSnowflakeTableauStrategyAlgorithmsData scienceTensorflowCommunication Skills

Posted 5 months ago
Apply
Apply

📍 USA

🧭 Full-Time

💸 152000 - 175000 USD per year

🔍 AI and machine learning

  • Deep knowledge of Linux kernel internals, containerization (Docker), virtualization (Kata/QEMU), and networking components.
  • Extensive experience with distributed system troubleshooting and design.
  • Proficiency in at least one programming language, preferably Python or Golang.
  • Proven experience implementing and managing SLIs and SLOs.
  • Experience with pull-based configuration management tools such as Chef or Puppet.
  • Demonstrated ability to manage large-scale bare-metal fleets (5,000+ machines) across multiple data centers.
  • Strong background in implementing secure best practices for foundational systems, including secret management, AWS IAM permissions, and key distribution systems.
  • Comprehensive understanding of OSI model Layers 3, 4, and 7.
  • Successful completion of a background check.
  • Design, implement, and maintain robust, scalable, and highly available systems.
  • Troubleshoot and resolve complex issues in distributed environments.
  • Develop and implement SLIs and SLOs to ensure system reliability and performance.
  • Manage and optimize large-scale bare-metal fleets across multiple data centers.
  • Implement and maintain secure practices for foundational systems.
  • Collaborate with cross-functional teams to improve system design and operation.
  • Automate processes to increase efficiency and reduce human error.
  • Participate in on-call rotations to provide 24/7 support for critical systems.

DockerPythonGoGrafanaCommunication Skills

Posted 5 months ago
Apply
Apply
🔥 Security Engineer
Posted 5 months ago

📍 USA

🧭 Full-Time

💸 152000 - 175000 USD per year

🔍 AI and cloud computing

  • Bachelor's degree in Computer Science, Cybersecurity, or a related field.
  • 5+ years of experience in information security roles, with a focus on cloud security.
  • Strong programming skills in at least one language (ideally, Python, Go, or C).
  • Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization.
  • Deep understanding of workload/network isolation techniques in multitenant cloud environments.
  • Experience securing and hardening cloud infrastructure, particularly in environments with untrusted workloads.
  • Familiarity with GPU architecture and security considerations in GPU cloud computing.
  • Strong background in network security, application security, and cloud-native security practices.
  • Experience with security testing tools and methodologies (e.g., OWASP, Burp Suite, static/dynamic analysis tools).
  • Familiarity with common cybersecurity frameworks (e.g., NIST, CIS Controls) and their application to cloud environments.
  • Excellent problem-solving skills and ability to think creatively about security challenges in cloud computing.
  • Successful completion of a background check.
  • Design and implement secure architectures for RunPod's multitenant GPU cloud platform, ensuring strong isolation between customer workloads.
  • Conduct thorough security assessments, including threat modeling, code reviews, and penetration testing of our cloud infrastructure and services.
  • Develop and implement security fixes and improvements in collaboration with software engineering teams.
  • Implement and manage security tools and systems (e.g., SIEM, WAF, EDR).
  • Create and maintain security documentation, including policies, procedures, and technical guidelines specific to GPU cloud security.
  • Provide security guidance and training to development teams to foster a security-first culture in cloud development.
  • Participate in incident response activities and contribute to post-incident analysis and improvements.
  • Collaborate with operations team to ensure adherence to relevant standards (e.g., SOC 2, ISO 27001, GDPR).

DockerPythonSoftware DevelopmentCloud ComputingCybersecurityKubernetesGoCollaborationProblem SolvingLinux

Posted 5 months ago
Apply
Apply
🔥 Data Analyst
Posted 5 months ago

📍 US, Canada, Europe

🧭 Full-Time

💸 140000 - 160000 USD per year

🔍 AI and machine learning

  • Bachelor’s degree in Data Science, Statistics, Computer Science, or a related field. A Master’s degree is a plus.
  • At least three years of experience in a data analytics role, preferably with user and machine data exposure.
  • Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy.
  • Proficiency in data analysis tools and programming languages such as SQL, Python, and R.
  • Experience with data visualization tools (e.g., Mode, Tableau, or Power BI).
  • Experience with data warehousing platforms (e.g., Snowflake, Athena, or Redshift).
  • Excellent verbal and written communication skills, with the ability to translate complex data into actionable insights.
  • Strong problem-solving skills.
  • Successful completion of a background check.
  • Analyze large, complex datasets to extract actionable insights about users, product performance, and operational efficiency.
  • Work closely with cross-functional teams, including engineering, marketing, operations, and sales, to support data-driven decision-making.
  • Develop and implement data collection systems and strategies that optimize statistical efficiency and data quality.
  • Identify, analyze, and interpret trends or patterns in complex data sets.
  • Prepare reports and dashboards that use relevant data to communicate trends, patterns, and predictions.
  • Collaborate with engineering teams to enhance data collection and analysis processes.
  • Present findings and recommendations to stakeholders and executive leadership clearly and compellingly.
  • Stay abreast of industry trends and best practices in data analysis and data science.

PythonSQLData AnalysisSnowflakeTableauData scienceCommunication SkillsAnalytical SkillsCollaborationData visualization

Posted 5 months ago
Apply
Shown 10 out of 11