Yotta Labs

Building the GPU Cloud for efficient ML with heterogeneous hardware and cross-cloud orchestration

1-10 employees

Artificial Intelligence (AI)

Private Company

Website LinkedIn Email Share Tweet

Artificial Intelligence (AI)Cloud Computing GPU AI Infrastructure

Remote Work Available

This company offers remote work opportunities.

Open Positions3

United StatesHong KongCanadaSingaporeFull-TimeAI InfrastructurePosted

Research Engineer - AI Systems

Design and implement high-performance kernels for Attention, MoE, GEMM, collective communication, and quantization.
Optimize kernels for NVIDIA, AMD, and AWS Trainium.
Develop custom operators and graph optimizations using Neuron SDK, PyTorch/XLA, Torch Dynamo, and Neuron Compiler.
Improve performance of vLLM, SGLang, TensorRT-LLM, and custom inference runtimes.
Design scalable distributed training and inference solutions across thousands of accelerators.
Contribute to open-source projects, publish technical findings and engage with the developer community.

PythonPyTorchC++

Showing 1 of 3 positions

About Yotta Labs

Yotta Labs is building the GPU Cloud for efficient machine learning, pioneering a Decentralized Operating System (DeOS) to orchestrate AI workloads across diverse hardware at a planetary scale. You can access on-demand GPU instances and Model APIs, enabling AI companies, research labs, and enterprises to train and deploy cutting-edge models. Their platform aggregates geo-distributed GPUs, offering elastic and cost-effective access to AI compute. They unify fragmented GPU capacity into a single execution fabric, making high-performance computing accessible for AI training and inference. You can deploy GPU pods in under 3 seconds.

How We Work

You will find a flexible, remote work environment that values innovation and autonomy. Your team members are encouraged to collaborate and contribute to open-source projects. You will work within a small, focused team, directly impacting strategic decisions. This environment supports strong problem-solving skills and independent contribution.

Engineering at Yotta Labs

You will build the next-generation AI infrastructure, developing a Decentralized Operating System (DeOS) for AI workload orchestration across multi-cloud and multi-silicon environments. Engineers optimize AI workloads across a heterogeneous network of GPUs. Your work will involve cutting-edge technologies that bridge AI and decentralized computing. You will contribute to high-performance computing for AI training and inference on a wide spectrum of hardware. This means tackling challenges in distributed systems, cloud computing, and blockchain technologies.

Why Join Us

Be part of a visionary team redefining AI infrastructure.
Work on cutting-edge technologies bridging AI and decentralized computing.
Collaborate with experts from leading institutions and tech companies.
Enjoy a flexible, remote work environment valuing innovation and autonomy.
Directly impact the scalability and performance of AI applications.

Benefits & Perks

Flexible, remote work environment
Autonomy in your role

Tech Stack

analyticsmobilewidgets