Senior Backend Engineer (High-Throughput Platforms)

New

100% remote work across the United StatesFull-TimeSenior

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Experience: 6+ years
Required Skills: JavaC++Apache KafkaGoRustMicroservicesScalaDistributed Systems

6+ years of backend or platform engineering experience in high-scale environments.
Strong hands-on experience designing distributed systems with emphasis on reliability, latency, and scalability.
Advanced proficiency in a backend programming language such as Java, Go, Scala, Rust, or C++.
Proven experience building and operating shared platforms or infrastructure used by multiple engineering teams.
Deep understanding of event-driven systems, streaming architectures, and messaging frameworks such as Apache Kafka or equivalent tools.
Experience working with relational, NoSQL, and distributed storage systems.
Strong system design skills with the ability to evaluate trade-offs across consistency, availability, and performance.
Experience leading multi-quarter technical initiatives and influencing architectural direction.
Strong communication skills with the ability to articulate complex technical decisions clearly.
Experience mentoring engineers and contributing to engineering best practices across teams.

Design, build, and evolve high-throughput backend platforms and shared infrastructure services that support multiple engineering teams.
Architect and develop internal platform services supporting request routing, traffic management, caching, and asynchronous processing at large scale.
Design and implement event-driven and streaming systems using technologies such as Apache Kafka and similar messaging infrastructures.
Build durable, replayable pipelines for data processing, event handling, and background job execution across distributed systems.
Define and maintain platform SDKs, libraries, and abstractions that enable consistent and efficient product development.
Design systems with strong focus on scalability, fault tolerance, multi-region resilience, and predictable latency under high load.
Establish observability standards, SLIs/SLOs, and capacity planning practices to ensure operational excellence.
Lead incident response for critical platform issues and drive post-mortem analysis with long-term reliability improvements.

View Full Description & ApplyYou'll be redirected to the employer's site