Senior Backend Engineer (High-Throughput Platforms)
New
100% remote work across the United StatesFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 6+ years
- Required Skills
- JavaC++Apache KafkaGoRustMicroservicesScalaDistributed Systems
Requirements
- 6+ years of backend or platform engineering experience in high-scale environments.
- Strong hands-on experience designing distributed systems with emphasis on reliability, latency, and scalability.
- Advanced proficiency in a backend programming language such as Java, Go, Scala, Rust, or C++.
- Proven experience building and operating shared platforms or infrastructure used by multiple engineering teams.
- Deep understanding of event-driven systems, streaming architectures, and messaging frameworks such as Apache Kafka or equivalent tools.
- Experience working with relational, NoSQL, and distributed storage systems.
- Strong system design skills with the ability to evaluate trade-offs across consistency, availability, and performance.
- Experience leading multi-quarter technical initiatives and influencing architectural direction.
- Strong communication skills with the ability to articulate complex technical decisions clearly.
- Experience mentoring engineers and contributing to engineering best practices across teams.
Responsibilities
- Design, build, and evolve high-throughput backend platforms and shared infrastructure services that support multiple engineering teams.
- Architect and develop internal platform services supporting request routing, traffic management, caching, and asynchronous processing at large scale.
- Design and implement event-driven and streaming systems using technologies such as Apache Kafka and similar messaging infrastructures.
- Build durable, replayable pipelines for data processing, event handling, and background job execution across distributed systems.
- Define and maintain platform SDKs, libraries, and abstractions that enable consistent and efficient product development.
- Design systems with strong focus on scalability, fault tolerance, multi-region resilience, and predictable latency under high load.
- Establish observability standards, SLIs/SLOs, and capacity planning practices to ensure operational excellence.
- Lead incident response for critical platform issues and drive post-mortem analysis with long-term reliability improvements.
View Full Description & ApplyYou'll be redirected to the employer's site