Senior/Staff Software Engineer, Search & Retrieval Infrastructure

New
Based in the United StatesFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
6+ years
Required Skills
PythonJavaKubernetesC++GoRust

Requirements

  • 6+ years of experience building production-grade backend systems in large-scale distributed environments.
  • Strong expertise in system architecture, with a focus on high throughput, low latency, and scalable design principles.
  • Experience building or working with search systems, including semantic search, vector databases, hybrid retrieval, or platforms like Elastic or OpenSearch.
  • Deep understanding of retrieval-augmented generation (RAG), embedding pipelines, and LLM-based orchestration patterns.
  • Proficiency in at least one major programming language such as Go, Rust, C++, Java, or Python.
  • Experience with data engineering and building large-scale indexing pipelines for diverse data types.
  • Familiarity with modern infrastructure tools such as Kubernetes, cloud-native architectures, observability systems, and IaC tools like Terraform or Pulumi.
  • Strong product mindset with the ability to design developer-friendly and agent-friendly APIs.
  • Comfortable working in ambiguous, high-growth environments with significant ownership expectations.
  • Strong problem-solving skills and a bias toward building robust, long-term systems rather than short-term fixes.

Responsibilities

  • Design and build scalable search and retrieval infrastructure, including semantic search, hybrid retrieval, metadata-aware querying, and LLM-driven query planning systems.
  • Develop high-throughput indexing pipelines for both structured and unstructured data, ensuring performance, reliability, and scalability.
  • Build and maintain backend services that support retrieval orchestration, knowledge synthesis, and agentic AI workflows.
  • Improve retrieval quality through robust evaluation frameworks, observability systems, and experimentation on ranking and relevance.
  • Design clean, intuitive, and scalable APIs for both internal systems and external developer and agentic use cases.
  • Optimize system performance across latency, throughput, and cost in large-scale distributed environments.
  • Drive technical direction for reliability, security, and architectural evolution of core retrieval systems.
  • Collaborate closely with cross-functional teams to align infrastructure capabilities with product and AI application needs.
  • Contribute to system-level design decisions that shape long-term platform scalability and extensibility.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now