Network DevOps Engineer, RDMA Fabric Automation

Posted about 3 hours agoViewed
90000 - 130000 USD per year
United StatesFull-TimeCloud Infrastructure
Company:Vultr
Location:United States
Languages:English
Skills:
PythonKafkaGrafanaPrometheusRustCI/CDLinuxAnsible
Requirements:
Solid understanding of modern data center networking: EVPN-VXLAN, BGP, MLAG, QoS, and traffic engineering Deep familiarity with RoCEv2, RDMA transport tuning, ECN/PFC, and lossless Ethernet design Strong experience with automation frameworks like Ansible Experience with languages like Python, Golang, Rust, or PHP Comfort working with telemetry and monitoring stacks — Prometheus, Grafana, Loki, ELK, or similar Previous experience integrating with NetBox, Nautobot, OpsMill or similar Familiarity with CI/CD systems (GitHub Actions, Jenkins, ArgoCD) Strong Linux networking background, including namespaces, netlink, and system-level debugging
Responsibilities:
Automate deployment and operations of large-scale RDMA (RoCEv2) Ethernet fabrics Build Ansible and Python-based frameworks to provision, validate, and remediate networks Integrate network automation with Vultr’s source-of-truth systems (NetBox, OpsMill) Develop telemetry ingestion and correlation pipelines for network health and performance metrics Collaborate with engineering teams to optimize RDMA performance Implement CI/CD workflows for network configuration changes Investigate complex network behaviors Contribute to the design of next-generation GPU and AI interconnect fabrics
Similar Jobs:
Posted 38 minutes ago
United StatesFull-TimeSoftware Development
Sr. Software Engineer II - DevSecOps, Reliability, Security (Remote Eligible)
Company:Smartsheet
Posted about 1 hour ago
United StatesFull-TimeSchool Transportation
Senior DevOps Engineer (AWS)
Company:N-iX
Posted about 1 hour ago
United StatesFull-TimeSports Technology
Staff Data Engineer