Bachelor's Degree in Computer Science/Engineering or equivalent practical experience.
4+ years experience on building/maintaining a large scale production-grade data platform (including a Data Lake) for data scientists, analysts, and engineers.
Strong programming and software engineering skills (experience with Python, Go, Scala, or SQL is a must).
2+ years experience working with Spark.
Experience deploying and managing data orchestration systems (e.g Airflow, Flyte, Prefect, Dagster).
AWS knowledge and expertise (S3, Lambda, SQS, Kinesis).
Responsibilities:
Develop software to reliably ingest vast amounts of data into our data lake.
Explore new infrastructure needed to support the growing needs of our data platform.
Design, scope, and build libraries and data management tooling for other software engineers, product managers and data scientists to use the data platform effectively.
Expand our ability to stream data into the data lake to support near real time access.
Be responsible for the uptime, reliability, and monitoring of our data platform.
Research, recommend, and implement new tools to help our teams leverage data more easily.
Uplevel team members and cross-functional colleagues on topics like data best practices, AWS, Spark, etc.
Champion and embed Samsara’s cultural principles as we scale globally.