Applyπ Poland
π§ Full-Time
π Cyber security, user privacy, and machine learning
π’ Company: Intuition Machines, Inc.π₯ 51-100InternetEducationInternet of ThingsMachine LearningSoftware
- Thoughtful, conscientious, and self-directed.
- Minimum of 3 years in a data role involving designing and building data stores, feature engineering, and building reliable data pipelines that handle high loads.
- At least 2 years of professional software development experience in a role other than data engineering.
- Significant experience coding and developing in Python.
- Experience in building and maintaining distributed data pipelines.
- Experience working with Kafka infrastructure and applications.
- Deep understanding of SQL and NoSQL databases (preferably Clickhouse).
- Familiarity with public cloud providers (AWS or Azure).
- Experience with CI/CD and orchestration platforms: Kubernetes, containerization, and microservice design.
- Familiarity with distributed systems and architectures.
- Maintain, extend, and improve existing data/ML workflows and implement new ones to handle high-velocity data.
- Provide interfaces and systems that enable ML engineers and researchers to build datasets on demand.
- Influence data storage and processing strategies.
- Collaborate with the ML team, as well as frontend and backend teams, to build out our data platform.
- Reduce time-to-deployment for dashboards and ML models.
- Establish best practices and develop pipelines that enable ML engineers and researchers to efficiently build and use datasets.
- Work with large datasets under performance constraints comparable to those at the largest companies.
- Iterate quickly, focusing on early and frequent shipping to deploy new products or features to millions of users.
AWSPythonSoftware DevelopmentSQLKafkaKubernetesStrategyAzureClickhouseData engineeringNosqlCI/CD
Posted 2024-10-12
Apply