ApplyMember of Technical Staff - Image / Video Data Engineer
Posted about 2 months agoViewed
View full description
Requirements:
- Proficiency in Python and various file systems for data-intensive manipulation and analysis.
- Familiarity with cloud computing platforms (AWS, GCP, or Azure) and Slurm/HPC environments for distributed data processing.
- Experience with image and video processing libraries (e.g., OpenCV, FFmpeg).
- Demonstrated ability to optimize and parallelize data processing workflows across CPUs and GPUs.
- Familiarity with data annotation and captioning processes for ML training datasets.
- Knowledge of machine learning techniques for data cleaning and preprocessing.
Responsibilities:
- Develop and maintain scalable infrastructure for large-scale image and video data acquisition.
- Manage and coordinate data transfers from various licensing partners.
- Implement and deploy state-of-the-art ML models for data cleaning, processing, and preparation.
- Implement scalable and efficient tools to visualize, cluster, and deeply understand the data.
- Optimize and parallelize data processing workflows to handle billion-scale datasets efficiently.
- Ensure data quality, diversity, and proper annotation for training readiness.
- Get training data from alternative sources into trainable format.
- Work closely in the model development loop to update data as required by training.
Apply