Apply

Member of Technical Staff - Image / Video Data Engineer

Posted about 2 months agoViewed

View full description

📍 Location: Germany, USA

🔍 Industry: Generative image and video models

🏢 Company: Black Forest Labs👥 20-100💰 $30,202,193 Seed 6 months agoArtificial Intelligence (AI)Media and EntertainmentGenerative AISoftware

🪄 Skills: AWSPythonGCPHadoopMachine LearningOpenCV

Requirements:
  • Proficiency in Python and various file systems for data-intensive manipulation and analysis.
  • Familiarity with cloud computing platforms (AWS, GCP, or Azure) and Slurm/HPC environments for distributed data processing.
  • Experience with image and video processing libraries (e.g., OpenCV, FFmpeg).
  • Demonstrated ability to optimize and parallelize data processing workflows across CPUs and GPUs.
  • Familiarity with data annotation and captioning processes for ML training datasets.
  • Knowledge of machine learning techniques for data cleaning and preprocessing.
Responsibilities:
  • Develop and maintain scalable infrastructure for large-scale image and video data acquisition.
  • Manage and coordinate data transfers from various licensing partners.
  • Implement and deploy state-of-the-art ML models for data cleaning, processing, and preparation.
  • Implement scalable and efficient tools to visualize, cluster, and deeply understand the data.
  • Optimize and parallelize data processing workflows to handle billion-scale datasets efficiently.
  • Ensure data quality, diversity, and proper annotation for training readiness.
  • Get training data from alternative sources into trainable format.
  • Work closely in the model development loop to update data as required by training.
Apply