Member of Technical Staff - Large Model Data

Posted about 1 year ago
Germany, USA
Generative image and video models
Company: Black Forest Labs
Location: Germany, USA
Skills:
AWS, Python, GCP, Hadoop, Machine Learning, OpenCV
Requirements:
Proficiency in Python and various file systems for data-intensive manipulation and analysis.
Familiarity with cloud computing platforms (AWS, GCP, or Azure) and Slurm/HPC environments for distributed data processing.
Experience with image and video processing libraries (e.g., OpenCV, FFmpeg).
Demonstrated ability to optimize and parallelize data processing workflows across CPUs and GPUs.
Familiarity with data annotation and captioning processes for ML training datasets.
Knowledge of machine learning techniques for data cleaning and preprocessing.
Responsibilities:
Develop and maintain scalable infrastructure for large-scale image and video data acquisition.
Manage and coordinate data transfers from various licensing partners.
Implement and deploy state-of-the-art ML models for data cleaning, processing, and preparation.
Implement scalable and efficient tools to visualize, cluster, and deeply understand the data.
Optimize and parallelize data processing workflows to handle billion-scale datasets efficiently.
Ensure data quality, diversity, and proper annotation for training readiness.
Convert training data from alternative sources into a trainable format.
Work closely within the model development loop to update data as training requires.