Sourcescrub

Sourcescrub is a technology-driven company that specializes in search and recommendation systems, actively seeking a Lead Engineer to join their team.

Related companies:

Jobs at this company:

Apply
🔥 Head of Data Product
Posted about 2 months ago

📍 USA

🔍 Deal sourcing platform

  • 5+ years of experience in Product Management for data-related products.
  • Direct people and process management experience.
  • Extensive knowledge of the investing/private equity industry.
  • Experience designing research and data collection processes.
  • Strong Excel skills, experience with SQL and data modeling.
  • Knowledge of ontology development and data governance frameworks.
  • Familiarity with big data technologies like Hadoop and Spark.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Understanding of data privacy regulations like GDPR and CCPA.
  • Own the strategy, development, execution, and budgeting for global data products.
  • Lead the Product Owner role for Data Labeling, Data Platform, and Data Automation Ingestion.
  • Engage in the development and optimization of raw and derived data products.
  • Collaborate with Go-to-Market teams on data accuracy, coverage, and freshness.
  • Oversee data acquisition from diverse sources.
  • Design innovative research methodologies ensuring accuracy and efficiency.
  • Identify patterns in data to support strategic planning.
  • Implement data quality measurement practices and conduct regular audits.
  • Document research methodologies to facilitate future initiatives.
  • Work with engineering teams on the data product roadmap.
  • Enhance research operations with new tools and explore new data sources.
  • Anticipate client needs with research solutions.
  • Contribute to the overall research strategy for SourceScrub.

AWSSQLData AnalysisHadoopProduct ManagementSparkData modeling

Posted about 2 months ago
Apply
Apply

📍 Mexico

🧭 Contract

🔍 Machine Learning

  • 3+ years of proven experience as an Applied ML Engineer or similar role, focusing on Python.
  • 3+ years of advanced Python skills with strong experience in libraries such as Pandas, NumPy, TensorFlow, PyTorch, and Scikit-Learn.
  • Demonstrated experience deploying and optimizing ML models at scale, with knowledge of LLMs and other scalable architectures.
  • 3+ years of strong knowledge of SQL and database management for data storage and retrieval.
  • 2+ years of experience with version control, particularly Git.
  • Strong analytical, problem-solving, and attention to detail skills.
  • Excellent communication and teamwork, particularly in agile environments.
  • Develop and maintain ML models focusing on scalability, particularly for deploying and optimizing LLMs in production environments.
  • Collaborate with cross-functional teams to deliver end-to-end ML solutions, from data preprocessing to model deployment.
  • Implement data preprocessing, feature engineering, and model evaluation processes to ensure high standards of accuracy and efficiency.
  • Develop and manage ML pipelines in Python for performance at scale.
  • Leverage cloud platforms (e.g., Azure, AWS) for deploying, scaling, and monitoring models in production.
  • Stay current with advancements in LLMs, ML frameworks, and Python development best practices.

AWSPythonSQLAgileGitMachine LearningNumpyPyTorchAzureData engineeringPandasTensorflowAttention to detail

Posted 2 months ago
Apply