Senior Data Scientist

Posted 4 days ago

💎 Seniority level: Senior, 7+ years

📍 Location: Hong Kong, Macao

🔍 Industry: Software Development

🏢 Company: Intellectsoft | 👥 251-500 | Augmented Reality, Artificial Intelligence (AI), DevOps, Blockchain, Internet of Things, UX Design, Web Development, Mobile Apps, Quality Assurance, Software

⏳ Experience: 7+ years

Requirements:
  • 7+ years of experience in data science, machine learning, and statistical modeling.
  • Strong programming skills in Python, SQL, and Spark.
  • Hands-on experience with MLflow, PyTorch, Spark MLlib, and FastAPI for model development and deployment.
  • Understanding of distributed computing and big data processing using Apache Spark and ClickHouse.
  • Proficiency in feature engineering, data preprocessing, and model tuning for large-scale datasets.
  • Experience in building and deploying ML models in production environments using TorchServe, FastAPI, or similar frameworks.
  • Knowledge of deep learning architectures (CNNs, RNNs, transformers) and their practical applications.
  • Strong grasp of MLOps best practices, including CI/CD for ML models, model monitoring, and retraining pipelines.
  • Understanding of real-time analytics and event-driven architectures for processing streaming data.
  • Experience working with SQL and NoSQL databases such as PostgreSQL, ClickHouse, and Delta Lake.
  • Strong ability to collaborate with data engineers, architects, and business analysts to ensure ML models align with business objectives.
  • Knowledge of A/B testing methodologies and causal inference techniques for evaluating model effectiveness.
  • Familiarity with cloud services (AWS, GCP, or Azure) for scalable model training and deployment.
Responsibilities:
  • Design the architecture for the open-source-based data analytics platform.
  • Develop scalable data models, data pipelines, and data lakes.
  • Ensure integration of various data sources, including Kafka, NiFi, Apache Airflow, and Spark.
  • Implement modern data platform components like Apache Iceberg, Delta Lake, ClickHouse, and PostgreSQL.
  • Define and enforce data governance, security, and compliance best practices.
  • Optimize data storage, access, and retrieval for performance and scalability.
  • Collaborate with data scientists, engineers, and business analysts to ensure platform usability.

Related Jobs

💸 160,000 - 190,000 USD per year

🔍 Mental Health Care

🏢 Company: Alma | 👥 251-500 | 💰 $130,000,000 Series D over 2 years ago | 🫂 Last layoff 6 months ago | Mental Health, Medical, Wellness, Health Care

  • Master’s degree in a relevant quantitative field (e.g. data science, computer science, statistics, economics) or equivalent in industry experience
  • 2-4 years of experience as a data scientist supporting product development
  • Expert in SQL
  • Proficient in either R or Python for data analysis
  • Successfully developed and launched ML models with quantifiable business impact
  • Experience designing and analyzing A/B tests and guiding product decisions with experimental insights
  • Improve client and provider experience by building machine learning models to solve business problems such as provider-client matching and LTV prediction
  • Design, execute, and analyze A/B testing experiments to measure model performance and validate product hypotheses
  • Define metrics and create dashboards to measure product performance
  • Generate insights through deep-dive analysis to inform strategy and prioritization
  • Collaborate with product, engineering, and marketing stakeholders to define the product roadmap
  • Identify opportunities and initiate new data science projects that drive company OKRs
  • Communicate technical insights effectively to influence decision-making
Posted about 5 hours ago
🔥 Senior Data Scientist
Posted about 12 hours ago

📍 United States

🧭 Full-Time

🔍 Real Estate

🏢 Company: Property Leads | 👥 11-50 | Real Estate

  • 4+ years of experience working with real estate data, particularly niche data sets (divorce, bankruptcy, probate, etc.).
  • Proficiency in data analysis and engineering tools (e.g., SQL, Python, Pandas, Excel).
  • Experience with data platforms and lead generation tools (e.g., PropStream, BatchLeads).
  • Experience developing scoring models, predictive features, or data-driven segmentation.
  • Strong understanding of ETL pipelines, public record APIs, and/or scraping strategies
  • Proven ability to source and curate high-quality leads.
  • Meticulous attention to detail and a passion for clean, actionable data
  • Excellent communication skills and a collaborative, fast-moving mindset.
  • Source and analyze specialized real estate data sets, including divorce records, bankruptcy filings, probate cases, tax liens, pre-foreclosures, and other distressed property indicators.
  • Develop and maintain efficient processes for extracting, cleansing, and managing data from multiple sources.
  • Identify patterns and insights in data to improve lead targeting and conversion rates.
  • Collaborate with the marketing and sales teams to create actionable lead lists and improve outreach strategies.
  • Stay current with trends and tools in real estate data sourcing and analysis.
  • Ensure data accuracy, completeness, and compliance with relevant regulations.
  • Provide input into data roadmap, tooling decisions, and long-term analytics strategy
  • Mentor junior analysts or data contributors (as team grows)

Python, SQL, Data Analysis, Data Mining, ETL, Machine Learning, Data science, Pandas, RESTful APIs, Data visualization, Lead Generation, Data modeling


🏢 Company: StreamElements | 👥 101-250 | 💰 $100,000,000 Series B over 3 years ago | 🫂 Last layoff about 2 years ago | Developer Tools, Video Streaming, Media and Entertainment, Content Creators, Music Streaming

  • Knowledge of one or more machine learning, statistical modeling, and optimization techniques relevant to StreamElements’ business.
  • Ability to methodically understand business context, define value propositions, and formulate complex data problems.
  • You have a deep analytical background with extensive experience executing complex analyses to drive organizational progress.
  • Ability to own key deliverables and use creative problem solving to deliver projects on time and iterate when necessary.
  • Knowledge of end-to-end software development, deployment, and monitoring workflows.
  • Proficiency in visualization tools such as Tableau, Shiny, or Seaborn.
  • Extensive experience in relational algebra, building SQL queries across complex relational databases.
  • Experience designing scalable data models and ETLs with tools like DBT.
  • Drive strategic projects and measure the impact of the implemented recommendations.
  • Design and execute multi-disciplinary data science solutions effectively and efficiently.
  • Autonomously leverage an advanced analytical toolkit to execute complex analyses, visualization and storytelling.
  • Partner with business leaders to work effectively across multiple technical teams to solve problems.
  • Further the team’s technical capabilities by bringing novel methods from the field into StreamElements through research, software development, and teaching others.
  • Help create and promote an inclusive environment where team members feel comfortable and empowered.
Posted 3 days ago

📍 Australia, New Zealand

🔍 Software Development

  • Drive impact with data
  • Excel in core data science skills
  • Demonstrate key soft skills
  • Bring additional technical expertise
  • Have a strong analytical foundation
  • Understand the dynamics of tech companies
  • Have hands-on experience with large-scale data
  • Uncovering strategic insights
  • Designing and analyzing experiments
  • Defining and influencing with metrics
  • Providing data for decision-making

Python, SQL, Data Analysis, Data Mining, Machine Learning, NumPy, Tableau, Product Analytics, Algorithms, Data science, Data Structures, Pandas, Communication Skills, Analytical Skills, Data visualization, Data modeling, Data analytics, A/B testing

Posted 4 days ago

  • You have deep knowledge of marketing measurement, including attribution, MMM, incrementality testing, and assessing marketing efficiency.
  • You have experience with understanding marketing efficiency and building dashboards that effectively communicate results and outcomes
  • You have competency with SQL; experience with data warehouses such as Snowflake, Redshift, or BigQuery is a plus!
  • You know how to wrangle and analyze data using Python or R.
  • You’re experienced in working with international teams and understand the nuances of collaborating across cultures and markets.
  • Acting as a strategic partner to International Marketing teams, helping to define and refine global growth and marketing strategies.
  • Developing innovative frameworks for marketing measurement, including attribution methodologies, marketing mix modeling (MMM), and incrementality testing, to assess the impact of our marketing investment across diverse regions.
  • Work closely with collaborators throughout Canva to develop insightful analysis and recommendations.
  • Serving as a thought leader in data strategy, aligning teams around key insights and acting as a driver of data literacy and analytics best practices.
  • Closely collaborating with senior leadership and taking the initiative to keep things moving: identifying the opportunities where data can make the largest impact and seeing them through.
  • Reporting on the performance of our marketing metrics at a campaign or channel level but also zooming out to see the big picture.
  • Build and maintain dashboards for our International marketing teams, and make them look really good - you don’t need to be a designer or have any design talent, you just need to want to build cool things.
Posted 4 days ago

💸 70 - 80 USD per hour

🔍 Digital Transformation Solutions

  • 6 years of experience as a Data Scientist.
  • Experience with time-series anomaly detection and Jenkins/GitHub CI/CD pipelines.
  • MLOps.
  • Translate complex data insights into actionable strategies.
Posted 4 days ago

📍 Portugal

🔍 Wellness

  • Master’s degree or PhD in Computer Science, Data Science, Machine Learning, Statistics, or a related field.
  • Proficiency in Python and experience with machine learning frameworks such as PyTorch, TensorFlow, or similar.
  • Strong understanding of generative AI architectures, including transformers and attention mechanisms.
  • Experience in multi-agent systems and LLMs.
  • Strong problem-solving abilities, with a focus on experimental design and data analysis.
  • Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.
  • Have clear scientific thinking and a passion for integrating R&D and cutting-edge technology into a product
  • Prior experience in Python and SQL is a mandatory requirement.
  • Understanding of state-of-the-art deep learning techniques, such as Transformer architectures and attention mechanisms, and knowledge of fine-tuning LLMs for enhanced model performance, using techniques such as LoRA and QLoRA, among others.
  • Design and implement structured function calling mechanisms within LLMs to enable dynamic interactions with APIs, databases, and retrieval-augmented generation (RAG) pipelines.
  • Craft clear instructions that help our models understand exactly what we need, creating reusable templates and testing against edge cases.
  • Preprocess, analyse, and curate high-quality datasets to train and fine-tune both embedding and generative models.
  • Develop robust evaluation frameworks to assess agentic AI performance, combining automated evaluation (LLM-as-judge), adversarial testing, human-in-the-loop evaluations, and custom behavioral metrics.
  • Establish monitoring systems that track LLM behavior in production, capturing key metrics around information retrieval, hallucinations, and latency.
  • Stay current with the latest research in generative AI, share insights with the team, test promising approaches, and help make us better.
  • Collaborate with engineering teams to deploy models in scalable and efficient production environments.

AWS, Python, SQL, Data Analysis, Machine Learning, PyTorch, Data science

Posted 4 days ago

📍 Mexico, Brazil, Ukraine, Romania, Argentina

🔍 Insider Risk Management and User Behavior Analytics

🏢 Company: Teramind | 👥 51-100 | Productivity Tools, Security, Cyber Security, Enterprise Software, Software

  • 5+ years of hands-on experience in data science with a strong background in machine learning.
  • Proficiency in programming languages such as Python and ML frameworks
  • Expertise in anomaly detection and behavioral analytics
  • Knowledge of statistical analysis and model validation techniques
  • Demonstrated experience with advanced algorithms and ML techniques including clustering, regression, classification, and neural networks.
  • Strong analytical skills with the ability to interpret large data sets and draw meaningful conclusions.
  • Advanced English level
  • Master’s or Ph.D. in Computer Science, Statistics, Mathematics, or a related field.
  • Develop and optimise complex models for detecting anomalous user behavior
  • Create algorithms for identifying potential insider threats and security violations
  • Design feature engineering approaches for behavioral data
  • Implement unsupervised learning techniques for pattern discovery
  • Collaborate with security analysts to validate model effectiveness
  • Optimise models for production performance and accuracy
  • Collaborate with cross-functional teams to define metrics and key performance indicators, developing dashboards and reporting tools.
  • Establish best practices for data science methodologies that streamline processes and improve model reliability.
  • Stay abreast of the latest advancements in data science and machine learning, integrating new techniques and technologies where appropriate.

Python, Data Analysis, Machine Learning, Algorithms, Data science, Data visualization, Data modeling

Posted 6 days ago

📍 Brazil, Portugal

🔍 Wellness

🏢 Company: Wellhub

  • Master’s degree or PhD in Computer Science, Data Science, Machine Learning, Statistics, or a related field.
  • Proficiency in Python and experience with machine learning frameworks such as PyTorch, TensorFlow, or similar.
  • Strong understanding of generative AI architectures, including transformers and attention mechanisms.
  • Experience in multi-agent systems and LLMs.
  • Strong problem-solving abilities, with a focus on experimental design and data analysis.
  • Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.
  • Have clear scientific thinking and a passion for integrating R&D and cutting-edge technology into a product
  • Prior experience in Python and SQL
  • Model Development & Fine-tuning: Understanding of state-of-the-art deep learning techniques, such as Transformer architectures and attention mechanisms, and knowledge of fine-tuning LLMs for enhanced model performance, using techniques such as LoRA and QLoRA, among others.
  • Function Calling & Tool Use: Design and implement structured function calling mechanisms within LLMs to enable dynamic interactions with APIs, databases, and retrieval-augmented generation (RAG) pipelines.
  • Prompt Engineering: Craft clear instructions that help our models understand exactly what we need, creating reusable templates and testing against edge cases.
  • Data Preparation: Preprocess, analyse, and curate high-quality datasets to train and fine-tune both embedding and generative models.
  • Evaluation and Testing: Develop robust evaluation frameworks to assess agentic AI performance, combining automated evaluation (LLM-as-judge), adversarial testing, human-in-the-loop evaluations, and custom behavioral metrics.
  • LLM Observability: Establish monitoring systems that track LLM behavior in production, capturing key metrics around information retrieval, hallucinations, and latency.
  • Research and Innovation: Stay current with the latest research in generative AI, share insights with the team, test promising approaches, and help make us better.
  • Deployment: Collaborate with engineering teams to deploy models in scalable and efficient production environments.

AWS, Python, SQL, Data Analysis, Machine Learning, PyTorch, Algorithms, Amazon Web Services, Data science, TensorFlow, CI/CD, RESTful APIs, Data visualization, Data modeling

Posted 7 days ago
