Spark Jobs

Find remote positions requiring Spark skills. Browse through opportunities where you can utilize your expertise and grow your career.

Spark
232 jobs found. to receive daily emails with new job openings that match your preferences.
232 jobs found.

Set alerts to receive daily emails with new job openings that match your preferences.

Apply

πŸ“ Canada

🧭 Full-Time

πŸ” Technology for small businesses

🏒 Company: JobberπŸ‘₯ 501-1000πŸ’° $100,000,000 Series D almost 2 years agoSaaSMobileSmall and Medium BusinessesTask Management

  • Proven ability to lead and collaborate in team environments.
  • Strong coding skills in Python and SQL.
  • Expertise in building and maintaining ETL pipelines using tools like Airflow and dbt.
  • Experience with AWS tools such as Redshift, Glue, and Lambda.
  • Familiarity with handling large datasets using tools like Spark.
  • Experience with Terraform for infrastructure management.
  • Knowledge of dimensional modelling, star schemas, and data warehousing.

  • Design, develop, and maintain batch and real-time data pipelines within cloud infrastructure (preferably AWS).
  • Develop tools that automate processes and set up monitoring systems.
  • Collaborate with teams to extract actionable insights from data.
  • Lead initiatives to propose new technologies, participate in design and code reviews, and maintain data integrity.

AWSPythonSQLApache AirflowETLSparkTerraform

Posted about 23 hours ago
Apply
Apply

πŸ“ US, Canada

🧭 Full-Time

πŸ’Έ 165200.0 - 295000.0 USD per year

πŸ” Physical Operations, Internet of Things (IoT)

  • Masters or PhD in Computer Science or other quantitative field (e.g., Applied Math, Engineering, Computer Science, Physics).
  • 5+ years experience as a Scientist or Machine Learning Engineer.
  • Proficiency in self-serving with data for experiments and model training at scale.
  • Proficient with Spark, Ray, or a similar framework.
  • Coding in Python or similar.
  • Strong functional knowledge of the iterative machine learning product development process.
  • Experienced in developing and shipping production code.
  • Ability to distill informal or ambiguous customer and business requirements into crisp problem definitions.
  • Proven ability to communicate verbally and in writing to technical peers and leadership teams.
  • Experience coaching and mentoring scientists.

  • Lead design and implementation of critical AI product initiatives.
  • Develop both tactical AI solutions as well as more strategic and longer-term research.
  • Work with petabyte-scale data from customer operations including text, transactions, diagnostics, sensor, camera, and location data.
  • Partner across business units to explore and prototype new AI experiences.
  • Stay connected to industry and academic research and adopt novel technology that suits Samsara’s needs.
  • Champion, role model, and embed Samsara’s cultural principles.

PythonMachine LearningSpark

Posted 4 days ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 55.0 - 65.0 USD per hour

πŸ” Artificial Intelligence

  • Advanced proficiency in English, both verbal and written.
  • Strong experience in either Python or JavaScript, with a preference for candidates with experience in ReactJS if only JavaScript is known.
  • Solid understanding of computer science fundamentals like data structures, algorithms, and debugging skills.
  • A minimum of 2 years of hands-on industry experience and a proven track record in software development or public proof of work.
  • Extensive experience with various tools and systems such as Databases, SQL, Kubernetes, Spark, Kafka, gRPC, and AWS.

  • Curate code examples, offering precise solutions and corrections in Python, JavaScript (including ReactJS), C/C++, and Java for AI model training.
  • Evaluate and refine AI-generated code to ensure compliance with efficiency, scalability, and reliability standards.
  • Collaborate with cross-functional teams to enhance AI-driven coding solutions that meet enterprise-level quality and performance benchmarks.

AWSPythonSQLJavaJavascriptKafkaKubernetesReact.jsC++AlgorithmsData StructuresgRPCSparkDebugging

Posted 5 days ago
Apply
Apply

πŸ“ New Zealand, Australia

🧭 Full-Time

πŸ” Technology and Consulting

  • Hands-on experience in modern data platform architecture and engineering.
  • Strong experience with platforms like Snowflake, Data Bricks, and Azure.
  • Familiarity with data processing tools such as HDFS, Spark, and Kafka.
  • Experience with ETL/ELT packages such as DBT and Informatica.
  • Understanding of virtualization technologies like AWS EC2 and Docker.
  • Strong expertise in relational databases and SQL.
  • Knowledge of data modeling and structures.
  • Experience in data security engineering practices.

  • Lead and grow the data practice.
  • Architect, design, and engineer modern data platforms.
  • Provide consulting and technical support for client projects.
  • Present technical information to diverse audiences.

DockerPythonSQLGCPJavaKafkaKubernetesSnowflakeTypeScriptTableauAirflowAzureGoSparkData modeling

Posted 6 days ago
Apply
Apply

πŸ“ United States, Canada

🧭 Full-Time

πŸ” Digital Advertising

🏒 Company: RedditπŸ‘₯ 1001-5000πŸ’° $410,000,000 Series F over 3 years agoπŸ«‚ Last layoff over 1 year agoNewsContentSocial NetworkSocial Media

  • Degree in a quantitative discipline: engineering, statistics, operations research, computer science, informatics, applied mathematics, or economics.
  • 7+ years of contributing high-quality code to production systems that operate at scale.
  • 5+ years of experience building ads-serving related systems, including targeting, ranking, and pacing.
  • Experience building A/B testing frameworks for multiparty marketplace scenarios.
  • Experience leading large engineering teams and collaborating with cross-functional partners.

  • Building Reddit-scale optimizations to improve advertiser outcomes using cutting-edge techniques.
  • Leveraging live auction data and model predictions for real-time campaign bid adjustments.
  • Incorporating ads marketplace knowledge into budget pacing algorithms.
  • Leading the design of new bid and budget optimization products and algorithms.
  • Conducting rigorous A/B experiments to evaluate business impact and collaborating on long-term team direction.

DockerElasticSearchKafkaKubernetesCassandraGoPostgresRedisSparkScalaA/B testing

Posted 6 days ago
Apply
Apply

πŸ“ CA, WA, NY, NJ, CT, all other U.S. states

🧭 Full-Time

πŸ’Έ 200000.0 - 275000.0 USD per year

πŸ” Financial Technology

  • 8+ years of experience designing, developing, and launching backend systems using Python or Kotlin.
  • Extensive experience with highly available distributed systems utilizing AWS, MySQL, Spark, and Kubernetes.
  • Experience with online, real-time ML infrastructure like model servers or feature stores.
  • Developed offline environments for large scale data analysis and model training using Spark, Kubeflow, Ray, and Airflow.
  • Experience delivering major system features and writing high quality code.
  • Comfortable navigating from low-level language idioms to large system architecture.
  • Mastered gathering feedback and strong communication skills.
  • Bachelor's degree in a related field or equivalent practical experience.

  • Responsible for setting technical strategy for the team on a year-long time scale and linking it with business-impacting projects.
  • Collaborate across teams in the ML development lifecycle with machine learning engineers, platform engineers, and product management.
  • Act as a force-multiplier, defining and advocating for technical solutions and operational processes.
  • Ensure team operations and availability through monitoring, triage rotations, and testing.
  • Foster a culture of quality and ownership by setting standards and advocating beyond the team.
  • Develop talent by providing feedback, guidance, and leading by example.

AWSPythonApache AirflowKotlinKubeflowKubernetesMySQLSpark

Posted 7 days ago
Apply
Apply

πŸ“ United States, Canada

🧭 Full-Time

πŸ’Έ 206700.0 - 289400.0 USD per year

πŸ” Digital advertising

  • Degree in a quantitative discipline: engineering, statistics, operations research, computer science, informatics, applied mathematics, economics, etc.
  • 7+ years of contributing high-quality code to production systems that operate at scale.
  • 5+ years of experience building ads-serving related systems, including ads targeting, ads ranking, ads pacing.
  • Experience building A/B testing frameworks for multiparty marketplace scenarios.
  • Experience leading large engineering teams and collaborating with cross-functional partners, especially data science.

  • Building Reddit-scale optimizations to improve advertiser outcomes using cutting-edge techniques.
  • Leveraging live auction data and model predictions to adjust campaign bids in real time.
  • Incorporating knowledge of the Reddit ads marketplace into budget pacing algorithms.
  • Leading the team on designing new bid and budget optimization products, conducting rigorous A/B experiments.

Cloud ComputingElasticSearchKafkaGoPostgresRedisSparkScalaA/B testing

Posted 7 days ago
Apply
Apply

πŸ“ United States, Canada

🧭 Full-Time

πŸ” Digital Advertising

  • Degree in a quantitative discipline: engineering, statistics, operations research, computer science, informatics, applied mathematics, economics, etc.
  • 7+ years of contributing high-quality code to production systems that operate at scale.
  • 5+ years of experience building ads-serving related systems, including ads targeting, ranking, and pacing.
  • Experience building A/B testing frameworks for multiparty marketplace scenarios.
  • Experience leading large engineering teams and collaborating with cross-functional partners, especially with data science partners.
  • Significant experience in backend programming languages, with a preference for Go or Scala.
  • Experience with API development, service frameworks, data processing frameworks, cloud service providers, and CI/CD tools.

  • Building Reddit-scale optimizations to improve advertiser outcomes using cutting-edge techniques.
  • Leveraging live auction data and model predictions to adjust campaign bids in real time.
  • Leading the design of new bid & budget optimization products and algorithms.
  • Conducting rigorous A/B experiments to evaluate business impact.
  • Collaborating with other leads to set long-term team direction and project execution.

DockerPostgreSQLElasticSearchKafkaKubernetesGoRedisSparkScalaA/B testing

Posted 7 days ago
Apply
Apply

πŸ“ US

πŸ” Automotive

  • 8+ years of professional experience working with large datasets.
  • 8+ years of experience with statistical programming software, preferably Spark and Python.
  • 8+ years of experience with database software like RedShift, Hive, SQL, and MySQL.
  • Master’s Degree in Statistics, Data Science, Economics, or a related field.
  • Ability to communicate technical processes to a lay audience.
  • Presentation experience.
  • Curiosity and passion for solving problems with data.

  • Daily immersion in automotive industry data.
  • Involved in strategic planning for data science initiatives.
  • Create relatable presentations for technical predictions.
  • Document machine learning processes.
  • Deep expertise in feature engineering and AI/ML model building.
  • Collaborate with technology teams to improve processes.
  • Develop analytics with the product team.
  • Lead PI planning with limited supervision.
  • Create visual reports in Tableau.
  • Conduct ad-hoc data analysis and reporting.

AWSPythonSQLMachine LearningTableauData scienceSparkData visualization

Posted 8 days ago
Apply
Apply

πŸ“ Brazil

πŸ” Financial sector

🏒 Company: RecargaPayπŸ‘₯ 501-1000πŸ’° $10,000,000 Debt Financing over 2 years agoMobile PaymentsFinancial ServicesFinTech

  • Experience in the financial sector, including Finance Product, Fintech, or financial ecosystems, focused on data and financial products related to payment processing.
  • Experience with SQL, Python, and Excel.
  • Knowledge of Spark and Databricks.
  • Experience in data extraction, manipulation, and building dashboards and indicators.
  • Proficient in Qlik Sense and other data visualization tools.
  • Expertise in using statistical techniques, algorithms, and data mining to model complex problems.
  • Strong mathematical and financial knowledge.
  • Strong collaborative skills and the ability to work effectively in a team environment.

  • Analyze data to identify financial opportunities and relevant behaviors, generating valuable business insights.
  • Support the risk management area in strategic decision-making processes.
  • Identify and implement continuous improvements in financial planning processes.
  • Propose new technologies and techniques to enhance financial risk and asset management.
  • Discover innovative solutions that provide meaningful business insights.
  • Act as a reference for data analysis tools, ensuring optimal use of available technologies.
  • Foster a collaborative environment across teams and departments.
  • Demonstrate autonomy and proactivity in identifying and solving problems.
  • Ensure the delivery of high-quality results within established deadlines.

PythonSQLData AnalysisData MiningSparkData visualization

Posted 9 days ago
Apply
Shown 10 out of 232