Spark Jobs

Find remote positions requiring Spark skills. Browse opportunities where you can apply your expertise and grow your career.

Spark
184 jobs found.

Set alerts to receive daily emails with new job openings that match your preferences.

πŸ”₯ Staff Applied Scientist
Posted about 3 hours ago

πŸ“ United States, Canada

🧭 Full-Time

πŸ’Έ 162,400 - 223,300 CAD per year

πŸ” Software Development

  • Masters or PhD in Computer Science or other quantitative field (e.g., Applied Math, Engineering, Computer Science, Physics)
  • 8+ years experience as a Scientist or Machine Learning Engineer
  • Proficiency in self-serving with data for experiments and model training at scale
  • Proficient with Spark, Ray, or a similar framework
  • Coding in python or similar
  • Strong functional knowledge of the iterative machine learning product development process
  • Experienced in developing and shipping production code
  • Ability to distill informal or ambiguous customer and business requirements into crisp problem definitions
  • Proven ability to communicate verbally and in writing to technical peers and leadership teams with various levels of technical knowledge
  • Experience coaching and mentoring scientists
  • Lead design and implementation of critical AI product initiatives
  • Develop both tactical AI solutions as well as more strategic and longer term research
  • Work with petabyte-scale data from customer operations including text, transactions, diagnostics, sensor, camera, and location data
  • Partner across business units to explore and prototype new AI experiences
  • Stay connected to industry and academic research and adopt novel technology that suits Samsara’s needs.
  • Champion, role model, and embed Samsara’s cultural principles (Focus on Customer Success, Build for the Long Term, Adopt a Growth Mindset, Be Inclusive, Win as a Team) as we scale globally and across new offices

AWS, Leadership, Project Management, Python, SQL, Artificial Intelligence, Cloud Computing, Data Analysis, Data Mining, Git, Image Processing, Machine Learning, Numpy, Algorithms, Data science, Data Structures, REST API, Pandas, Spark, Tensorflow, Communication Skills, Analytical Skills, Collaboration, CI/CD, Problem Solving, Mentoring, Linux, DevOps, Written communication, Microservices, Data visualization, Data modeling, Software Engineering, Customer Success

Apply
πŸ”₯ Staff Applied Scientist
Posted about 3 hours ago

πŸ“ United States, Canada

🧭 Full-Time

πŸ’Έ 165,200 - 295,000 USD per year

πŸ” IoT, AI

  • Masters or PhD in Computer Science or other quantitative field
  • 8+ years experience as a Scientist or Machine Learning Engineer
  • Proficiency in self-serving with data for experiments and model training at scale
  • Proficient with Spark, Ray, or a similar framework
  • Strong functional knowledge of the iterative machine learning product development process
  • Lead design and implementation of critical AI product initiatives
  • Develop both tactical AI solutions and longer-term research
  • Work with petabyte-scale data from customer operations

Python, Machine Learning, Spark

Apply

πŸ“ Canada

🧭 Full-Time

πŸ’Έ 98,400 - 137,800 CAD per year

πŸ” Software Development

  • A degree in Computer Science or Engineering, and senior-level experience in developing and maintaining software or an equivalent level of education or work experience, and a track record of substantial contributions to software projects with high business impact.
  • Experience planning and leading a team using Scrum agile methodology ensuring timely delivery and continuous improvement.
  • Experience liaising with various business stakeholders to understand their data requirements and convey the technical solutions.
  • Experience with data warehousing and data modeling best practices.
  • Passionate interest in data engineering and infrastructure; ingestion, storage and compute in relational, NoSQL, and serverless architectures
  • Experience developing data pipelines and integrations for high volume, velocity and variety of data.
  • Experience writing clean code that performs well at scale; ideally experienced with languages like Python, Scala, SQL and shell script.
  • Experience with various types of data stores, query engines and data frameworks, e.g. PostgreSQL, MySQL, S3, Redshift, Presto/Athena, Spark and dbt.
  • Experience working with message queues such as Kafka and Kinesis
  • Experience with ETL and pipeline orchestration such as Airflow, AWS Glue
  • Experience with JIRA in managing sprints and roadmaps
  • Lead development and maintenance of scalable and efficient data pipeline architecture
  • Work within cross-functional teams, including Data Science, Analytics, Software Development, and business units, to deliver data products and services.
  • Collaborate with business stakeholders and translate requirements into scalable data solutions.
  • Monitor and communicate project statuses while mitigating risk and resolving issues.
  • Work closely with the Senior Manager to align team priorities with business objectives.
  • Assess and prioritize the team's work, appropriately delegating to others and encouraging team ownership.
  • Proactively share information, actively solicit feedback, and facilitate communication, within teams and other departments.
  • Design, write, test, and deploy high quality scalable code.
  • Maintain high standards of security, reliability, scalability, performance, and quality in all delivered projects.
  • Contribute to shape our technical roadmap as we scale our services and build our next generation data platform.
  • Build, support and lead a high performance, cohesive team of developers, in close partnership with the Senior Manager, Data Analytics.
  • Participate in the hiring process, with an aim of attracting and hiring the best developers.
  • Facilitate ongoing development conversations with your team to support their learning and career growth.

AWS, Leadership, PostgreSQL, Project Management, Python, Software Development, SQL, Agile, ETL, Kafka, Kubernetes, MySQL, SCRUM, Jira, Airflow, Algorithms, Data engineering, Data Structures, Spark, Communication Skills, CI/CD, Mentoring, Coaching, Scala, Team management, Data modeling, Data analytics

Apply

πŸ“ United States, Canada

🧭 Full-Time

πŸ” Software Development

  • 4+ years of backend development experience
  • Proficiency in Python or Typescript
  • Strong experience with AWS services
  • Familiarity with data engineering tools
  • Solid understanding of ELT/ETL processes
  • Build new features for the REST API
  • Work with product management, designers, and QA team
  • Optimize application performance
  • Develop features using AWS tools
  • Automate deployments and CI/CD pipelines

AWS, Backend Development, Node.js, Apache Airflow, DynamoDB, ETL, Kafka, TypeScript, REST API, Spark, CI/CD, Terraform

Posted about 6 hours ago
Apply
πŸ”₯ Staff Data Engineer
Posted about 6 hours ago

πŸ“ United States

πŸ” Software Development

  • 5+ years of work experience as a data engineer/full stack engineering, coding in Python.
  • 5+ years of experience building web scraping tools in python, using Beautiful Soup, Scrapy, Selenium, or similar tooling
  • 3-5 years of deployment experience with CI/CD
  • Strong experience of HTML, CSS, JavaScript, and browser behavior.
  • Experience with RESTful APIs and JSON/XML data formats.
  • Knowledge of cloud platforms and containerization technologies (e.g., Docker, Kubernetes).
  • Advanced understanding of how at least one big data processing technology works under the hood (e.g. Spark / Hadoop / HDFS / Redshift / BigQuery / Snowflake)
  • Use modern tooling to build robust, extensible, and performant web scraping platform
  • Build thoughtful and reliable data acquisition and integration solutions to meet business requirements and data sourcing needs.
  • Deliver best in class infrastructure solutions for flexible and repeatable applications across disparate sources.
  • Troubleshoot, improve and scale existing data pipelines, models and solutions
  • Build upon data engineering's CI/CD deployments, and infrastructure-as-code for provisioning AWS and 3rd party (Apify) services.

AWS, Docker, Python, SQL, Apache Airflow, HTML, CSS, Javascript, Kubernetes, Data engineering, Selenium, Spark, CI/CD, RESTful APIs, JSON

Apply
πŸ”₯ Staff Data Engineer
Posted about 6 hours ago

πŸ“ United States, Canada

🧭 Full-Time

πŸ’Έ 177,000 - 239,000 USD per year

πŸ” Software Development

  • 8+ years of professional software engineering experience
  • 7+ years building data processing applications
  • Hands-on experience with Java, Scala, Python
  • Experience with Big Data engines like Hive, Spark
  • Familiarity with AWS, Azure, or GCP
  • Design, develop, and automate data processing systems
  • Build data engineering strategy
  • Mentor Analytics & Data Engineers
  • Establish architectural roadmaps

AWS, Python, Java, MySQL, Snowflake, Apache Kafka, Data engineering, Postgres, Spark, Scala, Data modeling

Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 190,000 - 215,000 USD per year

πŸ” SaaS Security

  • 8+ years of professional software engineering experience
  • Hands-on development in Python or Go
  • Experience with streaming platforms like Kafka
  • Proficiency in cloud-native development
  • Strong understanding of SaaS and security operations is a plus
  • Architect & implement high-throughput data pipelines
  • Drive technical direction and system design
  • Lead end-to-end execution of complex projects
  • Mentor and coach junior and mid-level engineers
  • Champion engineering excellence and reliability

Python, Cloud Computing, Kafka, Software Architecture, Clickhouse, Data engineering, Go, Spark, CI/CD, Microservices, SaaS

Posted about 23 hours ago
Apply
πŸ”₯ Data Engineer
Posted 1 day ago

πŸ“ United States

πŸ’Έ 201,571.50 - 240,000 USD per year

🏒 Company: Mercari (πŸ‘₯ 101-250, πŸ’° $46,933,777 raised almost 7 years ago, πŸ«‚ last layoff 7 months ago). Industries: Internet, Marketplace, E-Commerce, Mobile.

  • Bachelor of Science degree in Computer Science or closely related field of study and five (5) years of experience as a Data Engineer, Data Specialist, or related occupation where required experience gained.
  • 5 years of experience in the following: Confluence; JIRA; GIT; CI/CD pipelines; Agile methodologies; Java; Python; Data Modeling or Data Warehouse; ETL: Apache Airflow; Container: Docker or Kubernetes; API: gRPC, Tensorflow Serving, or Flask (REST); Database: MySQL, Postgres, Oracle, SqlServer, or Google Spanner; Distributed Processing: Apache Beam or Apache Spark; Machine Learning: Tensorflow, Keras, or Scikit-Learn, etc.; and Cloud: Google Cloud (BigQuery, Google Dataflow, or Google Dataproc, etc.).
  • Design, build, and operate ETL pipelines at scale.
  • Automate processes related to data products and machine learning products.
  • Design data structure for data products.
  • Build knowledge graphs, flow charts, and system diagrams for problem analysis.
  • Develop and operate API/tools related to data products and machine learning products.
  • Provide technical solutions using Big Data technologies and create technical design documents.
  • Design and develop Data Platform using Python, Spark, and BigQuery.
  • Build Devops platform for Continuous Integration/Continuous deployment stack for ETL applications teams.
  • Profile, debug, and optimize apps.

Docker, Python, Apache Airflow, Cloud Computing, ETL, Flask, Git, Java, Keras, Kubernetes, Machine Learning, MySQL, Oracle, Jira, Data engineering, gRPC, Postgres, Spark, Tensorflow, CI/CD, Agile methodologies, DevOps, Data modeling, Confluence

Apply

πŸ“ Latin America

🧭 Full-Time

πŸ” Insurance Industry

🏒 Company: Nearsure (πŸ‘₯ 501-1000). Industries: Staffing Agency, Outsourcing, Software.

  • Bachelor's Degree in Computer Science or related field
  • 5+ years experience with Python and Scala for data engineering
  • 5+ years experience with AWS or GCP
  • 3+ years experience with Kubernetes
  • Expert in SQL programming
  • Design, develop, maintain, and enhance data engineering solutions
  • Build scalable data pipelines with focus on quality
  • Ingest and transform structured and unstructured data
  • Automate existing code and processes

AWS, Python, SQL, Apache Airflow, ETL, Kubernetes, Spark, Scala

Posted 1 day ago
Apply

πŸ“ United States

πŸ’Έ 152,000 - 213,000 USD per year

πŸ” Financial Services

🏒 Company: Gemini (πŸ‘₯ 501-1000, πŸ’° $1,000,000 Secondary Market over 2 years ago, πŸ«‚ last layoff about 2 years ago). Industries: Cryptocurrency, Web3, Financial Services, Finance, FinTech.

  • 4+ years of work experience in analytics and data science domain focusing on financial services-related business problems.
  • 3+ years of experience deploying statistical and machine learning models in production.
  • 2+ years of experience in integrating data science models into applications.
  • Proven experience in developing and deploying ML models at scale, with a deep understanding of model lifecycle management.
  • Knowledge and experience of crypto exchange trading, financial markets, or banking.
  • Extensive knowledge of ML frameworks (Sagemaker or ML Flow) , libraries, data structures, data modeling, and software architecture.
  • Advanced skills with SQL are a must.
  • Proficient in Python.
  • Experience with one or more big data tools and technologies like Snowflake, Databricks, S3, Hadoop, Spark.
  • Experienced in working collaboratively across different teams and departments.
  • Strong technical and business communication.
  • Design and develop Trust & Safety machine learning and AI models to optimize across fraud, crypto exchange trading, and anti money laundering.
  • Distill complex models and analysis into compelling insights for our stakeholders and executives.
  • Analyze large and complex datasets to identify patterns for feature engineering, trends, and anomalies and develop predictive models that can be used for decision-making.
  • Collaborate with software developers to design and implement machine learning systems that can improve the speed and accuracy of the machine learning models.
  • Monitor and analyze the performance of our machine learning models and systems and make necessary improvements to ensure their effectiveness.
  • Stay up-to-date with data science tools and methodologies in technology and financial domain.
  • Perform root cause analysis and resolve production and data issues.

AWS, Python, SQL, Data Analysis, Git, Machine Learning, MLFlow, Snowflake, Software Architecture, Algorithms, Data science, Data Structures, Spark, Communication Skills, Analytical Skills, Problem Solving, RESTful APIs, Data modeling

Posted 1 day ago
Apply
Shown 10 out of 184