Spark Jobs

Find remote positions requiring Spark skills. Browse through opportunities where you can utilize your expertise and grow your career.

Spark
192 jobs found. to receive daily emails with new job openings that match your preferences.
192 jobs found.

Set alerts to receive daily emails with new job openings that match your preferences.

Apply

πŸ“ India

🏒 Company: YipitData (Alternative)

  • 5+ years of proven experience in web application development or application support, particularly in systems with high uptime requirements.
  • Demonstrate strong ability to troubleshoot issues with web/http traffic, python applications, and JS applications.
  • Proficient in Python, Docker, AWS, REST APIs, and database technologies.
  • Diagnose and resolve technical issues in data applications and platform services, including web application performance, optimizing SQL, Pandas, and PySpark queries, and interacting with REST APIs.
  • Work cross-functionally with data analyst teams and platform engineers to coordinate releases, onboard users, and maintain the uptime of critical Plotly Dash applications.
  • Identify and implement process improvements to streamline support workflows, reduce repetitive tasks, and improve application and data platform efficiency.
  • Promote and enforce application best practices, data software management, and cloud infrastructure to enhance system reliability and reduce technical debt.

AWSDockerPythonSQLREST APIPandasSparkCI/CDRESTful APIsTroubleshootingData analyticsDebugging

Posted about 1 hour ago
Apply
Apply

πŸ“ India

πŸ” Software Development

🏒 Company: YipitDataπŸ‘₯ 251-500πŸ’° Debt Financing 9 months agoMarket ResearchAnalyticsData Visualization

  • 5+ years of proven experience in web application development or application support, particularly in systems with high uptime requirements.
  • Strong ability to troubleshoot issues with web/http traffic, python applications, and JS applications.
  • Are proficient in Python, Docker, AWS, REST APIs, and database technologies.
  • Diagnose and resolve technical issues in data applications and platform services, including web application performance, optimizing SQL, Pandas, and PySpark queries, and interacting with REST APIs.
  • Work cross-functionally with data analyst teams and platform engineers to coordinate releases, onboard users, and maintain the uptime of critical Plotly Dash applications.
  • Identify and implement process improvements to streamline support workflows, reduce repetitive tasks, and improve application and data platform efficiency.
  • Promote and enforce application best practices, data software management, and cloud infrastructure to enhance system reliability and reduce technical debt.

AWSDockerPythonSQLGitREST APIPandasSparkTroubleshootingData analyticsDebugging

Posted about 2 hours ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 198050.0 - 233000.0 USD per year

πŸ” Health Tech

  • 3+ years of experience as an Engineering Manager
  • 3-5+ years of experience as a Software Engineer
  • Track record of building and leading high performing teams that have effectively delivered business outcomes through technical investments
  • Strong technical foundation
  • Strong product mindset
  • Track record of working well across teams and functions
  • Prior experience with Python and React (Nice to have)
  • Experience working with AWS infrastructure (Nice to have)
  • Experience in health tech, building user-facing products (Nice to have)
  • BS, MS in Computer Science or related field
  • Lead and manage a team (or teams) of 8+ engineers
  • Ensure there is process and product alignment across your team
  • Develop and grow your team through weekly 1-1s, mentorship, and feedback
  • Work with and help develop aspiring engineering leaders within the engineering team
  • Contribute to the broader company strategy and product roadmap
  • Help build out our engineering team as the headcount grows from 80 to 120+ through the remainder of the year

AWSBackend DevelopmentDockerLeadershipPostgreSQLProject ManagementPythonCloud ComputingData AnalysisKafkaPeople ManagementProduct ManagementReact.jsTypeScriptCross-functional Team LeadershipAlgorithmsData StructuresFastAPIREST APIRedisStrategic ManagementReactSparkCommunication SkillsAnalytical SkillsCI/CDProblem SolvingAgile methodologiesMentoringRelationship buildingTeam managementSoftware EngineeringDebugging

Posted about 14 hours ago
Apply
Apply

πŸ“ Germany

πŸ” AI and data analytics consulting

🏒 Company: Unit8 SA

  • MSc level in the field of Computer Science, Machine Learning, Applied Statistics, Mathematics, or equivalent work experience.
  • Proficient software engineer who has experience in applying a blend of software engineering, machine learning, and statistical methods to solve real-world business problems
  • Proficient in one of the following languages: Python, Scala, Java.
  • Experience with cloud technologies is a strong plus.
  • Work with our customers to understand their challenges, design and implement solutions.
  • Closely collaborate with other data scientists, software engineers and business stakeholders.
  • Evaluate, compare and present results to technical and non-technical audience.
  • Contribute to the implementation and engineering of systems at different scales: from small proof-of-concepts to larger end-to-end data systems.
  • Implement best practices in CI/CD

AWSDockerPythonSQLCloud ComputingData AnalysisETLJavaKubernetesMachine LearningAlgorithmsData engineeringData scienceSparkCI/CDRESTful APIsScalaSoftware Engineering

Posted about 17 hours ago
Apply
Apply

πŸ“ India

πŸ” Market Research and Analytics

  • 5+ years of proven experience in web application development or application support, particularly in systems with high uptime requirements.
  • Strong ability to troubleshoot issues with web/http traffic, python applications, and JS applications.
  • Proficient in Python, Docker, AWS, REST APIs, and database technologies.
  • Diagnose and resolve technical issues in data applications and platform services, including web application performance, optimizing SQL, Pandas, and PySpark queries, and interacting with REST APIs.
  • Work cross-functionally with data analyst teams and platform engineers to coordinate releases, onboard users, and maintain the uptime of critical Plotly Dash applications.
  • Identify and implement process improvements to streamline support workflows, reduce repetitive tasks, and improve application and data platform efficiency.
  • Promote and enforce application best practices, data software management, and cloud infrastructure to enhance system reliability and reduce technical debt.

AWSDockerPythonSQLGitAPI testingREST APIPandasSparkCI/CDRESTful APIsLinuxTroubleshootingData modelingScriptingData analyticsDebugging

Posted 1 day ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 200000.0 - 275000.0 USD per year

πŸ” Software Development

  • 6+ years of experience designing, developing and launching backend systems at scale using languages like Python or Kotlin.
  • Extensive track record of developing highly available distributed systems using technologies like AWS, MySQL, Spark and Kubernetes.
  • Experience delivering major features, system components or deprecating existing functionality in a system through the definition of a technical and execution plan.
  • Write high quality code that is easily understood and used by others.
  • Thrive in ambiguity, and are comfortable moving from low level language idioms all the way to the architecture of large systems to understand how they work.
  • Growth and impact trajectory demonstrates that you have mastered gathering and iterating on feedback from your engineering and cross-functional peers.
  • Strong verbal and written communication skills that support effective collaboration with our global engineering team.
  • Set technical strategy for your team on a year-long time scale, and help your team tie it together with critical, business-impacting projects.
  • Collaborate across teams in the product development lifecycle by collaborating with product management, design & analytics to ensure technical sustainability, risks and trade-offs are well understood and managed.
  • Act as a force-multiplier for your team through your definition and advocacy of technical solutions and operational processes.
  • Mentor less experienced engineers, leading by example, and setting the technical bar high.
  • Take ownership of your team’s operations and availability by ensuring you have the right monitoring, triage rotations, playbooks, policies, testing and alerting in place to support β€œkeep the lights on” & on-call efforts.
  • Foster a culture of quality and ownership on your team by setting code review and design standards for your team, and advocating for them beyond your team through your writing and tech talks.
  • Help develop talent on your team by providing feedback and guidance, and leading by example.

AWSBackend DevelopmentLeadershipProject ManagementPythonSoftware DevelopmentSQLData AnalysisKotlinKubernetesMySQLAlgorithmsData StructuresREST APISparkCommunication SkillsAnalytical SkillsCollaborationCI/CDProblem SolvingMentoringDevOpsWritten communicationMicroservices

Posted 1 day ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 232000.0 - 310000.0 USD per year

πŸ” Software Development

  • 10+ years of experience designing, developing and launching backend systems at scale using languages like Python or Kotlin.
  • Strong experience leading multiple engineering teams to deliver high quality software
  • Track record of successfully leading engineering teams at both rapidly scaling startups and complex larger technology companies.
  • Expertise in synthesizing complex technical requirements, designs, trade-offs, and capabilities into clear decisions to influence ML & engineering direction
  • Extensive experience developing highly available distributed systems using technologies like AWS, MySQL, Spark and Kubernetes.
  • Experience building and operating online, real-time ML infrastructure including a model server and a feature store
  • Experience developing an offline environment for large scale data analysis and model training using technologies including Spark, Kubeflow, Ray, and Airflow
  • Experience delivering major features and system components
  • Set the multi-year, multi-team technical strategy for ML Platform and deliver it through direct implementation or broad technical leadership
  • Partner with technical leaders across the company to create joint roadmaps that will achieve business impacting goals through the advancement of machine learning
  • Act as a force-multiplier for your teams through your definition and advocacy of technical solutions and operational processes
  • You have an ownership mindset, and you will proactively champion investments in availability so that every project in your area achieves its availability targets
  • You will foster a culture of quality and ownership on your team by setting system design standards for your team, and advocating for them beyond your team through your writing and tech talks
  • You will help develop talent on your team by providing feedback and guidance, and leading by example

AWSBackend DevelopmentLeadershipProject ManagementPythonApache AirflowData AnalysisKotlinKubeflowKubernetesMachine LearningMySQLSoftware ArchitectureCross-functional Team LeadershipData engineeringSparkCommunication SkillsRESTful APIsDevOps

Posted 1 day ago
Apply
Apply
πŸ”₯ Data Engineer
Posted 1 day ago

πŸ“ United States

🧭 Full-Time

πŸ” Sustainable Agriculture

🏒 Company: Agrovision

  • Experience with RDBMS (e.g., Teradata, MS SQL Server, Oracle) in production environments is preferred
  • Hands-on experience in data engineering and databases/data warehouses
  • Familiarity with Big Data platforms (e.g., Hadoop, Spark, Hive, HBase, Map/Reduce)
  • Expert level understanding of Python (e.g., Pandas)
  • Proficient in shell scripting (e.g., Bash) and Python data application development (or similar)
  • Excellent collaboration and communication skills with teams
  • Strong analytical and problem-solving skills, essential for tackling complex challenges
  • Experience working with BI teams and tooling (e.g. PowerBI), supporting analytics work and interfacing with Data Scientists
  • Collaborate with data scientists to ensure high-quality, accessible data for analytical and predictive modeling
  • Design and implement data pipelines (ETL’s) tailored to meet business needs and digital/analytics solutions
  • Enhance data integrity, security, quality, and automation, addressing system gaps proactively
  • Support pipeline maintenance, troubleshoot issues, and optimize performance
  • Lead and contribute to defining detailed scalable data models for our global operations
  • Ensure data security standards are met and upheld by contributors, partners and regional teams through programmatic solutions and tooling

PythonSQLApache HadoopBashETLData engineeringData scienceRDBMSPandasSparkCommunication SkillsAnalytical SkillsCollaborationProblem SolvingData modeling

Posted 1 day ago
Apply
Apply

πŸ“ United States

πŸ’Έ 144000.0 - 180000.0 USD per year

πŸ” Software Development

🏒 Company: HungryrootπŸ‘₯ 101-250πŸ’° $40,000,000 Series C almost 4 years agoArtificial Intelligence (AI)Food and BeverageE-CommerceRetailConsumer GoodsSoftware

  • 5+ years of experience in ETL development and data modeling
  • 5+ years of experience in both Scala and Python
  • 5+ years of experience in Spark
  • Excellent problem-solving skills and the ability to translate business problems into practical solutions
  • 2+ years of experience working with the Databricks Platform
  • Develop pipelines in Spark (Python + Scala) in the Databricks Platform
  • Build cross-functional working relationships with business partners in Food Analytics, Operations, Marketing, and Web/App Development teams to power pipeline development for the business
  • Ensure system reliability and performance
  • Deploy and maintain data pipelines in production
  • Set an example of code quality, data quality, and best practices
  • Work with Analysts and Data Engineers to enable high quality self-service analytics for all of Hungryroot
  • Investigate datasets to answer business questions, ensuring data quality and business assumptions are understood before deploying a pipeline

AWSPythonSQLApache AirflowData MiningETLSnowflakeAlgorithmsAmazon Web ServicesData engineeringData StructuresSparkCI/CDRESTful APIsMicroservicesJSONScalaData visualizationData modelingData analyticsData management

Posted 2 days ago
Apply
Apply

πŸ“ United States, Latin America, India

πŸ” Software Development

  • 8+ years as a hands-on Solutions Architect and/or Data Engineer
  • Programming expertise in Java, Python and/or Scala
  • Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
  • SQL and the ability to write, debug, and optimize SQL queries
  • 4-year Bachelor's degree in Computer Science or a related field
  • Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
  • Design and implement data solutions
  • Lead and/or mentor other engineers
  • Develop end-to-end technical solutions into production β€” and to help ensure performance, security, scalability, and robust data integration
  • Programming expertise in Java, Python and/or Scala
  • Client-facing written and verbal communication skills and experience
  • Create and deliver detailed presentations
  • Detailed solution documentation (e.g. including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)

AWSLeadershipPythonSQLCloud ComputingETLGCPJavaSnowflakeAzureData engineeringREST APISparkPresentation skillsDocumentationClient relationship managementScalaData visualizationMentorshipData modeling

Posted 2 days ago
Apply
Shown 10 out of 192