Apply

Staff Data Engineer

Posted 7 days agoViewed

View full description

πŸ’Ž Seniority level: Staff

πŸ“ Location: United States

πŸ” Industry: Software Development

πŸ—£οΈ Languages: English

πŸͺ„ Skills: AWSBackend DevelopmentPostgreSQLPythonSQLApache AirflowETLData engineeringREST APINodeJSSoftware EngineeringData analytics

Requirements:
  • Have built scalable web scraping platforms from the ground up
  • Experience juggling multiple projects with shifting priorities while continuing to deliver value to the business
  • Be a curious, detail oriented, self-starter who wants to take full ownership of high impact projects with visibility throughout the organization
Responsibilities:
  • Play a key role in the implementation and evolution of our web scraping and data platform
  • Craft and implement a best in class web scraping strategy and infrastructure
  • Build and scale pipelines that garner millions of records, across hundreds of sites, stored as measurable data that enable insights for our Analytics team and our customers
Apply

Related Jobs

Apply

πŸ“ United States

🧭 Full-Time

πŸ” Software Development

🏒 Company: RulaπŸ‘₯ 251-500πŸ’° Series C 7 months agoPersonal HealthMental HealthAddiction TreatmentHealth InsuranceWellnessHealth CareHome Health Care

  • Experience building and maintaining ETL/ELT pipelines using tools like AWS Glue, DBT, Dagster, or similar orchestration frameworks.
  • Experience in Python and SQL for data processing and transformation.
  • Experience in AWS services such as Redshift, S3, Glue, and IAM.
  • Experience designing and optimizing data warehouses (Redshift, Snowflake) and managing S3 data lakes.
  • Experience implementing data validation, quality checks, and error-handling mechanisms.
  • Familiarity with data governance practices, including metadata management and documentation.
NOT STATED

AWSPythonSQLData AnalysisETLAmazon Web ServicesApache KafkaData engineeringCI/CDTerraformData modeling

Posted 8 days ago
Apply
Apply

πŸ“ North America

πŸ” Software Development

NOT STATED
NOT STATED

AWSBackend DevelopmentGraphQLSQLElasticSearchETLKafkaRuby on RailsSoftware ArchitectureAlgorithmsData engineeringData StructuresGoRedisCI/CDRESTful APIsMicroservicesData modeling

Posted 8 days ago
Apply
Apply

πŸ“ United States

πŸ’Έ 131414.0 - 197100.0 USD per year

πŸ” Mental healthcare

🏒 Company: HeadspaceπŸ‘₯ 11-50WellnessHealth CareChild Care

  • 10+ years of success in enterprise data solutions and high-impact initiatives.
  • Expertise in platforms like Databricks, Snowflake, dbt, and Redshift.
  • Experience designing and optimizing real-time and batch ETL pipelines.
  • Demonstrated leadership and mentorship abilities in engineering.
  • Strong collaboration skills with product and analytics stakeholders.
  • Bachelor’s or advanced degree in Computer Science, Engineering, or a related field.
  • Drive the architecture and implementation of pySpark data pipelines.
  • Create and enforce design patterns in code and schema.
  • Design and lead secure and compliant data warehousing platforms.
  • Partner with analytics and product leaders for actionable insights.
  • Mentor team members on dbt architecture and foster a data-first culture.
  • Act as a thought leader on data strategy and cross-functional roadmaps.

SQLCloud ComputingETLSnowflakeData engineeringData modelingData analytics

Posted 18 days ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 130000.0 - 170000.0 USD per year

πŸ” Data Engineering

  • 8+ years experience in a data engineering role
  • Strong knowledge of REST-based APIs and cloud technologies (AWS, Azure, GCP)
  • Experience with Python/SQL for building data pipelines
  • Bachelor's degree in computer science or related field
  • Design and build data pipelines across various source systems
  • Collaborate with teams to develop data acquisition and integration strategies
  • Coach and guide others in scalable pipeline building
  • Deploy to cloud-based platforms and troubleshoot issues

AWSDockerPythonSQLApache AirflowCloud ComputingETLGCPMachine LearningSnowflakeData engineeringREST APIData modeling

Posted 24 days ago
Apply
Apply
πŸ”₯ Staff Data Engineer
Posted about 1 month ago

πŸ“ United States

🧭 Full-Time

πŸ’Έ 170000.0 - 195000.0 USD per year

πŸ” Healthcare

🏒 Company: Parachute HealthπŸ‘₯ 101-250πŸ’° $1,000 about 5 years agoMedicalHealth CareSoftware

  • 5+ years of relevant experience.
  • Experience in Data Engineering with Python.
  • Experience building customer-facing software.
  • Strong listening and communication skills.
  • Time management and organizational skills.
  • Proactive, a driven self-starter who can work independently or as part of a team.
  • Ability to think with the 'big picture' in mind.
  • Passionate about improving patient outcomes in the healthcare space.
  • Architect solutions to integrate and manage large volumes of data across various internal and external systems.
  • Establish best practices and data governance standards to ensure that data infrastructure is built for long-term scalability.
  • Build and maintain a reporting product for external customers that visualizes data and provides tabular reports.
  • Collaborate across the organization to assess data engineering needs.

PythonETLAirflowData engineeringData visualization

Posted about 1 month ago
Apply
Apply
πŸ”₯ Staff Data Engineer
Posted about 2 months ago

πŸ“ United States

πŸ” Cyber security

🏒 Company: BeyondTrustπŸ‘₯ 1001-5000πŸ’° Private over 3 years agoCloud ComputingSecurityCloud SecurityCyber SecuritySoftware

  • Strong programming and technology knowledge in cloud data processing.
  • Previous experience working in matured data lakes.
  • Strong data modelling skills for analytical workloads.
  • Spark (or equivalent parallel processing framework) experience is needed; existing Databricks knowledge is a plus.
  • Interest and aptitude for cybersecurity; interest in identity security is highly preferred.
  • Technical understanding of underlying systems and computation minutiae.
  • Experience working with distributed systems and data processing on object stores.
  • Ability to work autonomously.
  • Optimize data workloads at a software level by improving processing efficiency.
  • Develop new data processing routes to remove redundancy or reduce transformation overhead.
  • Monitor and maintain existing data workflows.
  • Use observability best practices to ensure pipeline performance.
  • Perform complex transformations on both real time and batch data assets.
  • Create new ML/Engineering solutions to tackle existing issues in the cybersecurity space.
  • Leverage CI/CD best practices to effectively develop and release source code.

PythonSparkCI/CDData modeling

Posted about 2 months ago
Apply
Apply

πŸ“ United States, Canada

🧭 Full-Time

πŸ’Έ 170000.0 - 205000.0 USD per year

πŸ” Healthcare

🏒 Company: Wellth

  • 7+ years in analytics engineering or data analysis in healthcare
  • Hands-on experience with healthcare data sets
  • Proficiency in SQL, Python, and dbt
  • Lead design and implementation of data pipelines for healthcare data
  • Create foundational data layers for analytics
  • Ensure data quality and consistency

PythonSQLApache AirflowETLGitData engineeringData visualizationData modeling

Posted 2 months ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 179000.0 - 277000.0 USD per year

πŸ” Healthcare

🏒 Company: Komodo HealthπŸ‘₯ 100-500πŸ’° $200,000,000 about 2 years agoπŸ«‚ Last layoff about 2 years agoPredictive AnalyticsInformation TechnologyHealth CareSoftware

  • Deep expertise in software and data or related fields in healthcare and technology.
  • US Healthcare claims data experience.
  • Extensive experience building scalable, best-in-class solutions.
  • Demonstrated record of thought leadership and solution design.
  • Strong ability to communicate clearly with both technical and non-technical teams.
  • Knowledge of large-scale data and computational technologies.
  • Experience with SQL and query design on large, complex datasets.
  • Ability to use a variety of databases, ideally Snowflake on AWS.
  • Partnering with Engineering team members, Product Managers, and Data Scientists to understand complex health data use cases.
  • Building foundational pieces of the data platform architecture, pipelines, analytics, and services.
  • Architecting and developing reliable data pipelines that transform data at scale using SQL and Python in Snowflake.
  • Contributing to python packages in Github and APIs following current best practices.

PythonSQLSnowflakeAirflowAlgorithmsData engineeringData modeling

Posted 3 months ago
Apply
Apply

πŸ“ AR, CA, CO, FL, GA, IL, KY, MA, MI, MT, MO, NV, NJ, NY, NC, OR, PA, TX, WA, WI

πŸ” Food waste reduction and grocery technology

🏒 Company: AfreshπŸ‘₯ 51-100πŸ’° $115,000,000 Series B over 2 years agoArtificial Intelligence (AI)LogisticsFood and BeverageMachine LearningAgricultureSupply Chain ManagementSoftware

  • 6+ years of experience as a data engineer, analytics engineer, or similar role.
  • Strong understanding of advanced SQL concepts.
  • Exceptional communication and leadership skills.
  • 1+ years of experience with SQL-driven transform libraries supporting ELT, including CI/CD pipelines.
  • Expert knowledge of OLTP and OLAP database design.
  • Familiarity with data engineering concepts like Data Mesh, Data Lake, Data Warehouse.
  • Experience with semantic layer setup defined with code (LookML, Cube.dev, etc.).
  • Technologies: SQL, Python, Airflow, dbt, Snowflake/Databricks/BigQuery, Spark.
  • Improve and extend data analytics architecture for reliable data across use cases.
  • Collaborate with engineers, product managers, and data scientists to understand data needs.
  • Build dimensional models and metrics for consistent insights.
  • Evolve existing data quality and governance processes.
  • Mentor and up-skill other engineers.

LeadershipPythonSQLSnowflakeAirflowData engineeringSparkCollaborationCI/CD

Posted 4 months ago
Apply