Apply

Senior Data Engineer

Posted 5 days agoViewed

View full description

πŸ’Ž Seniority level: Senior, 8+ years

πŸ“ Location: US

πŸ’Έ Salary: 103200.0 - 128950.0 USD per year

πŸ” Industry: Genetics and healthcare

🏒 Company: NateraπŸ‘₯ 1001-5000πŸ’° $250,000,000 Post-IPO Equity over 1 year agoπŸ«‚ Last layoff almost 2 years agoWomen'sBiotechnologyMedicalGeneticsHealth Diagnostics

πŸ—£οΈ Languages: English

⏳ Experience: 8+ years

πŸͺ„ Skills: AWSPythonSQLAgileDynamoDBSnowflakeData engineeringPostgresSparkData modelingData management

Requirements:
  • BS degree in computer science or a comparable program or equivalent experience.
  • 8+ years of overall software development experience, ideally in complex data management applications.
  • Experience with SQL and No-SQL databases including Dynamo, Cassandra, Postgres, Snowflake.
  • Proficiency in data technologies such as Hive, Hbase, Spark, EMR, Glue.
  • Ability to manipulate and extract value from large datasets.
  • Knowledge of data management fundamentals and distributed systems.
Responsibilities:
  • Work with other engineers and product managers to make design and implementation decisions.
  • Define requirements in collaboration with stakeholders and users to create reliable applications.
  • Implement best practices in development processes.
  • Write specifications, design software components, fix defects, and create unit tests.
  • Review design proposals and perform code reviews.
  • Develop solutions for the Clinicogenomics platform utilizing AWS cloud services.
Apply

Related Jobs

Apply

πŸ“ USA

πŸ’Έ 152960.0 - 183552.0 USD per year

πŸ” Data Engineering and Observability Solutions

  • Software development skills in Python, Java, Scala, or Go.
  • High proficiency in SQL.
  • Experience with workflow orchestration systems like Prefect, Dagster, or Airflow.
  • Knowledge of MLOps best practices.
  • Familiarity with dbt or similar data transformation tools.
  • Excellent communication skills for technical topics.

  • Build and maintain production quality data pipelines between operational systems and BigQuery.
  • Implement data quality and freshness checks to ensure data accuracy and consistency.
  • Build and maintain machine learning pipelines for automated model validation and deployment.
  • Create and maintain documentation for data engineering processes and workflows.
  • Maintain observability and monitoring of internal data pipelines.
  • Troubleshoot data pipeline issues to ensure data availability.
  • Contribute to dbt systems ensuring efficiency and availability.

PythonSQLETLGCPMachine LearningData engineering

Posted 16 days ago
Apply
Apply

πŸ“ USA

🧭 Full-Time

πŸ” Construction

🏒 Company: EquipmentShareπŸ‘₯ 1001-5000πŸ’° $400,000,000 Debt Financing over 1 year agoConstruction

  • 7+ years of relevant data platform development experience.
  • Proficient with SQL and a high-order object-oriented language, such as Python.
  • Experience designing and building distributed data architecture.
  • Experience with production-grade data pipelines using tools like Airflow.
  • Familiar with distributed systems like Kafka and Spark.
  • Knowledge of event data streaming at scale.
  • Ability to learn new technologies quickly.
  • Experience incorporating observability and monitoring into data products.

  • Collaborate with Product Managers, Designers, Engineers, Data Scientists, and Data Analysts to transition concepts to production.
  • Design, build, and maintain a data platform for automation and self-service.
  • Develop and support a data product framework for analytics features.
  • Create and manage CI/CD pipelines for data and machine learning deployments.
  • Develop data monitoring and alerting capabilities.
  • Document architecture and processes for knowledge sharing.
  • Mentor peers to enhance team skills.

AWSPythonSQLApache AirflowKafkaMLFlowSnowflakeSparkCI/CD

Posted 19 days ago
Apply
Apply

πŸ“ United States, United Kingdom, Spain, Estonia

πŸ” Identity verification

🏒 Company: VeriffπŸ‘₯ 501-1000πŸ’° $100,000,000 Series C almost 3 years agoπŸ«‚ Last layoff over 1 year agoArtificial Intelligence (AI)Fraud DetectionInformation TechnologyCyber SecurityIdentity Management

  • Expert-level knowledge of SQL, particularly with Redshift.
  • Strong experience in data modeling with an understanding of dimensional data modeling best practices.
  • Proficiency in data transformation frameworks like dbt.
  • Solid programming skills in languages used in data engineering, such as Python or R.
  • Familiarity with orchestration frameworks like Apache Airflow or Luigi.
  • Experience with data from diverse sources including RDBMS and APIs.

  • Collaborate with business stakeholders to design, document, and implement robust data models.
  • Build and optimize data pipelines to transform raw data into actionable insights.
  • Fine-tune query performance and ensure efficient use of data warehouse infrastructure.
  • Ensure data reliability and quality through rigorous testing and monitoring.
  • Assist in migrating from batch processing to real-time streaming systems.
  • Expand support for various use cases including business intelligence and analytics.

PythonSQLApache AirflowETLData engineeringJSONData modeling

Posted 24 days ago
Apply
Apply

πŸ“ USA

🧭 Full-Time

πŸ’Έ 165000.0 - 210000.0 USD per year

πŸ” E-commerce and AI technologies

🏒 Company: WizardπŸ‘₯ 11-50Customer ServiceManufacturing

  • 5+ years of professional experience in software development with a focus on data engineering.
  • Bachelor's degree in Computer Science or a related field, or equivalent practical experience.
  • Proficiency in Python with software engineering best practices.
  • Strong expertise in building ETL pipelines using tools like Apache Spark.
  • Hands-on experience with NoSQL databases like MongoDB, Cassandra, or DynamoDB.
  • Proficiency in real-time stream processing systems such as Kafka or AWS Kinesis.
  • Experience with cloud platforms (AWS, GCP, Azure) and technologies like Delta Lake and Parquet files.

  • Develop and maintain scalable data infrastructure for batch and real-time processing.
  • Build and optimize ETL pipelines for efficient data flow.
  • Collaborate with data scientists and cross-functional teams for accurate monitoring.
  • Design backend data solutions for microservices architecture.
  • Implement and manage integrations with third-party e-commerce platforms.

AWSPythonDynamoDBElasticSearchETLGCPGitHadoopKafkaMongoDBRabbitmqAzureCassandraRedis

Posted 25 days ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

🏒 Company: Avalore, LLC

  • Master’s or PhD in statistics, mathematics, computer science, or related field.
  • 8+ years of experience as a Data Engineer within the IC.
  • Outstanding communication skills, influencing abilities, and client focus.
  • Professional proficiency in English is required.
  • Current, active Top Secret security clearance.
  • Applicants must be currently authorized to work in the United States on a full-time basis.

  • Develops and documents data pipelines for ingest, transformation, and preparation of data for AI applications.
  • Designs scalable technologies such as streaming and transformation, joining disparate data sets for predictive analytics.
  • Develops API interfaces for accessibility.
  • Leads technical efforts and guides development teams.

PythonSQLApache AirflowArtificial IntelligenceETLMachine LearningAPI testingData engineering

Posted 27 days ago
Apply
Apply

πŸ“ USA

🧭 Full-Time

πŸ’Έ 190000.0 - 220000.0 USD per year

πŸ” B2B data / Data as a Service (DaaS)

🏒 Company: People Data LabsπŸ‘₯ 101-250πŸ’° $45,000,000 Series B about 3 years agoDatabaseArtificial Intelligence (AI)Developer APIsMachine LearningAnalyticsB2BSoftware

  • 5-7+ years industry experience with strategic technical problem-solving.
  • Strong software development fundamentals.
  • Experience with Python.
  • Expertise in Apache Spark (Java, Scala, or Python-based).
  • Proficiency in SQL.
  • Experience building scalable data processing systems.
  • Familiarity with data pipeline orchestration tools (e.g., Airflow, dbt).
  • Knowledge of modern data design and storage patterns.
  • Experience working in Databricks.
  • Familiarity with cloud computing services (e.g., AWS, GCP, Azure).
  • Experience in data warehousing technologies.
  • Understanding of modern data storage formats and tools.

  • Build infrastructure for ingestion, transformation, and loading of data using Spark, SQL, AWS, and Databricks.
  • Create an entity resolution framework for merging billions of entities into clean datasets.
  • Develop CI/CD pipelines and anomaly detection systems to enhance data quality.
  • Provide solutions to undefined data engineering problems.
  • Assist Engineering and Product teams with data-related technical issues.

AWSPythonSQLKafkaAirflowData engineeringPandasCI/CD

Posted 29 days ago
Apply
Apply
πŸ”₯ Senior Data Engineer
Posted about 2 months ago

πŸ“ United States, Canada

πŸ” Advanced analytics consulting

🏒 Company: Tiger AnalyticsπŸ‘₯ 1001-5000AdvertisingConsultingBig DataNewsMachine LearningAnalytics

  • Bachelor’s degree in Computer Science or similar field.
  • 8+ years of experience in a Data Engineer role.
  • Experience with relational SQL and NoSQL databases like MySQL, Postgres.
  • Strong analytical skills and advanced SQL knowledge.
  • Development of ETL pipelines using Python & SQL.
  • Good experience with Customer Data Platforms (CDP).
  • Experience in SQL optimization and performance tuning.
  • Data modeling and building high-volume ETL pipelines.
  • Working experience with any cloud platform.
  • Experience with Google Tag Manager and Power BI is a plus.
  • Experience with object-oriented scripting languages: Python, Java, Scala, etc.
  • Experience extracting/querying/joining large data sets at scale.
  • Strong communication and organizational skills.

  • Designing, building, and maintaining scalable data pipelines on cloud infrastructure.
  • Working closely with cross-functional teams.
  • Supporting data analytics, machine learning, and business intelligence initiatives.

PythonSQLBusiness IntelligenceETLJavaMySQLPostgresNosqlAnalytical SkillsOrganizational skillsData modeling

Posted about 2 months ago
Apply
Apply
πŸ”₯ Senior Data Engineer
Posted about 2 months ago

πŸ“ US

🧭 Full-Time

πŸ” Cloud integration technology

🏒 Company: Cleo (US)

  • 5-7+ years of experience in data engineering focusing on AI/ML models.
  • Hands-on expertise in data transformation and building data pipelines.
  • Leadership experience in mentoring data engineering teams.
  • Strong experience with cloud platforms and big data technologies.

  • Lead the design and build of scalable, reliable, and efficient data pipelines.
  • Set data infrastructure strategy for data warehouses and lakes.
  • Hands-on data transformation for AI/ML models.
  • Build data structures and manage metadata.
  • Implement data quality controls.
  • Collaborate with cross-functional teams to meet data requirements.
  • Optimize ETL processes for AI/ML.
  • Ensure data pipelines support model training needs.
  • Define data governance practices.

LeadershipArtificial IntelligenceETLMachine LearningStrategyData engineeringData StructuresMentoring

Posted about 2 months ago
Apply
Apply

πŸ“ ANY STATE

πŸ” Data and technology

  • 5+ years of experience making contributions in the form of code.
  • Experience with algorithms and data structures and knowing when to apply them.
  • Experience with machine learning techniques to develop better predictive and clustering models.
  • Experience working with high-scale systems.
  • Experience creating powerful machine learning tools for experimentation and productionalization at scale.
  • Experience in data engineering and warehousing to develop ingestion engines, ETL pipelines, and organizing data for consumption.

  • Be a senior member of the team by contributing to the architecture, design, and implementation of EMS systems.
  • Mentor junior engineers and promote their growth.
  • Lead technical projects and manage planning, execution, and success of complex technical projects.
  • Collaborate with other engineering, product, and data science teams to ensure optimal product development.

PythonSQLETLGCPKubeflowMachine LearningAlgorithmsData engineeringData scienceData StructuresTensorflowCollaborationScala

Posted about 2 months ago
Apply
Apply
πŸ”₯ Senior Data Engineer
Posted about 2 months ago

πŸ“ USA

🧭 Full-Time

πŸ” Energy analytics and forecasting

  • Senior level experience within data engineering with primary focus using Python.
  • Experience with cloud-based infrastructure (Kubernetes/Docker) and data services (GCP, AWS, Azure, etc.).
  • Proven track record of delivering results that impact the business through building data pipelines.
  • Experience working on complex large codebases with a focus on refactoring and enhancements.
  • Experience building data monitoring pipelines with a focus on scalability.

  • Rebuilding systems to identify more efficient ways to process data.
  • Automate the entire forecasting pipeline, including data collection, preprocessing, model training, and deployment.
  • Continuously monitor system performance and optimize data processing workflows to reduce latency and improve efficiency.
  • Set up real-time monitoring for data feeds to detect anomalies or issues promptly.
  • Utilize distributed computing and parallel processing to handle large-scale data.
  • Design your data infrastructure to be scalable to accommodate future growth in data volume and sources.

AWSDockerPythonGCPKubernetesAzureData engineering

Posted about 2 months ago
Apply