Apply

Data Engineer

Posted 5 days ago

💎 Seniority level: Mid-level, 3-5 years

📍 Location: Worldwide

💸 Salary: 145,000 - 160,000 USD per year

🗣️ Languages: English

⏳ Experience: 3-5 years

🪄 Skills: SQL, ETL, MongoDB, Data engineering, Data modeling

Requirements:
  • Proficiency in managing MongoDB databases, including performance tuning and maintenance.
  • Experience with cloud-based data warehousing, particularly using BigQuery.
  • Familiarity with DBT for data transformation and modeling.
  • Exposure to tools like Segment for data collection and integration.
  • Basic knowledge of integrating third-party data sources to build a comprehensive data ecosystem.
Responsibilities:
  • Overseeing our production MongoDB database to ensure optimal performance, reliability, and security.
  • Assisting in the management and optimization of data pipelines into BigQuery, ensuring data is organized and accessible for downstream users (a minimal load-step sketch follows this list).
  • Utilizing DBT to transform raw data into structured formats, making it useful for analysis and reporting.
  • Collaborating on the integration of data from Segment and various third-party sources to create a unified, clean data ecosystem.
  • Working closely with BI, Marketing, and Data Science teams to understand data requirements and ensure our infrastructure meets their needs.
  • Participating in code reviews, learning new tools, and contributing to the refinement of data processes and best practices.
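
Since this listing pairs MongoDB administration with BigQuery pipelines, here is a minimal sketch of one hypothetical load step using pymongo and the google-cloud-bigquery client. Every connection string, database, and table name below is invented for illustration; the posting does not specify any of them.

```python
# Hypothetical sketch: copy a MongoDB collection into BigQuery.
# "appdb", "events", and "analytics.raw_events" are placeholders.
from pymongo import MongoClient
from google.cloud import bigquery

def load_events_to_bigquery(mongo_uri: str, table_id: str) -> None:
    mongo = MongoClient(mongo_uri)
    # Project away the ObjectId so rows are JSON-serializable.
    docs = mongo["appdb"]["events"].find({}, {"_id": 0})

    bq = bigquery.Client()
    job = bq.load_table_from_json(
        list(docs),
        table_id,
        job_config=bigquery.LoadJobConfig(
            autodetect=True,  # let BigQuery infer the schema
            write_disposition="WRITE_APPEND",
        ),
    )
    job.result()  # wait for the load job to finish

load_events_to_bigquery("mongodb://localhost:27017", "analytics.raw_events")
```

A production pipeline would stream batches rather than materialize list(docs), and schema autodetection would usually give way to an explicit schema.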
Apply

Related Jobs

Apply

📍 Worldwide

🔍 Hospitality

🏢 Company: Lighthouse

Requirements:
  • 4+ years of professional experience using Python, Java, or Scala for data processing (Python preferred)
  • You stay up-to-date with industry trends, emerging technologies, and best practices in data engineering.
  • Improve, manage, and teach standards for code maintainability and performance in the code you submit and review
  • Ship large features independently, generate architecture recommendations, and implement them
  • Great communication: regularly achieve consensus amongst teams
  • Familiarity with GCP, Kubernetes (GKE preferred), CI/CD tools (GitLab CI preferred), and the concept of Lambda Architecture.
  • Experience with Apache Beam or Apache Spark for distributed data processing, or with event-sourcing technologies like Apache Kafka (a minimal Beam sketch follows this list).
  • Familiarity with monitoring tools like Grafana & Prometheus.
Responsibilities:
  • Design and develop scalable, reliable data pipelines using the Google Cloud stack.
  • Optimise data pipelines for performance and scalability.
  • Implement and maintain data governance frameworks, ensuring data accuracy, consistency, and compliance.
  • Monitor and troubleshoot data pipeline issues, implementing proactive measures for reliability and performance.
  • Collaborate with the DevOps team to automate deployments and improve developer experience on the data front.
  • Work with data science and analytics teams to enable them to bring their research to production-grade data solutions, using technologies such as Airflow, dbt, or MLflow (but not limited to these).
  • As a part of a platform team, you will communicate effectively with teams across the entire engineering organisation, to provide them with reliable foundational data models and data tools.
  • Mentor and provide technical guidance to other engineers working with data.
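
The requirements above name Apache Beam for distributed processing on GCP. As an illustration only, this is roughly what a small Beam pipeline looks like in Python; the bucket paths and field names are made up, and on GCP you would pass DataflowRunner pipeline options rather than use the default local runner.

```python
# Hypothetical sketch of a small Apache Beam pipeline; paths and
# field names are invented for illustration.
import json
import apache_beam as beam

def parse_event(line: str) -> dict:
    event = json.loads(line)
    return {"hotel_id": event["hotel_id"], "rate": float(event["rate"])}

with beam.Pipeline() as pipeline:
    (
        pipeline
        | "Read raw events" >> beam.io.ReadFromText("gs://example-bucket/events/*.json")
        | "Parse JSON" >> beam.Map(parse_event)
        | "Keep valid rates" >> beam.Filter(lambda e: e["rate"] > 0)
        | "Write output" >> beam.io.WriteToText("gs://example-bucket/clean/rates")
    )
```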

Python, SQL, Apache Airflow, ETL, GCP, Kubernetes, Apache Kafka, Data engineering, CI/CD, Mentoring, Terraform, Scala, Data modeling

Posted 1 day ago
Apply
🔥 Data Engineer
Posted 1 day ago

📍 Canada

🧭 Full-Time

🔍 FinTech

🏢 Company: KOHO

Requirements:
  • 5+ years of mastery in data manipulation and analytics architecture
  • Advanced expertise in dbt (incremental modeling, materializations, snapshots, variables, macros, jinja)
  • Strong command of SQL, including writing efficient queries, query optimization, and data warehouse design
Responsibilities:
  • Building strong relationships with stakeholders (the finance team), scoping and prioritizing their analytics requests.
  • Understanding business needs and translating them to requirements.
  • Using dbt (Core for development and Cloud for orchestration) to transform, test, deploy, and document financial data while applying software engineering best practices (a minimal programmatic-invocation sketch follows this list).
  • Troubleshooting variances in reports, and striving to eliminate them at the source.
  • Building game-changing data products that empower the finance team
  • Architecting solutions that transform complex financial data into actionable insights
  • Monitoring, optimizing and troubleshooting warehouse performance (AWS Redshift).
  • Creating scalable, self-service analytics solutions that democratize data access
  • Occasionally building dashboards and reports in Sigma and Drivetrain.
  • Defining processes, building tools, and offering training to empower all data users in the organization.
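
The responsibilities above lean on dbt Core. As one hypothetical way to wire dbt into automation, dbt-core (1.5+) exposes a documented programmatic entry point, dbtRunner; the selector name below is invented for illustration.

```python
# Hypothetical sketch: driving dbt Core from Python with the documented
# dbtRunner entry point (dbt-core >= 1.5). "finance_marts" is a made-up selector.
from dbt.cli.main import dbtRunner, dbtRunnerResult

runner = dbtRunner()

# Build (run + test) the finance models, then fail loudly in CI if anything broke.
result: dbtRunnerResult = runner.invoke(["build", "--select", "finance_marts"])
if not result.success:
    raise SystemExit("dbt build failed; see logs above")
```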

AWS, Python, SQL, ETL, Data engineering, Data visualization, Data modeling, Finance, Data analytics

Posted 1 day ago
Apply
🔥 Senior Data Engineer
Posted 2 days ago

📍 United States

🧭 Full-Time

💸 183,600 - 216,000 USD per year

🔍 Software Development

Requirements:
  • 6+ years of experience in a data engineering role building products, ideally in a fast-paced environment
  • Good foundations in Python and SQL.
  • Experience with Spark, PySpark, DBT, Snowflake, and Airflow
  • Knowledge of visualization tools such as Metabase and Jupyter Notebooks (Python)
Responsibilities:
  • Collaborate on the design and improvements of the data infrastructure
  • Partner with product and engineering to advocate best practices and build supporting systems and infrastructure for the various data needs
  • Create data pipelines that stitch together various data sources in order to produce valuable business insights (an orchestration sketch follows this list)
  • Create real-time data pipelines in collaboration with the Data Science team
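
Since this listing names Airflow alongside Spark and Snowflake, here is a minimal, assumption-laden sketch of an Airflow 2 DAG that stitches two sources into a daily insights build. Task bodies are stubs and every identifier is invented.

```python
# Hypothetical Airflow 2 DAG; all names and task bodies are placeholders.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def extract_orders(**context):
    print("pull orders from the app database")

def extract_payments(**context):
    print("pull payments from the billing API")

def build_insights(**context):
    print("join sources and publish the reporting table")

with DAG(
    dag_id="business_insights",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",  # Airflow 2.4+ spelling of the schedule argument
    catchup=False,
) as dag:
    orders = PythonOperator(task_id="extract_orders", python_callable=extract_orders)
    payments = PythonOperator(task_id="extract_payments", python_callable=extract_payments)
    insights = PythonOperator(task_id="build_insights", python_callable=build_insights)

    # Both extracts must finish before the insights build runs.
    [orders, payments] >> insights
```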

Python, SQL, Snowflake, Airflow, Data engineering, Spark, Data visualization, Data modeling

Posted 2 days ago
Apply

📍 Argentina

🧭 Full-Time

🔍 Software Development

🏢 Company: Austin Software

Requirements:
  • 5+ years experience as a Data Engineer
  • 4+ years experience with MySQL
  • Experience with Python
  • Experience with Spark jobs written in Scala
  • Experience with Databricks

AWS, Python, MySQL, Data engineering, Spark, Scala

Posted 2 days ago
Apply
🔥 Senior Data Engineer
Posted 3 days ago

📍 United States

🧭 Full-Time

🔍 Healthcare

🏢 Company: Rad AI 👥 101-250 💰 $60,000,000 Series C 2 months ago | Artificial Intelligence (AI), Enterprise Software, Health Care

Requirements:
  • 4+ years relevant experience in data engineering.
  • Expertise in designing and developing distributed data pipelines using big data technologies on large scale data sets.
  • Deep and hands-on experience designing, planning, productionizing, maintaining and documenting reliable and scalable data infrastructure and data products in complex environments.
  • Solid experience with big data processing and analytics on AWS, using services such as Amazon EMR and AWS Batch.
  • Experience in large scale data processing technologies such as Spark.
  • Expertise in orchestrating workflows using tools like Metaflow (a minimal flow sketch follows this list).
  • Experience with a range of database technologies, both SQL and NoSQL (e.g., AWS DynamoDB, Elasticsearch, PostgreSQL).
  • Hands-on experience with containerization technologies, such as Docker and Kubernetes.
Responsibilities:
  • Design and implement the data architecture, ensuring scalability, flexibility, and efficiency using pipeline authoring tools like Metaflow and large-scale data processing technologies like Spark.
  • Define and extend our internal standards for style, maintenance, and best practices for a high-scale data platform.
  • Collaborate with researchers and other stakeholders to understand their data needs including model training and production monitoring systems and develop solutions that meet those requirements.
  • Take ownership of key data engineering projects and work independently to design, develop, and maintain high-quality data solutions.
  • Ensure data quality, integrity, and security by implementing robust data validation, monitoring, and access controls.
  • Evaluate and recommend data technologies and tools to improve the efficiency and effectiveness of the data engineering process.
  • Continuously monitor, maintain, and improve the performance and stability of the data infrastructure.
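
The requirements above call out Metaflow for workflow orchestration. For illustration only, a Metaflow flow has roughly this shape; step bodies are stubs and all names are invented. It would be launched with `python report_data_flow.py run`.

```python
# Hypothetical Metaflow flow; every name and step body is a placeholder.
from metaflow import FlowSpec, step

class ReportDataFlow(FlowSpec):
    @step
    def start(self):
        # Pretend to pull raw report metadata.
        self.records = [{"report_id": i} for i in range(3)]
        self.next(self.transform)

    @step
    def transform(self):
        # Tag each record; real flows might fan out with foreach here.
        self.rows = [{**r, "status": "clean"} for r in self.records]
        self.next(self.end)

    @step
    def end(self):
        print(f"loaded {len(self.rows)} rows")

if __name__ == "__main__":
    ReportDataFlow()
```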

AWS, Docker, SQL, Elasticsearch, ETL, Kubernetes, Data engineering, NoSQL, Spark, Data modeling

Posted 3 days ago
Apply

📍 Worldwide

🧭 Full-Time

Requirements: NOT STATED
Responsibilities:
  • Own the design and implementation of cross-domain data models that support key business metrics and use cases.
  • Partner with analysts and data engineers to translate business logic into performant, well-documented dbt models.
  • Champion best practices in testing, documentation, CI/CD, and version control, and guide others in applying them.
  • Act as a technical mentor to other analytics engineers, supporting their development and reviewing their code.
  • Collaborate with central data platform and embedded teams to improve data quality, metric consistency, and lineage tracking.
  • Drive alignment on model architecture across domains—ensuring models are reusable, auditable, and trusted.
  • Identify and lead initiatives to reduce technical debt and modernise legacy reporting pipelines.
  • Contribute to the long-term vision of analytics engineering at Pleo and help shape our roadmap for scalability and impact.

SQL, Data Analysis, ETL, Data engineering, CI/CD, Mentoring, Documentation, Data visualization, Data modeling, Data analytics, Data management

Posted 3 days ago
Apply
🔥 Senior Data Engineer
Posted 3 days ago

📍 United States

🧭 Full-Time

💸 183,600 - 216,000 USD per year

🔍 Mental Healthcare

🏢 Company: Headway 👥 201-500 💰 $125,000,000 Series C over 1 year ago | Mental Health Care

Requirements:
  • 6+ years of experience in a data engineering role building products, ideally in a fast-paced environment
  • Good foundations in Python and SQL.
  • Experience with Spark, PySpark, DBT, Snowflake, and Airflow
  • Knowledge of visualization tools such as Metabase and Jupyter Notebooks (Python)
  • A knack for simplifying data, expressing information in charts and tables
Responsibilities:
  • Collaborate on the design and improvements of the data infrastructure
  • Partner with product and engineering to advocate best practices and build supporting systems and infrastructure for the various data needs
  • Create data pipelines that stitch together various data sources in order to produce valuable business insights (a PySpark sketch follows this list)
  • Create real-time data pipelines in collaboration with the Data Science team
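
For the "stitch together various data sources" bullet above, here is a hedged PySpark sketch of a batch join plus weekly aggregation; the S3 paths and column names are assumptions, not details from the posting.

```python
# Hypothetical PySpark sketch joining two sources into one insight table.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("stitch_sources").getOrCreate()

appointments = spark.read.parquet("s3://example-bucket/appointments/")
providers = spark.read.parquet("s3://example-bucket/providers/")

# Weekly appointment volume per provider.
weekly_load = (
    appointments.join(providers, on="provider_id", how="left")
    .groupBy("provider_id", F.window("scheduled_at", "7 days").alias("week"))
    .agg(F.count("*").alias("appointments"))
)

weekly_load.write.mode("overwrite").parquet("s3://example-bucket/marts/weekly_load/")
```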

Python, SQL, ETL, Snowflake, Airflow, Data engineering, RDBMS, Spark, RESTful APIs, Data visualization, Data modeling

Posted 3 days ago
Apply

📍 LatAm

🧭 Full-Time

🔍 E-Learning

🏢 Company: Truelogic 👥 101-250 | Consulting, Web Development, Web Design, Software

Requirements:
  • 1-3 years of experience working with PySpark and Apache Spark in Big Data environments.
  • Experience with SQL and relational and NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.).
  • Knowledge of ETL processes and data processing in distributed environments.
  • Familiarity with Apache Hadoop, Hive, or Delta Lake.
  • Experience with cloud storage (AWS S3, Google Cloud Storage, Azure Blob).
  • Proficiency in Git and version control.
  • Strong problem-solving skills and a proactive attitude.
  • A passion for learning and continuous improvement.
Responsibilities:
  • Design, develop, and optimize data pipelines using PySpark and Apache Spark.
  • Integrate and process data from multiple sources (databases, APIs, files, streaming; a streaming sketch follows this list).
  • Implement efficient data transformations for Big Data in distributed environments.
  • Optimize code to improve performance, scalability, and efficiency in data processing.
  • Collaborate with Data Science, BI, and DevOps teams to ensure seamless integration.
  • Monitor and debug data processes to ensure quality and reliability.
  • Apply best practices in data engineering and maintain clear documentation.
  • Stay up to date with the latest trends in Big Data and distributed computing.
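
The streaming bullet above maps naturally onto Spark Structured Streaming. As a sketch under stated assumptions (invented broker, topic, and paths; the job also needs the spark-sql-kafka connector package on the classpath):

```python
# Hypothetical Structured Streaming job reading a Kafka topic into parquet.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("kafka_ingest").getOrCreate()

events = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")
    .option("subscribe", "course_events")
    .load()
    # Kafka delivers key/value as binary; cast to strings for downstream use.
    .select(F.col("key").cast("string"), F.col("value").cast("string"))
)

query = (
    events.writeStream.format("parquet")
    .option("path", "s3://example-bucket/bronze/course_events/")
    .option("checkpointLocation", "s3://example-bucket/checkpoints/course_events/")
    .start()
)
query.awaitTermination()
```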

PostgreSQL, SQL, Apache Hadoop, Cloud Computing, ETL, Git, MongoDB, MySQL, Apache Kafka

Posted 3 days ago
Apply

📍 United States

🧭 Full-Time

💸 114,000 - 171,599 USD per year

🔍 Fintech

Requirements:
  • Strong expertise in data pipeline development (ETL/ELT) and workflow automation.
  • Proficiency in Python, SQL, and scripting languages for data processing and automation.
  • Hands-on experience with Workato, Google Apps Script, and API-driven automation.
Responsibilities:
  • Automate customer support, success, and service workflows to improve speed, accuracy, and responsiveness.
  • Build and maintain scalable ETL/ELT pipelines to ensure real-time access to critical customer data (a minimal staging sketch follows this list).
  • Implement self-service automation to enable customers and internal teams to quickly access information.
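
For the ETL/ELT bullet above, a minimal sketch of an API-driven staging step: pull support tickets from a REST endpoint and stage them as newline-delimited JSON for the warehouse. The endpoint, token, and fields are entirely hypothetical, since the posting names Workato and Google Apps Script rather than any specific API.

```python
# Hypothetical API-to-staging step; URL, token, and fields are invented.
import json

import requests

def stage_recent_tickets(api_url: str, token: str, out_path: str) -> int:
    response = requests.get(
        f"{api_url}/tickets",
        headers={"Authorization": f"Bearer {token}"},
        params={"updated_since": "2024-01-01"},
        timeout=30,
    )
    response.raise_for_status()
    tickets = response.json()

    with open(out_path, "w") as f:
        for ticket in tickets:
            # Keep only the fields downstream models rely on.
            f.write(json.dumps({"id": ticket["id"], "status": ticket["status"]}) + "\n")
    return len(tickets)

stage_recent_tickets("https://support.example.com/api", "TOKEN", "tickets.jsonl")
```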

Python, SQL, ETL, Jira, API testing, Data engineering, CI/CD, RESTful APIs, Data visualization, Scripting, Customer Success

Posted 3 days ago
Apply

📍 Germany, Austria, Italy, Spain, Portugal

🔍 Financial and Real Estate

🏢 Company: PriceHubble 👥 101-250 💰 Non-equity Assistance over 3 years ago | Artificial Intelligence (AI), PropTech, Big Data, Machine Learning, Analytics, Real Estate

Requirements:
  • 3+ years of experience building and maintaining production data pipelines.
  • Excellent English communication skills, both spoken and written, to collaborate effectively with cross-functional teams and mentor other engineers; clear writing is key in our remote-first setup.
  • Proficient in working with geospatial data and leveraging geospatial features (a minimal GeoPandas sketch follows this list).
Responsibilities:
  • Work with backend engineers and data scientists to turn raw data into trusted insights, handling everything from scraping and ingestion to transformation and monitoring.
  • Navigate cost-value trade-offs to make decisions that deliver value to customers at an appropriate cost.
  • Develop solutions that work in over 10 countries, considering local specifics.
  • Lead a project from concept to launch with a temporary team of engineers.
  • Raise the bar and drive the team to deliver high-quality products, services, and processes.
  • Improve the performance, data quality, and cost-efficiency of our data pipelines at scale.
  • Maintain and monitor the data systems your team owns.
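
Since the requirements emphasize geospatial features, here is a small GeoPandas sketch of one such feature: counting points of interest within 500 m of each property. The file names, the property_id column, and the CRS choice are assumptions for illustration.

```python
# Hypothetical geospatial feature: POIs within 500 m of each property.
import geopandas as gpd

# Reproject to a metric CRS so buffer distances are in meters.
properties = gpd.read_file("properties.geojson").to_crs(epsg=3857)
pois = gpd.read_file("pois.geojson").to_crs(epsg=3857)

# 500 m buffer around each property, then a spatial join against POIs.
buffered = properties.copy()
buffered["geometry"] = buffered.geometry.buffer(500)

joined = gpd.sjoin(buffered, pois, predicate="intersects", how="left")
poi_counts = (
    joined.groupby("property_id")["index_right"].count().rename("pois_within_500m")
)

print(poi_counts.head())
```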

AWS, Docker, Leadership, PostgreSQL, Python, SQL, Apache Airflow, Cloud Computing, Data Analysis, ETL, Git, Kubernetes, Apache Kafka, Data engineering, Data science, Spark, CI/CD, Problem Solving, RESTful APIs, Mentoring, Linux, Excellent communication skills, Teamwork, Cross-functional collaboration, Data visualization, Data modeling, Data management, English communication

Posted 3 days ago
Apply