Pessoa Engenheiro de Dados Pleno

New
BrazilFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
2–4 years
Required Skills
AWSPythonSQLApache AirflowGCPAzureNosqlPandasPySpark

Requirements

  • Approximately 2–4 years of experience as a Data Engineer in production environments.
  • Strong proficiency in SQL for query optimization, modeling, and performance tuning.
  • Strong proficiency in Python for data manipulation (e.g., Pandas, PySpark).
  • Experience with cloud platforms such as AWS, GCP, or Azure.
  • Experience with data warehouse solutions like BigQuery, Redshift, or Snowflake.
  • Hands-on experience with data orchestration tools such as Apache Airflow.
  • Knowledge of relational and NoSQL databases, as well as APIs and system integrations.
  • Experience working with unstructured data and vector databases (e.g., Pinecone, Milvus, Weaviate, pgvector) for RAG architectures.
  • Familiarity with NLP concepts, embeddings, and modern AI/ML data pipelines.

Responsibilities

  • Design, build, and maintain scalable batch and streaming data pipelines, ensuring high performance, reliability, and cost efficiency across data workflows.
  • Develop and optimize ETL/ELT processes, ensuring data quality, integrity, and consistency across multiple systems and sources.
  • Build and maintain data infrastructure supporting AI and Machine Learning pipelines, including structured and unstructured data processing.
  • Implement and improve data governance practices, ensuring secure, well-documented, and reusable data assets for analytics and AI use cases.
  • Work closely with Data Science and ML Engineering teams to enable efficient data consumption for predictive models and LLM-based solutions.
  • Monitor, troubleshoot, and optimize data pipelines and queries, continuously improving performance and reducing operational costs.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now