Mid-Level Data Engineer
Brazil | Full-Time | Middle
Salary not disclosed
Job Details
- Experience: 2–4 years
- Required Skills: AWS, Python, SQL, Apache Airflow, GCP, Azure, NoSQL, Pandas, PySpark
Requirements
- Approximately 2–4 years of experience as a Data Engineer in production environments.
- Strong proficiency in SQL for query optimization, modeling, and performance tuning.
- Strong proficiency in Python for data manipulation (e.g., Pandas, PySpark).
- Experience with cloud platforms such as AWS, GCP, or Azure.
- Experience with data warehouse solutions like BigQuery, Redshift, or Snowflake.
- Hands-on experience with data orchestration tools such as Apache Airflow.
- Knowledge of relational and NoSQL databases, as well as APIs and system integrations.
- Experience working with unstructured data and vector databases (e.g., Pinecone, Milvus, Weaviate, pgvector) for RAG architectures.
- Familiarity with NLP concepts, embeddings, and modern AI/ML data pipelines.
Responsibilities
- Design, build, and maintain scalable batch and streaming data pipelines, ensuring high performance, reliability, and cost efficiency across data workflows.
- Develop and optimize ETL/ELT processes, ensuring data quality, integrity, and consistency across multiple systems and sources.
- Build and maintain data infrastructure supporting AI and Machine Learning pipelines, including structured and unstructured data processing.
- Implement and improve data governance practices, ensuring secure, well-documented, and reusable data assets for analytics and AI use cases.
- Work closely with Data Science and ML Engineering teams to enable efficient data consumption for predictive models and LLM-based solutions.
- Monitor, troubleshoot, and optimize data pipelines and queries, continuously improving performance and reducing operational costs.