Senior Data Engineer (PySpark, NoSQL)
New
Remotely within PolandFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Languages
- English B2+
- Experience
- 5+ years
- Required Skills
- PythonDynamoDBETLMongoDBAzureCassandraNosqlSparkCI/CDPySpark
Requirements
- 5+ years of experience as a Data Engineer or similar role
- Strong hands-on experience with PySpark in production
- Proven experience in data modeling, partitioning, indexing, and performance tuning in NoSQL systems
- Strong programming skills in Python
- Experience building and operating production-grade pipelines in cloud (Azure)
- Experience with distributed NoSQL databases (e.g., Cosmos DB, Cassandra, DynamoDB, MongoDB)
- Strong understanding of distributed systems and performance optimization
- Experience with CI/CD, monitoring, troubleshooting, and production support
- Strong analytical and communication skills (English B2+)
Responsibilities
- Design and optimize large-scale data pipelines using PySpark
- Build and maintain scalable ETL/ELT workflows in Azure
- Troubleshoot production issues related to performance, latency, and availability
- Work with distributed NoSQL technologies (e.g., Cosmos DB, Cassandra, DynamoDB, MongoDB, or similar)
- Optimize Spark jobs (partitioning, execution plans, resource usage)
- Implement best practices for scalability, security, and reliability
- Collaborate with cross-functional teams on data-driven solutions
- Contribute to automation, CI/CD, and operational improvements
View Full Description & ApplyYou'll be redirected to the employer's site