Senior Python Data Scraping Engineer - AI-Human Workflow
Location: United States
Employment type: Contract
Seniority: Senior
Salary: Up to 30 USD per hour
Job Details
- Languages: English
- Experience: 5+ years
- Required Skills: AWS, Docker, Python, Selenium, JSON, Google Sheets, LangChain
Requirements
- At least 5 years of experience in data engineering, web scraping, automation, or software development
- Strong hands-on experience with Python scraping frameworks such as BeautifulSoup, Selenium, or equivalent tools
- Proven ability to extract data from complex and inconsistent web structures, including dynamic and JavaScript-heavy sites
- Experience working with APIs, proxies, and anti-bot mitigation techniques
- Strong background in data cleaning, normalization, transformation, and validation
- Experience delivering structured datasets in formats such as CSV, JSON, or Google Sheets
- Familiarity with cloud environments such as AWS or equivalent platforms
- Experience using containerization tools like Docker in production workflows
- Exposure to LLM tooling such as LangChain, OpenRouter, or similar automation tools
- Strong analytical mindset with exceptional attention to detail and data accuracy
- Ability to work independently, troubleshoot issues, and manage tasks without supervision
- Upper-intermediate or higher English proficiency (B2+ required)
Responsibilities
- Own end-to-end web scraping and data extraction workflows across complex and dynamic websites
- Build, maintain, and optimize Python-based scraping systems using tools such as BeautifulSoup, Selenium, and API integrations
- Extract structured datasets from dynamic content sources including JavaScript-rendered pages, AJAX calls, and infinite scroll interfaces
- Use internal tools (such as Apify and OpenRouter) alongside custom-built automation workflows to improve efficiency and scalability
- Ensure high data quality through validation checks, normalization processes, and cross-source consistency verification
- Handle large-scale scraping operations using batching, parallelization, and performance optimization techniques
- Monitor scraping stability and adapt quickly to changes in website structure or anti-bot mechanisms
- Deliver clean, structured datasets in formats such as CSV, JSON, or spreadsheet-based outputs
- Apply systematic quality control processes before final data delivery