Apply

Data and Machine Learning Intern

Posted 23 days agoViewed

View full description

📍 Location: Colombia

🔍 Industry: Software Development

🏢 Company: Loka, Inc

🗣️ Languages: English

🪄 Skills: AWSPythonSQLApache AirflowETLMachine LearningNumpyAlgorithmsData engineeringData StructuresPandasSparkTensorflowProblem SolvingData visualizationData modelingData analytics

Requirements:
  • Last year of a bachelor’s degree in Computer Science or related
  • Proficient in English
  • Basic knowledge of Python, ML, and Data libraries
  • Basic knowledge of Databases
  • Understanding of statistical, ML ,and deep learning algorithms
  • Experience visualizing and manipulating big datasets
  • Problem solving
  • Bonus: AWS knowledge, (Py)Spark, Airflow, Data Lakes and Data Warehouses
Responsibilities:
  • Assist in designing, developing and maintaining data pipelines to ensure clean, reliable and timely data.
  • Collaborate with the team to implement and optimize ETL processes.
  • Integrate data from various sources into warehouses, data lakes and lakehouses.
  • Support data management tasks, including data cleaning, validation and transformation.
  • Understand business objectives and develop models that help achieve them, plus metrics to track their progress.
  • Implement ML systems using classical ML, DL and Foundation Models following best practices.
  • Participate in client communications by helping gather requirements and communicate deliverables.
  • Explore and visualize data with a careful eye for issues that require data cleaning as well as differences in data distribution that may affect performance after deployment.
  • Identify and analyze model errors.
Apply

Related Jobs

Apply

📍 Colombia

🧭 Internship

🔍 Software Development

🏢 Company: Loka® Inc

  • Last year of a bachelor’s degree in Computer Science or related
  • Proficient in English
  • Basic knowledge of Python, ML, and Data libraries
  • Basic knowledge of Databases
  • Understanding of statistical, ML ,and deep learning algorithms
  • Experience visualizing and manipulating big datasets
  • Problem solving
  • Assist in designing, developing and maintaining data pipelines to ensure clean, reliable and timely data.
  • Collaborate with the team to implement and optimize ETL processes.
  • Integrate data from various sources into warehouses, data lakes and lakehouses.
  • Support data management tasks, including data cleaning, validation and transformation.
  • Understand business objectives and develop models that help achieve them, plus metrics to track their progress.
  • Implement ML systems using classical ML, DL and Foundation Models following best practices.
  • Participate in client communications by helping gather requirements and communicate deliverables.
  • Explore and visualize data with a careful eye for issues that require data cleaning as well as differences in data distribution that may affect performance after deployment.
  • Identify and analyze model errors.

AWSPythonSQLData AnalysisData MiningETLMachine LearningNumpyPyTorchAirflowAlgorithmsData engineeringData StructuresPandasSparkTensorflowRESTful APIsData visualizationData modeling

Posted 24 days ago
Apply