Full-time

Lead Data Engineer

Posted by Capgemini • Bogotá, Bogotá, Distrito Capital, Colombia

📍 Bogotá, Bogotá, Distrito Capital 🕒 June 06, 2026

About the Role

Job Description

Your Role:

  • Design, build, and maintain data pipelines and ETL processes using Databricks and Apache Spark.
  • Optimize data workflows for performance, scalability, and cost efficiency.
  • Implement a data lakehouse architecture and manage data ingestion from multiple sources.
  • Collaborate with data scientists and analysts to enable advanced analytics and machine learning workloads.
  • Ensure data quality, governance, and security across all data assets.
  • Monitor and troubleshoot Databricks clusters, jobs, and workflows.
  • Integrate Databricks with cloud services (AWS, Azure, or GCP) and other enterprise systems.
  • Document processes, standards, and best practices for data engineering.

Your Profile:

  • Hands-on experience with Databricks, Apache Spark, and PySpark.
  • Strong knowledge of SQL, Python, and data modeling principles.
  • Experience w...

Ready to Apply?

Submit your application today and take the next step in your career journey with Capgemini.
