Title: Lead Data Engineer

Date: 28 Jan 2026

Location:

KA, IN

Job Description

We are a technology-led healthcare solutions provider. We are driven by our purpose to enable healthcare organizations to be future-ready. We offer accelerated, global growth opportunities for talent that’s bold, industrious, and nimble. With Indegene, you gain a unique career experience that celebrates entrepreneurship and is guided by passion, innovation, collaboration, and empathy. To explore exciting opportunities at the convergence of healthcare and technology, check out www.careers.indegene.com Looking to jump-start your career? We understand how important the first few years of your career are, which create the foundation of your entire professional journey. At Indegene, we promise you a differentiated career experience. You will not only work at the exciting intersection of healthcare and technology but also will be mentored by some of the most brilliant minds in the industry. We are offering a global fast-track career where you can grow along with Indegene’s high-speed growth. We are purpose-driven. We enable healthcare organizations to be future ready and our customer obsession is our driving force. We ensure that our customers achieve what they truly want. We are bold in our actions, nimble in our decision-making, and industrious in the way we work.

Role: Lead Databricks Engineer

Description: We are seeking a highly skilled Senior Databricks Lead with 5+ years of experience in data engineering and architecture. This role is ideal for an expert who brings deep hands-on experience with Databricks, Apache Spark, and cloud platforms (AWS, Azure, or GCP), along with a strong foundation in designing and implementing scalable, secure, and high-performing data solutions. You will be responsible for driving the design and implementation of modern data architectures including data lakes, data warehouses, and streaming systems, while working closely with cross-functional teams to ensure alignment with business goals. This is a strategic technical role with a strong focus on execution and continuous improvement.

Must Have

Responsibilities
•    Data Architecture & Design -Design robust, scalable, and high-performing data architecture using Databricks and Apache Spark.
•    Define and implement data lake, data warehouse, and real-time streaming solutions. Develop architecture blueprints and documentation to support technical and business stakeholders.
•    Configure, monitor, and optimize Databricks clusters for performance and cost-efficiency. Apply best practices in cluster sizing, autoscaling, and performance tuning.
•    Lead, design and develop complex ETL/ELT pipelines for batch and real-time data ingestion using Databricks. Implement modular and reusable components for scalable data processing workflows.
•    Integrate Databricks with cloud-native storage solutions (e.g., AWS S3, Azure Data Lake, GCS) and other data services. Support hybrid and multi-cloud deployments were required.
•    Ensure compliance with security standards, IAM policies, encryption, and regulatory requirements.
•    Mentor junior engineers and guide them on best practices in Databricks and big data engineering.
•    Evaluate and recommend new tools, frameworks, and best practices in the Databricks ecosystem.
•    Stay up to date with the latest in Delta Lake, Databricks SQL, MLflow, and open-source Spark enhancements.

About You (Desired Profile)
•    5+ years of experience in data engineering, big data, or cloud data architecture roles. Strong expertise in Databricks, Apache Spark, and Delta Lake.
•    Proven experience designing and delivering large-scale data lakes, warehouses, and streaming platforms.
•    In-depth experience with cloud platforms (AWS, Azure, or GCP), including native services and integration patterns.
•    Strong coding skills in Python, Scala, or SQL.
•    Solid understanding of data governance, security, IAM, and regulatory compliance in cloud environments. 
•    Experience with tools like Airflow, dbt, or CI/CD pipelines for data.
•    Excellent problem-solving, documentation, and communication skills.
•    Bachelor’s degree in computer science, Data Engineering, or related field (Master’s preferred).
•    Experience working in the pharmaceutical domain is a plus.
•    Certifications in Databricks, AWS, or Azure (or actively working toward them).
•    Exposure to Unity Catalog, Delta Lake, or Databricks SQL.
•    Knowledge of data governance, IAM, or cloud security practices.

Preferred Qualifications
•    Certifications in Databricks (e.g., Databricks Certified Data Engineer or Architect).
•    Certifications in AWS, Azure, or GCP.
•    Experience working in regulated industries (e.g., pharmaceuticals, finance).
•    Exposure to ML Ops tools and machine learning pipelines within Databricks.

Good to have

EQUAL OPPORTUNITY

Indegene is proud to be an Equal Employment Employer and is committed to the culture of Inclusion and Diversity. We do not discriminate on the basis of race, religion, sex, colour, age, national origin, pregnancy, sexual orientation, physical ability, or any other characteristics. All employment decisions, from hiring to separation, will be based on business requirements, the candidate’s merit and qualification. We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, national origin, gender identity, sexual orientation, disability status, protected veteran status, or any other characteristics.

Apply now »