Title: Data Engineer
KA, IN
Job Description
We are a technology-led healthcare solutions provider. We are driven by our purpose to enable healthcare organizations to be future-ready. We offer accelerated, global growth opportunities for talent that’s bold, industrious, and nimble. With Indegene, you gain a unique career experience that celebrates entrepreneurship and is guided by passion, innovation, collaboration, and empathy. To explore exciting opportunities at the convergence of healthcare and technology, check out www.careers.indegene.com.
Looking to jump-start your career? We understand how important the first few years of your career are, as they create the foundation of your entire professional journey. At Indegene, we promise you a differentiated career experience. You will not only work at the exciting intersection of healthcare and technology but will also be mentored by some of the most brilliant minds in the industry. We are offering a global fast-track career where you can grow along with Indegene’s high-speed growth.
We are purpose-driven. We enable healthcare organizations to be future-ready, and our customer obsession is our driving force. We ensure that our customers achieve what they truly want. We are bold in our actions, nimble in our decision-making, and industrious in the way we work.
Role: Data Engineer
Description:
- Design, develop, and maintain scalable, robust data pipelines using AWS Glue, with a focus on multi-tiered architecture to meet business needs.
- Implement ETL workflows and data processing systems that integrate data from multiple sources into data lakes and data warehouses such as Redshift.
- Ensure efficient data movement and transformation through optimized ETL jobs leveraging AWS tools.
- Write analytical and complex SQL queries proficiently on Snowflake.
- Ensure optimal performance and cost-effectiveness of cloud services by monitoring and tuning infrastructure as necessary.
- Collaborate with the cloud architecture team to design and implement security protocols, including IAM policies and SSO configurations.
- Develop and maintain CI/CD pipelines and automate the deployment of data pipelines, ETL processes, and infrastructure updates.
- Implement data quality checks and monitoring processes across data pipelines.
- Ensure data security, accuracy, and integrity by using best practices for data validation, deduplication, and transformation.
- Collaborate closely with Business Analysts, Data Scientists, and Reporting Teams to understand business requirements and deliver technical solutions that meet their needs.
Must Have
Required Tech Stack
- Cloud Technologies: AWS (S3, Lambda, Glue, Redshift, IAM, EC2)
- Orchestration: Apache Airflow
- Data Warehouse: Snowflake, Redshift
- ETL Tools: SQL procedures, DBT (optional)
- Programming Language: Python
Desired Profile / Skills
• Bachelor's degree in Computer Science, Information Systems, or a related field.
• 1-3 years of data engineering experience on AWS Cloud, with strong expertise in creating data pipelines using AWS Glue.
• Strong technical background with experience in data engineering, ETL frameworks, and programming languages (Python/PySpark).
• Proficiency with databases such as Redshift and Athena.
• Good to have: experience with ETL tools (Dataiku, Informatica, Databricks, etc.) and SQL (Snowflake).
• Strong experience with digital data management in the pharma domain.
• Familiarity with reporting and visualization tools (Tableau, Power BI, etc.)
• Strong analytical skills and a problem-solving mindset.
• Experience in managing SLAs and ensuring consistent delivery within deadlines.
• Experience working in the pharmaceutical or healthcare industry.
• Experience with Agile and DevOps methodologies for managing data projects.
Good to have
EQUAL OPPORTUNITY