DevConf.IN 2025

Puja Thacker

I am an IT professional with 12 years of diverse experience in data engineering, platform engineering, and DevOps. My focus has been on building and supporting scalable data pipelines and platforms while exploring cloud-native solutions and CI/CD processes through hands-on involvement.

With a foundation in data and platform engineering, I bring a collaborative mindset and a problem-solving approach to every project. I am keen to learn and grow, leveraging my skills to contribute to impactful solutions and drive value for teams and organizations.


Company or affiliation

Redhat

Job title

Principal Data Engineer


Session

02-28
15:15
35min
From Fragmented to Unified: Transforming Management and Governance of Data & AI assets
Avinash Singh, Puja Thacker

In today’s data-driven landscape, organizations in finance and healthcare face the dual challenges of managing sensitive data—such as revenue and patient information—and dealing with data fragmentation across different systems and formats. Protecting sensitive data from unauthorized access while ensuring its accessibility for operations is crucial, as is integrating fragmented data for effective governance. This session will explore how the OSS Unity Catalog addresses these issues by centralizing data management, enforcing fine-grained access controls, and ensuring robust governance. Attendees will learn how Unity Catalog’s multimodal interface and support for various data formats and engines enable secure, efficient data sharing across tables, files, functions, and AI models. Demo will include the integration of Apache Iceberg, Apache Spark, and x-table, all deployed on Kubernetes to showcase a scalable data management solution.

AI, Data Science, and Emerging Tech
Raigad Room (Chanakya Building / School of Business)