Data Governance Meetups

Experiential discussions on engineering for security, compliance and privacy

Engineering for compliance and data governance is a challenge for all industries and verticals. Whether it is data localization, compliance required to connect with public digital platforms, GDPR, CCPA or the impending Personal Data Protection (PDP) Bill in India, companies have to engineer solutions for:

  1. Data acquisition
  2. Data storage
  3. Re-engineering data infrastructure for privacy
  4. Data anonymisation
  5. Data security
  6. Privacy, data and security audits

Data governance meetups are peer-to-peer learning sessions where practitioners share implementation experiences and insights with participants. Participants are welcome to submit topics and presentation ideas here.

Sessions are held monthly. Thus far, we have held meetings in July, August and September, covering GDPR compliance, frameworks for automating data governance, and building resources that can help us, as the tech industry, to establish standards for upcoming data governance laws. Video recordings are published on https://hasgeek.com/fifthelephant/data-governance-meetups/videos

The October meeting will be held on Saturday, 10 October. Atif Akhtar of ThoughtWorks will speak about Data Catalogs and how to do them right. Mayur Ralekar of Skizzle will talk about end-to-end encryption as applied to email attachments.

Participation in the monthly meetings is via Zoom, for registered participants. Alternatively, you can watch the YouTube livestream on this page.

About the curators: This meetup series is curated by The Fifth Elephant alumnus, Rajat Venkatesh, with the active involvement of Devangana Khokhar and Shadab Siddiqui.

For queries, contact 7676332020 or email fifthelephant.editorial@hasgeek.com

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conferences in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices.

Rajat Venkatesh

@vrajatblr

Data Governance 101

Submitted Jun 25, 2020

Data compliance, privacy and security are hard because:

  • There is too much data.
  • There is too much complexity.
  • There is no context for data usage.

Automation is the only hope.

This talk introduces the first steps to automate data governance tasks to answer:

  • Where is my data?
  • Who has access to the data?
  • How is the data used?

We will discuss data governance automation examples from past work for AWS Redshift, Snowflake and MySQL.
The information from these tasks will set the foundation for an effective strategy for compliance, privacy and security.
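
As a rough illustration of the "Where is my data?" task, the sketch below flags columns whose names suggest personal data. It is not taken from the talk or from the speaker's tools: the patterns, the `find_pii_columns` helper and the sample catalog are all hypothetical. In a real warehouse the table and column names would typically be read from the database's own metadata (for example `information_schema.columns`), and name matching alone would miss data stored under unhelpful column names.

```python
import re

# Hypothetical column-name patterns; a real deployment would tune these
# and would usually also sample the data values themselves.
PII_PATTERNS = {
    "email": re.compile(r"e[-_]?mail", re.IGNORECASE),
    "phone": re.compile(r"phone|mobile", re.IGNORECASE),
    "name": re.compile(r"(first|last|full)[-_]?name", re.IGNORECASE),
}


def find_pii_columns(catalog):
    """Return (table, column, pii_type) for each column whose name matches a pattern.

    `catalog` maps a table name to a list of its column names.
    Only the first matching pattern is reported for each column.
    """
    findings = []
    for table, columns in catalog.items():
        for column in columns:
            for pii_type, pattern in PII_PATTERNS.items():
                if pattern.search(column):
                    findings.append((table, column, pii_type))
                    break
    return findings


# Illustrative catalog standing in for metadata read from a warehouse.
catalog = {
    "users": ["id", "full_name", "email_address", "created_at"],
    "orders": ["id", "user_id", "amount", "contact_phone"],
}

for table, column, pii_type in find_pii_columns(catalog):
    print(f"{table}.{column}: possible {pii_type}")
```

A list like this is only a starting point: it gives the inventory against which the access and usage questions can then be asked.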

Outline

  • What is Data Governance?
  • Why is Data Governance hard?
    • There is too much data.
    • There is too much complexity.
    • There is no context for data usage.
  • Automation examples to ease data governance.
    • Where is my data?
    • Who has access to data?
    • How is the data used?
  • Conclusion

Requirements

Data Engineering

Speaker bio

Rajat Venkatesh has experience in building data warehouses and data lakes used by the largest companies in the world. He has helped data-driven companies adopt data governance processes to meet their security and privacy goals. He created a set of open source data governance tools (https://tokern.io/) to help other data teams with similar challenges.

Slides

https://speakerdeck.com/vrajat/data-governance-101

