Data Governance Meetups

Experiential discussions on engineering for security, compliance and privacy

Rajat Venkatesh

@vrajatblr

Data Governance 101

Submitted Jun 25, 2020

Data compliance, privacy and security are hard because:

  • There is too much data
  • There is too much complexity
  • There is no context for data usage

Automation is the only hope.

This talk introduces the first steps toward automating data governance tasks that answer three questions:

  • Where is my data?
  • Who has access to the data?
  • How is the data used?

We will discuss data governance automation examples from past work for AWS Redshift, Snowflake and MySQL.
The information from these tasks will set the foundation for an effective strategy for compliance, privacy and security.

Outline

  • What is Data Governance?
  • Why is Data Governance hard?
    • There is too much data.
    • There is too much complexity.
    • There is no context for data usage.
  • Automation examples to ease data governance.
    • Where is my data?
    • Who has access to data?
    • How is the data used?
  • Conclusion

Requirements

Data Engineering

Speaker bio

Rajat Venkatesh has experience building data warehouses and data lakes used by the largest companies in the world. He has helped data-driven companies adopt data governance processes to meet their security and privacy goals. He created a set of open source data governance tools (https://tokern.io/) to help other data teams with similar challenges.

Slides

https://speakerdeck.com/vrajat/data-governance-101

Hosted by

Jump starting better data engineering and AI futures