Published in AWS Tip · Nov 7, 2022 · Snowflake · 11 min read
Snowflake Setting Up RBAC — Sample Script
Databases are repositories of sensitive, confidential, and valuable business data. Databases contain the gold of every organization. Successful businesses must not only supply accurate and timely data but also safeguard it. Security provides a critical competitive edge for any high-functioning database. …
Oct 29, 2022 · Snowflake · 5 min read
Snowflake Massive Scale Testing — Part 2
This is part 2 of the Snowflake Massive Scale Testing project I wrote about a month back, and I am eager for you to check it out. The first part of this work is in the link below: Snowflake Massive Scale Testing — Part 1 (johnthuma.medium.com). WARNING… READ AT YOUR OWN RISK! USE AT YOUR OWN RISK!! A quick background: A customer of mine, while I was working at Snowflake, wanted…
Published in DataSeries · Oct 6, 2022 · 3 min read
Snowflake Provision User — Script
A couple of weeks ago, a customer of mine wanted to know how to auto-provision a user from their SCIM/SSO integration. What they wanted to do was auto-provision a sandbox schema for the user. I had already shared my Sandbox script with the user, which is located here: …
Published in AWS Tip · Oct 5, 2022 · 4 min read
Snowflake or Databricks
Should I go with Snowflake or Databricks? I hear this almost every day. It is a very important question. It is like the truth between any two foes: There is one version of the truth from one perspective, there is another version from the opposing perspective; then there is the…
Published in AWS Tip · Sep 29, 2022 · 4 min read
What is SnowPark?
SnowPark is a new feature Snowflake recently announced at Snowflake Summit. It enables data engineers, data scientists, and developers to use Python, Scala, and Java to build processes that can be pushed down into the Snowflake environment. …
Sep 28, 2022 · 5 min read
Snowflake Streams & Tasks — Script
Data Pipelining and Change Data Capture: In databases, change data capture (CDC) is a set of software design patterns used to determine and track the data that has changed so that action can be taken using the changed data. CDC is an approach to data integration that is based on…
Sep 28, 2022 · 5 min read
Snowflake Data Classification — Script
Classification enables answering questions about the data stored in tables and views, such as: Does the table/view contain PII (Personally Identifiable Information) or sensitive data? Where is the data stored and how long has it been stored? How can the data be protected from exposure while still deriving insights? The…
Published in Towards Dev · Sep 27, 2022 · Snowflake · 8 min read
Snowflake Data Masking — Script
Ever need to hide something? Work for a government? Need to redact things like Social Security Numbers, birthdays, addresses, or any other kind of personal data? Snowflake has a feature called Dynamic Data Masking that will allow you to do just that! But it will also allow you to let…
Published in AWS Tip · Sep 27, 2022 · Snowflake · 3 min read
Snowflake Unistore and Hybrid Tables — Script
Note: This feature is in Private Preview. What is Unistore? Probably the biggest announcement to come out of Snowflake Summit this year! That is what Unistore is! I am excited, and this feature will change the way customers use Snowflake. Don’t worry, you don’t have to change what you already have. The Unistore/Hybrid tables can be used…
Published in AWS Tip · Sep 27, 2022 · Snowflake · 3 min read
Snowflake Forecasting — Script
Hurricane Ian is on its way! What a great time to discuss forecasts! In Snowflake, you can produce a forecast for a univariate time series by using the FORECAST function. The FORECAST function is a shared user-defined table function (UDTF) that you access by installing a Snowflake Native Application. When you call…