Iceberg
Instructions for installation and configuration of the Iceberg Agent
Last updated
Instructions for installation and configuration of the Iceberg Agent
Last updated
Note: We currently only support Iceberg for our AWS Customers
We recommend that you follow the Installation instructions from the UI. This can be found on app.lariatdata.com by clicking on the Integrations tab and going to "Add new integration" as outlined in the page.
To describe what the install entails, we outline how you could do this outside of the UI. If you are interested in the code that powers the install, please take a look here
docker
Your Cloud Account ID and region
Your own Cloud account keys
Note: If you are running the install outside of the UI, you will need your Lariat API Key and Lariat Application Key and Lariat generated Cloud access keys (e.g. AWS key and secret key)
If using the UI, copy the installation command and fill in the unpopulated fields.
Here is what the command looks like:
If you do not have access to the UI and need LARIAT_TMP_AWS_ACCESS_KEY_ID and LARIAT_TMP_AWS_SECRET_ACCESS_KEY you will have to reach out to support@lariatdata.com.
In order to configure tracked tables outlined in the configuration, go to the datasets section in the homepage and click on the ellipses next to the newly tracked Iceberg Datasets. Select "Configure" and move through the menu to rapidly set up indicators. You can also click on "Indicators" from the sidebar and select "Create an Indicator" to create fully customized indicators. That is covered in Working with Datasets and Indicators - This page summarizes the details of the Rapid Set Up path. The rapid path to setting up indicators should take no longer than 5 minutes. If you need to batch set up indicators for 100s of datasets, we have a beta feature that does. Just reach out to support@lariatdata.com and we will activate it for you.
Select a timestamp field that either has an ISO formatted date or is a unix timestamp. You can also use duckdb sql to define a computation that creates a unix timestamp from existing fields. We encourage you to use partition fields as the timestamp field.
You can also select dimensions, these are columns that you can later break metrics out by as seen below. You can also use these dimensions to filter health metrics for a more granular view.
You can then select the frequency with which Metrics should be collected.
Finally, you can rapidly select the columns and type of health metrics prior to complete metric definition. You will start seeing indicators populate based on the backfill schedule or general schedule you select. Note: Only select summary stats for numeric fields.
Reach out to support@lariatdata.com if you have any further questions!