Register New Data¶
After setting up your new sales organization, you may want to protect data for other areas of your organization, such as for Marketing or Human Resources. In this section of the tutorial, we demonstrate how to add and register new data in Okera so that you can grant access and protect it.
What Are Crawlers?¶
A crawler automatically connects to your data source and locates and pulls any discovered datasets (tables) into Okera.
Create a Database¶
To create a new database for the new data, complete these steps:
Log into Okera as admin.
Navigate to the Data page and select Create new database. The *Create new database dialog appears.
Specify hr in the Database name box and add a description.
Select . The Database created dialog appears.
Select . The Databases page appears for the
Create a Crawler¶
To create a crawler to crawl your data and pull datasets (tables) into Okera, complete these steps:
1.Select Go to Registration to begin the crawler creation process and register new datasets in the database. The Registration page appears.
Select Create crawler to open to the Create crawler dialog.
Leave the Source connection set to Object storage on the dialog.
s3://okera-lake/hras the S3 bucket link where the HR data is stored in the Source path box.
hr_crawlerin the Crawler name box. The dialog should look like this:
Select . You are returned to the Registration page. The status of the crawl is provided in the Status column.
After the Status of the crawler changes to “Complete,” you must register the data in Okera.
hr_crawlercrawler to view any unregistered data that the crawler discovered. One unregistered dataset was discovered from the crawler path we specified.
Select on the
hrdataset row. The Register selected datasets dialog appears.
Make sure Existing database is selected on the dialog and then select the
hrdatabase from the dropdown menu.
Select Data on the left-hand menu and then select the
hrdatabase. You will see the
hrdatabase now has the
hrdataset in it. You can now grant access to your Human Resources team and create access conditions to protect your data.