Object Storage Support¶
Okera supports the following cloud object storage types.
- AWS S3
- Azure: ADLS Gen2
- Google Cloud Storage
- HDMS
Note: Make sure Okera already has read access to your files, as configured in your cluster. You cannot create a crawler on a path for which Okera does not have read access.
Supported File Formats¶
Okera supports registering data in these file formats:
- Avro
- CSV
- JSON
- Parquet
- ORC
- TEXT
Register Data From Object Storage¶
See Crawlers to learn how to register data from object storage.
Amazon S3 Bucket Role Mapping Support¶
See Amazon S3 Bucket Role Mapping Support to learn how to assume secondary roles to read S3 data, with different roles for different buckets.