Object Storage
This document outlines Okera's support for object storage.
Supported Cloud Object Storage
- AWS S3
- Azure: ADLS Gen1 and Gen2
- Google Cloud Storage
Note
Before you create a crawler, make sure Okera already has read access to your files, as configured in your cluster. You cannot create a crawler on a path that Okera does not have read access to.
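Before pointing a crawler at a path, it can help to confirm that the path belongs to one of the storage services listed above. The following is a minimal sketch, not part of Okera: the function name `storage_for_path` and the scheme table are hypothetical, and the schemes are the common Hadoop-style ones (`s3`, `s3a`, `adl`, `abfs`, `abfss`, `gs`), which are assumed here rather than taken from this document. It does not check read access; it only classifies the URI.

```python
from urllib.parse import urlparse

# Assumed mapping from URI scheme to storage service. These are common
# Hadoop-style schemes, not a list confirmed by the Okera documentation.
SCHEME_TO_STORE = {
    "s3": "AWS S3",
    "s3a": "AWS S3",
    "adl": "Azure ADLS Gen1",
    "abfs": "Azure ADLS Gen2",
    "abfss": "Azure ADLS Gen2",
    "gs": "Google Cloud Storage",
}


def storage_for_path(path):
    """Return the storage service for a path, or None if the scheme is unrecognized."""
    scheme = urlparse(path).scheme.lower()
    return SCHEME_TO_STORE.get(scheme)


if __name__ == "__main__":
    # Hypothetical example paths.
    for candidate in ("s3://example-bucket/sales/", "hdfs://namenode/data/"):
        print(candidate, "->", storage_for_path(candidate))
```

A path that maps to `None` here is likely outside the supported services, so it is worth checking before you set up a crawler for it.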
Supported file formats
Okera supports registering data from these file formats:
- Avro
- CSV
- JSON
- Parquet
- ORC (beta)
- TEXT
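To take stock of what a bucket contains before registering it, you could group object keys by the formats listed above. This is a rough sketch under stated assumptions: the helper `guess_format` and the extension table are hypothetical, and Okera's own format detection may work differently (for example, it may not rely on file extensions at all).

```python
from pathlib import PurePosixPath

# Assumed extension-to-format table for illustration only. The format names
# come from the list above; the extensions are conventional guesses.
EXTENSION_TO_FORMAT = {
    ".avro": "Avro",
    ".csv": "CSV",
    ".json": "JSON",
    ".parquet": "Parquet",
    ".orc": "ORC (beta)",
    ".txt": "TEXT",
}


def guess_format(object_key):
    """Guess a supported format from an object key's extension, or return None."""
    suffix = PurePosixPath(object_key).suffix.lower()
    return EXTENSION_TO_FORMAT.get(suffix)


if __name__ == "__main__":
    # Hypothetical object keys.
    for key in ("sales/2020/part-0001.parquet", "logs/raw"):
        print(key, "->", guess_format(key))
```

Keys with no extension, or with an extension outside the table (such as a compressed `.gz` file), return `None` and need to be inspected by hand.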
Registering data from object storage
See Crawlers for details on how to register data from object storage.