Object Storage Support

Okera supports the following cloud object storage types.

  • AWS S3
  • Azure: ADLS Gen2
  • Google Cloud Storage
  • HDFS

Note: Okera must already have read access to your files, as configured in your cluster. You cannot create a crawler on a path that Okera cannot read.

Supported File Formats

Okera supports registering data in these file formats:

  • Avro
  • CSV
  • JSON
  • Parquet
  • ORC
  • TEXT
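
As an illustration only, the list above can be used to pre-check file paths before registering them. The sketch below is not part of Okera and does not reflect how Okera detects formats; the function name guess_format and the extension-to-format mapping (for example, .txt for TEXT) are assumptions made for this example.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Assumption: this extension mapping is illustrative only; it is not
# Okera's own format-detection logic.
SUPPORTED_FORMATS = {
    ".avro": "Avro",
    ".csv": "CSV",
    ".json": "JSON",
    ".parquet": "Parquet",
    ".orc": "ORC",
    ".txt": "TEXT",
}


def guess_format(uri):
    """Return the supported format implied by the file extension, or None."""
    # Take the path component of the URI and compare its lowercased
    # extension against the mapping above.
    path = urlparse(uri).path
    suffix = PurePosixPath(path).suffix.lower()
    return SUPPORTED_FORMATS.get(suffix)


if __name__ == "__main__":
    # Hypothetical paths used purely for demonstration.
    for uri in (
        "s3://example-bucket/sales/part-0000.parquet",
        "s3://example-bucket/img/logo.png",
    ):
        print(uri, guess_format(uri))
```

A path whose extension is not in the mapping returns None, which a caller could treat as a prompt to confirm the file format by other means before registering the data.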

Register Data From Object Storage

See Crawlers to learn how to register data from object storage.

Amazon S3 Bucket Role Mapping Support

See Amazon S3 Bucket Role Mapping Support to learn how to assume secondary roles to read S3 data, with different roles for different buckets.