Object Storage

This document outlines Okera's support for object storage.

Supported Cloud Object Storage

  • AWS S3
  • Azure Data Lake Storage (ADLS) Gen1 and Gen2
  • Google Cloud Storage


Okera must already have read access to your files, as configured in your cluster. You cannot create a crawler on a path that Okera does not have read access to.

Supported file formats

Okera supports registering data from these file formats:

  • Avro
  • CSV
  • JSON
  • Parquet
  • ORC
  • TEXT
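
As a rough pre-check before registering data, you may want to confirm that the objects under a path look like one of the formats listed above. The following is a minimal sketch, not part of Okera: the function name guess_format and the extension-to-format mapping are assumptions made for illustration, and file extensions are only a hint about the actual file contents.

```python
from pathlib import PurePosixPath
from typing import Optional

# Hypothetical mapping from common file extensions to the formats listed
# above. This is an assumption for illustration; Okera does not define it.
EXTENSION_TO_FORMAT = {
    ".avro": "Avro",
    ".csv": "CSV",
    ".json": "JSON",
    ".parquet": "Parquet",
    ".orc": "ORC",
    ".txt": "TEXT",
}


def guess_format(object_key: str) -> Optional[str]:
    """Return the likely format for an object key, or None if the
    extension is not in the hypothetical mapping above."""
    # Object keys use "/" as a separator, so PurePosixPath parses them
    # the same way on every operating system.
    suffix = PurePosixPath(object_key).suffix.lower()
    return EXTENSION_TO_FORMAT.get(suffix)
```

For example, guess_format("sales/part-0000.parquet") returns "Parquet", while a key such as "docs/readme.md" returns None and is worth a closer look before registering.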

Registering data from object storage

See Crawlers for instructions on registering data from object storage.