MySQL :: HeatWave User Guide :: 4.6.4.2 Expor Query Results to Parquet Files with JSON Syntax

Before You Begin

Review the following:
- Overview of Exporting Query Results
- Requirements to Export Query Results

JSON Syntax

Use the following SELECT INTO JSON syntax to export the query results into a Parquet file.

SELECT ... INTO OUTFILE WITH PARAMETERS
file_JSON_array: {
  begin-array file_def [, file_def] … end-array
}    

file_def: file_RP_JSON | file_PAR_JSON | file_URI_JSON

file_def_RP_JSON: {
  "bucket": "bucket_name",
  "namespace": "namespace_name",
  "region": "region_name",
  {"prefix": "prefix_value" | "name": "filename" | "pattern" : "pattern_value"},
}

file_PAR_JSON: {
  "par": "PAR URL",
  ["prefix": "prefix_value", | "name": "filename", | "pattern" : "pattern_value",]
}

file_URI_JSON: {
  ("uri": "OCIFS_URI") | ("uri": "PAR_URI") | ("uri": "NATIVE_URI"),
}

dialect_JSON_object: begin-object dialect_option_key_value [, dialect_option_key_value]... end-object
 
dialect_option_key_value: {
    "format": "parquet" 
  | "compression": {"uncompressed"|"snappy"|"gzip"|"brotli"|"zstd"|"lz4"}
}

Parameter Descriptions for URI

Supported as of MySQL 9.3.1. The following parameters are included in the JSON syntax to export the query results to the Object Storage location represented as a URI.

URI: The Object Storage location is represented as a URI link. The URI must be a prefix URI and end in an unencoded / character.

You can specify the following types of URIs.

OCIFS Syntax:

oci://bucket_name@namespace_name/object_path

Example:

oci://bucket_1@tenant_1/data_files/

PAR URI Syntax. You can use one of the following formats:

https://namespace_name.objectstorage.region_name.oci.customer-oci.com/p/PAR_token/n/namespace_name/b/bucket_name/o/object_path

https://objectstorage.region_name.oraclecloud.com/p/PAR_token/n/namespace_name/b/bucket_name/o/object_path

Example:

https://tenant_1.objectstorage.us-ashburn-1.oci.customer-oci.com/p/.../n/tenant_1/b/bucket_1/o/data_files/

Native URI Syntax. You can use one of the following formats:

https://namespace_name.objectstorage.region_name.oci.customer-oci.com/n/namespace_name/b/bucket_name/o/object_path

https://objectstorage.region_name.oraclecloud.com/n/namespace_name/b/bucket_name/o/object_path

Example:

https://tenant_1.objectstorage.us-ashburn-1.oci.customer-oci.com/n/tenant_1/b/bucket_1/o/data_files/

dialect: The dialect parameters assist you to organize the data in the exported CSV file.

Parameter Descriptions for PAR URL

The following parameters are included in the JSON syntax to export the query results to the Object Storage location represented as a PAR URL.

URL: You can use a PAR URL to access Object Storage. Learn more about Pre-Authenticated Request Requirements to export query results. To learn more about PARs, see Object Storage Pre-Authenticated Requests from Oracle Cloud Infrastructure Documentation.

For security reasons, the PAR URL is hidden in the log file. For more information and samples, see Redacted PAR URLs in Exported Log Files.
format: The file format to export the query results. Export the query results as a CSV, Parquet, or JSON (As of MySQL 9.3.1) file. By default, the results are exported to a CSV file.
compression: The file compression type for CSV format. The supported compression types are uncompressed, snappy, gzip, brotli, zstd and lz4 (as supported by the Lakehouse load).

Parameter Descriptions for Resource Principal

The following parameters are included in the JSON syntax to export the query results to the Object Storage location based on the assigned resource principal.

format: The file format to export the query results. Export the query results as a CSV, Parquet, or JSON (As of MySQL 9.3.1) file. By default, the results are exported to a CSV file.
bucket: The bucket name.
prefix: The directory located within the bucket to store the export query results. This is an optional parameter.
namespace: The tenancy namespace.
region: The region that the tenancy is in.
compression: he file compression type for CSV format. The supported compression types are uncompressed, snappy, gzip, brotli, zstd and lz4 (as supported by the Lakehouse load).

Example for URI

The following example uses a specified OCIFS URI to export the query results as a Parquet file.

SELECT sum(l_quantity) from lineitem limit 1000
INTO OUTFILE WITH PARAMETERS
'{"file":[{
     "uri":"oci://bucket_1@tenancy1/data_files_1/"}], 
     "dialect": 
    {"format": "parquet"
    }
}';

Example for PAR

The following example uses a specified PAR URL to export the query results as a Parquet file.

SELECT sum(l_quantity) from lineitem limit 1000
INTO OUTFILE WITH PARAMETERS
'{"file":[{
     "par":"https://objectstorage.region1.oraclecloud.com/p/PTn9XsVegDBbSz1v-FgDma4G_e_78fJjv1GK04zlSqD9975MPZydieUJAGUypxQb/n/tenancy1/b/usr-bucket/o/", "prefix":"summary/2025"}], 
     "dialect": 
    {"format": "parquet"
    }
}';

Examples for Resource Principal

The following example uses resource principals to export the query results as a parquet file. This sample uses the default values, dialect parameters are not required.

mysql>SELECT sum(l_quantity) from lineitem limit 1000
INTO OUTFILE WITH PARAMETERS
'{"file":[{"region": region1, 
           "namespace": org_namespace, 
           "bucket": usr-bucket,
           "prefix": export/summary/
           "dialect": {"format": "parquet"}
  
         }]
}';

The following example sets the compression to snappy.

mysql> SELECT sum(l_quantity) from lineitem limit 1000
INTO OUTFILE WITH PARAMETERS
'{"file":[{"region": "region1", 
           "namespace": "org_namespace", 
           "bucket": "usr-bucket",
           "prefix": "prefix1/prefix2/"
         }]
           "dialect": {"compression": "snappy", "format": "parquet"}
}';

What's Next

Review the following:
- Redacted PAR URLs in Exported Log Files
- File Name Format for Exported Query Result Files