Azure Synapse Serverless. HashBytes: The query references an object that is not supported in distributed processing mode

azure-synapse

I am receiving the error "The query references an object that is not supported in distributed processing mode" when using the HASHBYTES() function to hash rows in Synapse Serverless SQL Pool.

The end goal is to parse the json and store it as parquet along with a hash of the json document. The hash will be used in future imports of new snapshots to identify differentials.

Here is a sample query that produces the error:

SELECT HASHBYTES('sha2_256', csvdata.rec)
FROM OPENROWSET(
        BULK 'json/*/*/*/*/*.json.gz',
        DATA_SOURCE = 'landingzone',
        FORMAT = 'csv',
        fieldterminator ='0x0b',
        fieldquote = '0x0b'
) with (rec nvarchar(max)) as csvdata;

This example also produces the error:

SELECT HASHBYTES('sha2_256', '{"mydoc": {"key":"value"}}')
FROM OPENROWSET(
        BULK 'json/*/*/*/*/*.json.gz',
        DATA_SOURCE = 'landingzone',
        FORMAT = 'csv',
        fieldterminator ='0x0b',
        fieldquote = '0x0b'
) with (rec nvarchar(max)) as csvdata;

This example produces a hash as expected:

SELECT HASHBYTES('sha2_256', '{"mydoc": {"key":"value"}}');

I also receive the error if I first create an external table and use the hashbytes() function when querying from the external table.

Thank you in advance for your advice.

I would like to announce that Hashbytes function is now supported in distributed queries (when used with external tables or openrowset) in serverless SQL pools in Azure Synapse!

On this link you can see Transact-SQL features supported in Azure Synapse SQL.

R iterating through 1600 cols in df with binary values 0 and 1 and copy values from two other columns to save in an array by group

Plot raster by nrow and ncol (dimensions) in R

What is the order of precedence when there are multiple Spring's environment profiles as set by spring.profiles.active

How to pass BASH shell variables into AWK statement

Specification pattern with Entity Framework generic repository pattern

Not all parameters were used in the SQL statement while trying to insert into mysql database using flask application [duplicate]

What are all the syntactic elements of a Python function tooltip?

how to write a simple Python Multi-thread task

Fastlane Gym change provisioning profile only for one target

How to determine in which browser your extension background script is executing?

how do I delay the exiting from a function in javaScript? [duplicate]