New posts in apache-spark-sql

How to read records in JSON format from Kafka using Structured Streaming?

Spark Structured Streaming automatically converts timestamp to local time

Best way to get the max value in a Spark dataframe column

How can we JOIN two Spark SQL dataframes using a SQL-esque "LIKE" criterion?

Spark DataFrame: count distinct values of every column

Extract column values of Dataframe as List in Apache Spark

PySpark: explode JSON in column to multiple columns

How to export a table dataframe in PySpark to csv?

How to create an empty DataFrame with a specified schema?

Overwrite specific partitions in spark dataframe write method

How to convert rows into columns in a Spark DataFrame using Scala [duplicate]

Does SparkSQL support subqueries?

How to split a dataframe into dataframes with same column values?

How can I pass extra parameters to UDFs in Spark SQL?

Avoid performance impact of a single partition mode in Spark window functions

Spark unionAll multiple dataframes

SparkSQL: How to deal with null values in user defined function?

Spark DataFrame groupBy and sort in the descending order (pyspark)

What is the meaning of partitionColumn, lowerBound, upperBound, numPartitions parameters?

Databricks - is not empty but it's not a Delta table