New posts in apache-spark-sql

How to read records in JSON format from Kafka using Structured Streaming?

scala apache-spark apache-kafka apache-spark-sql spark-structured-streaming

Spark Strutured Streaming automatically converts timestamp to local time

java scala apache-spark apache-spark-sql spark-structured-streaming

Best way to get the max value in a Spark dataframe column

python apache-spark pyspark apache-spark-sql

How can we JOIN two Spark SQL dataframes using a SQL-esque "LIKE" criterion?

python apache-spark apache-spark-sql pyspark

Spark DataFrame: count distinct values of every column

apache-spark apache-spark-sql distinct-values

Extract column values of Dataframe as List in Apache Spark

scala apache-spark apache-spark-sql

Pyspark: explode json in column to multiple columns

python apache-spark pyspark apache-spark-sql

How to export a table dataframe in PySpark to csv?

python apache-spark dataframe apache-spark-sql export-to-csv

How to create an empty DataFrame with a specified schema?

scala apache-spark dataframe apache-spark-sql

Overwrite specific partitions in spark dataframe write method

apache-spark apache-spark-sql spark-dataframe

how to convert rows into columns in spark dataframe using scala [duplicate]

scala apache-spark apache-spark-sql transpose

Does SparkSQL support subquery?

sql apache-spark subquery apache-spark-sql

How to split a dataframe into dataframes with same column values?

scala apache-spark dataframe apache-spark-sql

How can I pass extra parameters to UDFs in Spark SQL?

scala apache-spark apache-spark-sql user-defined-functions

Avoid performance impact of a single partition mode in Spark window functions

apache-spark pyspark apache-spark-sql partitioning window-functions

Spark unionAll multiple dataframes

scala apache-spark apache-spark-sql

SparkSQL: How to deal with null values in user defined function?

scala apache-spark apache-spark-sql user-defined-functions nullable

Spark DataFrame groupBy and sort in the descending order (pyspark)

python apache-spark dataframe pyspark apache-spark-sql

What is the meaning of partitionColumn, lowerBound, upperBound, numPartitions parameters?

apache-spark jdbc apache-spark-sql

Databricks - is not empty but it's not a Delta table

apache-spark-sql databricks delta-lake