PySpark: match the values of a DataFrame column against another DataFrame column
Solution 1:
This kind of operation is called left semi join in spark:
df_B.join(df_A, ['col1'], 'leftsemi')
This kind of operation is called left semi join in spark:
df_B.join(df_A, ['col1'], 'leftsemi')