I have two tables, and I want to read only the unique records from the source table, i.e. the records that are not already in the target table. Both tables contain null values.
Source table:
name | age | degree | dept
aaa  | 20  | ece    | null
bbb  | 20  | it     | null
ccc  | 30  | mech   | null
Target table:
name | age | degree | dept
aaa  | 20  | ece    | null
bbb  | 20  | it     | null
source_df.join(target_df, Seq("name", "age", "degree"), "leftanti") -> works
source_df.join(target_df, Seq("name", "age", "degree", "dept"), "leftanti") -> does not work
I need to pick only the third record (ccc) from the source table.
If I use name, age, and degree as the join keys, it works as expected.
But when I include dept in the join keys, the join returns all the records from the source table.
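For reference, here is a minimal sketch that should reproduce the behaviour with the sample data above. It assumes a local SparkSession; the object name LeftAntiNullRepro, the Int type for age, and the Option[String] type for dept (used so the column can hold null) are assumptions for illustration, not my real schema.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical wrapper object, used only to make the sketch self-contained.
object LeftAntiNullRepro {
  def main(args: Array[String]): Unit = {
    // Assumption: a local session is enough to reproduce the issue.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("leftanti-null-repro")
      .getOrCreate()
    import spark.implicits._

    // dept is modelled as Option[String]; None becomes null in the DataFrame.
    val source_df = Seq(
      ("aaa", 20, "ece", Option.empty[String]),
      ("bbb", 20, "it", Option.empty[String]),
      ("ccc", 30, "mech", Option.empty[String])
    ).toDF("name", "age", "degree", "dept")

    val target_df = Seq(
      ("aaa", 20, "ece", Option.empty[String]),
      ("bbb", 20, "it", Option.empty[String])
    ).toDF("name", "age", "degree", "dept")

    // Join keys without dept: the case that works for me.
    source_df.join(target_df, Seq("name", "age", "degree"), "leftanti").show()

    // Join keys including dept (null on every row): the case that does not work for me.
    source_df.join(target_df, Seq("name", "age", "degree", "dept"), "leftanti").show()

    spark.stop()
  }
}
```

The only difference between the two joins is the dept column, which is null in every row of both tables.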
Please help me.