The following DataFrame has one duplicate row; the fourth row is the same as the first.
>
|
|
| (1) |
The AreUnique command returns a truefalseFAIL DataSeries that shows duplicate rows as false and unique rows as true.
| (2) |
If the DataFrame is indexed by the result, a filtered DataFrame is returned that removes any duplicate rows:
| (3) |
The keep option allows for specification of which row to keep (either the first, last or none) when a duplicate row is found:
>
|
|
| (4) |
>
|
|
| (5) |
If keep is set to none, any row that has a duplicate is marked as a duplicate (false).
>
|
|
The AreUnique command can also be used on individual DataSeries in a DataFrame. This allows for the removal of the rows that have duplicate values in a specific column.
>
|
|