Excel: Remove duplicate rows

Use this action to identify all the unique values in one column, delete rows containing duplicate values, and save the result as a new spreadsheet.

screen readers look here

How to configure this action

This action will create a new file in which the rows that are filtered out are completely deleted, not just hidden. The initial file is unedited. All rows above the Start Row will be included in the output file.

Fields for this action

  • Spreadsheet file

    • This is the Excel file you want to filter. Use a field reference to a file uploaded in a prior task or instance.
  • Sheet

    • Enter the worksheet name or number in the spreadsheet. Enter a number to select by worksheet order. If the sheet does not exist, it will be created. - By default, Excel sheet names are “Sheet1”, “Sheet2”, etc. Enter a number to select by worksheet order, the first sheet starts at 1.
  • Header row number

    • Enter the row number that contains the names of the columns. Leave blank if the file does not have a header row.
    • ✅   Heads-up: If blank, columns can only be referenced by Excel or column order reference style.
  • Start row

    • The row number from which to begin removing duplicates. If left blank, this defaults to the first row.
      • For example, if the start row is set to 5, the output file will include the unfiltered rows 1-4 and all unique rows after and including row 5.
  • Column Reference Type

    • Enter the column reference type to use during configuration; Column Name, Excel, or Column Order. Subsequent fields must be configured according to the column reference type you choose.
    Column name If a header row is set, you can reference columns directly by their name in {{column-name}} format.
    Excel You can reference columns by their Excel letter, such as A, B, C.
    Column order You can reference columns by their order, such as 1, 2, 3.
    • If Header row number is not specified, only Excel or Column Order references can be used.
  • Column

    • Choose the column to remove duplicates from using Column Names, Excel style references, or Column Order. You must reference the column based on the column reference type specified above.
  • Output field name

    • The name of the field in which the result should be saved.

What will this output?

This action outputs a new Excel file based on the initial spreadsheet and configuration. The name of the output field for this action will be the Output field name defined during configuration.

Sorry about that. What was the most unhelpful part?

Thanks for your feedback

We update the Help Center daily, so expect changes soon.

Link Copied

Paste this URL anywhere to link straight to the section.

Need more help?

If you're signed in to Catalytic Community, you can ask other users a question. You'll be redirected to Community where you can add more info.