unique
Get the unique values from a column.
Usage
gurita unique [-h] -c COLUMN
Arguments
Argument |
Description |
Reference |
---|---|---|
|
display help for this command |
|
|
get unique values in this column |
Simple example
Get the unique values in the species
column in the iris.csv
file:
gurita unique -c species < iris.csv
The output of the above command is:
species_unique
setosa
versicolor
virginica
The output is a new table with one column called species_unique
. The rows in the new column list all the unique values from the species
column in the input data.
Here we see that there are three unique values: setosa
, versicolor
, and virginica
.
Because the output is a new table it can be passed to new commands in a chain.
The following example passes the output of unique
is the sort
command, which sorts the unique values in descending order (because the --order d
argument is used):
gurita unique -c species + sort -c species_unique --order d < iris.csv
The output of the above command is as follows:
species_unique
virginica
versicolor
setosa
The above output shows the same unique values as the original example, the only difference is that the values are shown in a different order.
Getting help
The full set of command line arguments for unique
can be obtained with the -h
or --help
arguments:
gurita unqiue -h
Selecting the column
-c COLUMN, --col COLUMN
The unique
command requires the name of a single column to be specified using the -c/--col
argument.
The output of unique
is a new data table with a single column. The name of the output column is based on the name of the column specified by the -c/--col
argument.
If the input column is named example
the output column will be called example_unique
.
For example, the following command generates the unique values in the class
column in the titanic.csv
file:
gurita unique -c class < titanic.csv
The output of the above command is as follows:
class_unique
Third
First
Second