How do I get the total number of distinct values in a column in a CSV?

username Published at Dev

Username

I have a CSV file named test.csv. It looks like this:

1,Color
1,Width
2,Color
2,Height

I want to find out how many distinct values are in the first column. The shell script should return 2 in this case.

I tried running sort -u -t, -k2,2 test.csv, which I saw on another question, but it printed out far more info than I need.

How do I write a shell script that prints the number of distinct values in the first column of test.csv?

anubhava

Using awk you can do:

awk -F, '!seen[$1]++{c++} END{print c}' file

2

This awk command uses key $1, and stores them in an array seen. Value of which is incremented to 1 when a key is populated first time. Every time we get a unique key we increment count c and print it in the end.

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2021-02-28

Comments

0 comments

From Dev

Related Related

Article

How do I get the total number of distinct values in a column in a CSV?

How do I get the total number of distinct values in a column in a CSV?

SQL: How would I get a total count for distinct values in a column?

How do I get the distinct/unique values in a column in Excel?

MySQL: How do I get the AVG value in one column for entries sharing distinct values in another column?

How do I sum the total number of values against a given value in another column?

How can I get the total number of rows in a CSV file with PHP?

How do I count distinct combinations of column values?

How do I calculate total number of unique values in a dataframe?

How do I calculate total number of unique values in a dataframe?

How to get the total count of rows AND the count based on the distinct values of another column in the table

SQL on Spark: How do I get all values of DISTINCT?

How do I get distinct count values of 2 columns?

How do I get total number of objects in a variable only once?

How do you get distinct values from dataTables and sum the total specific field using JS

In Django, how could I in a single query get total row count based on distinct field values?

How to get number of distinct values ? mysql

In R: How do I get the column names of a CSV file as a list and values as a list of lists

How can I insert values as a distinct column?

How do I count the number of nonzero values in a given array column?

How can i get CSV column values in php

How Do I Sum the Total Of The Column Values If filtering the pending values in Angularjs?

How do I sum distinct values?

determining total number of times distinct values 0 or 1 or na in each column in a data frame in R

How can i get the total of a column in mysql?

How do I get the total number of records before I run limit when using Aggregation

How do I summarize the total number of radio button selection values (Letters, and not numbers)?

How do I get distinct grouped data?

How do I get the averages of duplicate values in a postgresql column?

How do i get column values of all checked rows?

How do I get all values of a column in sqlite3?