Dataset of 100 common nationalities
Handpicked dataset of nationalities you will likely know (Western perspective).
Unfiltered list of nationalities that includes:
- country names that double as nationalities (like Luxemburg)
The original data source is: faru-khan/countries_nationalities_en_ar
More datasets:
These are the transformations I applied to the data (along with some manual picking).
To get nationalities-unfiltered.txt from the source file:
<SOURCE jq -r ".[].nationality_en" | grep -v -e " or " -e " and "
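To illustrate what the grep step drops, here is a small sketch run on made-up sample lines (they are not taken from the source file, and the function name `filter_compound` is hypothetical). Entries containing " or " or " and " as separate words are removed; names that merely contain those letters, such as Icelandic, pass through.

```shell
# filter_compound is a hypothetical name used only for this illustration.
# It drops every line that contains " or " or " and " (with surrounding spaces).
filter_compound() {
  grep -v -e ' or ' -e ' and '
}

# Made-up sample input, one entry per line.
printf '%s\n' 'French' 'A or B' 'Icelandic' 'C and D' 'German' | filter_compound
```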
From txt to JSON:
<FILE.txt awk -F@ 'BEGIN { print "[" } { print " \""$1"\"," }' | sed '$ s/,$/\n]/'
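The sed expression above relies on `\n` in the replacement being turned into a newline, which GNU sed does but other sed implementations may not. As an alternative, here is a minimal sketch that does the same conversion in awk alone. The function name `txt_to_json` is made up for illustration, and the sketch assumes the entries contain no double quotes or backslashes, since nothing is escaped.

```shell
# txt_to_json is a hypothetical helper name, not part of this repository.
# Reads one entry per line from stdin and prints a JSON array of strings.
# Assumes entries contain no double quotes or backslashes (no escaping is done).
txt_to_json() {
  awk '
    BEGIN { print "[" }
    NR > 1 { print prev "," }        # the previous entry gets a trailing comma
    { prev = " \"" $0 "\"" }         # remember the current entry, quoted
    END {
      if (NR > 0) print prev         # the last entry has no trailing comma
      print "]"
    }
  '
}

# Made-up sample input, one entry per line.
printf '%s\n' 'French' 'German' | txt_to_json
```

It can be used in the same style as the command above, for example `<FILE.txt txt_to_json`.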