A R Markdown / Notebook that walks through the various unix commands handy for data scientists. Great for self-taught software developers that are trying to gain familiarity with using command line to work with data.
Examples of using unix commands to complement your data science work:
- Peek at data (head / tail)
- Inspect number of rows / columns
- Combine data by merging rows / columns
- Data preparation (remove header row, replace separator, aggregate)
- Use regex for search-and-replace
- Compression / Decompress
sed
,awk
and other text processing utility- Other common use-cases
This workbook (R Markdown) will be continually updated and expanded upon. Interested? Clone / Fork the repo;
Please feel free to submit Pull Requests.
- Add attribution to authors / reference materials
- Add more chapters to cover new use-cases