add parallel processing (not sure this can be done from a theoretical point of view) arguments for threshold parameters and number of nodes a file format header containing checksum, etc piping from standard input and standard output