Tried to follow Andrej Karpathy's video step by step to build a GPT with a small parameter count, trained on a small dataset.