SAM_public

Work in progress:
- Simplifying the code for inference
- Implementing GPUManager to optimize GPU usage for memory footprint (space) rather than speed
- Implementing flash attention