Analysis notebooks for "SARS-CoV-2 entry factors are highly expressed in nasal epithelial cells together with innate immune genes".
For each dataset where per-cell annotation is not available, we re-processed the data from raw or normalised (whichever was deposited alongside the original publication) quantification matrix. The standard scanpy and clustering procedure was followed. When batch information is available, harmony was used to correct batch effects in the PC space and the corrected PCs were used for computing nearest neighbour graph. To re-annotate the cells, multiple clusterings of different resolutions were generated among which the one best matching the published clustering was picked and manual annotation was undertaken using marker genes described in the original publication.
The analysis code makes heavy use of scanpy_scripts (branch nh3
), which provides command line interfaces as well as functional extensions to scanpy.