It may seem a simple step, but it isn't.
There is considerable variation in gene expression from patient to patient, which confounds the analysis. However, if one considers dozens of patients and thousands of genes, it might be possible to discern a pattern from the sea of numbers. Which are the genes that are truly informative and how might their information be combined?