hest.batch_effect.filter_hest_stromal_housekeeping

hest.batch_effect.filter_hest_stromal_housekeeping(meta_df: DataFrame, hest_dir, whole_tissue=False, unify_genes=False, verbose=False) List[HESTData]

Filter the genes of HESTData samples, such that: - only stable housekeeping genes are kept (see assets/MostStable_{species}.csv). - only stromal regions are kept (determined from CellViT segmentation)

The lists of most stable housekeeping genes across organs were taken from https://housekeeping.unicamp.br/?download

Parameters:
  • meta_df (pd.DataFrame) – panda dataframe containing the following columns [‘id’, ‘species’]

  • whole_tissue (bool, optional) – whenever to only keep stromal regions. Defaults to False.

  • unify_genes (bool, optional) – whenever to all the gene names beforehand. Defaults to False.

Returns:

filtered list of sc.AnnData containing only stromal regions and stable housekeeping genes

Return type:

List[HESTData]