Some questions about normalization

WOT is a  great tool for time coures single cell analysis! thank you for developing it. I want to use it to analyze my reprograming data too, but I was confused about the normalization steps in your parper. It seem that you normalized tha data twice (before and after find HVGs).  

![屏幕截图 2022-11-22 102721](https://user-images.githubusercontent.com/46601791/203204863-4bf99365-a150-40d9-906e-8a806ac7d26f.png)

![屏幕截图 2022-11-22 102741](https://user-images.githubusercontent.com/46601791/203204966-d2fe520c-154a-4e47-90c2-29f69385dd16.png)

I flowed your way (by my own understanding) to use the code below:

```python3
import numpy as np
import pandas as pd
import scanpy as sc

adata = sc.read_h5ad("adata_filtered.h5ad")

adata.var_names_make_unique() 
sc.pp.filter_cells(adata, min_genes=200)
sc.pp.filter_genes(adata, min_cells=3)

sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)
adata

adata.write("ExprMatrix.h5ad")

sc.pp.highly_variable_genes(adata, min_mean=0.0125, max_mean=3, min_disp=0.5)
adata = adata[:,adata.var.highly_variable]

sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)
adata

adata.write("ExprMatrix.var.genes.h5ad")
```

I wondered if the following operation is reasonable：

1. I didn't downsample, It seems not required.
2. I used the function of find HVGs in scanpy in place of seurat.
3. And last but not least, which data should  the normalization use after select HVGs? raw data？ or just like my code above？


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some questions about normalization #104

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Some questions about normalization #104

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions