MetaProViz::Pool_Estimation - Data normality #63

Open
@ChristinaSchmidt1

While looking into data normality and SD in a different context, I realised that this is something we need to discuss with regard to the CV calculation of the pool samples.

Since the CV depends on the SD, we should ensure that the data is normally distributed. Otherwise we should either return a warning, use a different measure such as the interquartile range, or try to enforce data normality by log transformation (which wouldn't be my favorite choice).
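To illustrate why this matters, here is a minimal sketch comparing the classical CV with an IQR-based alternative on a single pool metabolite. It is written in Python purely for illustration (the package itself is R), and the function names `cv_percent` and `robust_cv_percent` as well as the example intensities are made up. The factor 1.349 is the IQR of a standard normal distribution, so IQR / 1.349 approximates the SD when the data is normal.

```python
import statistics


def cv_percent(values):
    """Classical CV in percent: sample SD divided by the mean."""
    return 100 * statistics.stdev(values) / statistics.mean(values)


def robust_cv_percent(values):
    """IQR-based CV in percent (hypothetical alternative).

    IQR / 1.349 estimates the SD under normality and is far less
    sensitive to single outlying pool injections; the median replaces
    the mean for the same reason.
    """
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return 100 * ((q3 - q1) / 1.349) / statistics.median(values)


# Made-up intensities of one metabolite across six pool injections,
# the last one being an outlier.
pool = [100.0, 102.0, 98.0, 101.0, 99.0, 180.0]

print(f"CV:        {cv_percent(pool):.1f} %")
print(f"robust CV: {robust_cv_percent(pool):.1f} %")
```

With one outlier the classical CV is inflated by an order of magnitude, while the IQR-based value stays close to the spread of the remaining injections. Whether that is the behaviour we want for filtering metabolites is exactly the point to discuss.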

I personally would use the Shapiro test on the pool samples. Here we only have one condition (="Pool") and perform the test for each metabolite. We can return a warning/message about the data distribution, as in the DMA function, and let the user know why this matters for the CV calculation. We could even consider adding the results to the output DF. Given that this is the same code as in the DMA function, I would turn the Shapiro test into a helper function, so that we can use it in both DMA and Pool_Estimation.
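A rough sketch of what such a shared helper could look like, again in Python for illustration only (the real helper would live in the R package). The name `normality_by_metabolite`, its arguments, and the returned structure are assumptions, not existing MetaProViz code. It assumes SciPy is installed when no custom test is passed, and it skips metabolites with fewer than three values or no variation, for which the Shapiro-Wilk test is not meaningful.

```python
import warnings
from typing import Callable, Dict, Optional, Sequence, Tuple


def normality_by_metabolite(
    pool: Dict[str, Sequence[float]],
    alpha: float = 0.05,
    test: Optional[Callable[[Sequence[float]], float]] = None,
) -> Dict[str, Tuple[Optional[float], Optional[bool]]]:
    """Run a normality test per metabolite on the pool samples.

    `pool` maps metabolite name -> intensities across pool injections.
    `test` takes the values and returns a p-value; by default the
    Shapiro-Wilk test from SciPy is used (imported lazily).
    Returns metabolite -> (p_value, looks_normal), or (None, None)
    when the test cannot be applied.
    """
    if test is None:
        from scipy.stats import shapiro

        def test(values):
            return shapiro(values)[1]  # second element is the p-value

    results = {}
    for metabolite, values in pool.items():
        # Too few values or zero variance: report as not testable.
        if len(values) < 3 or len(set(values)) == 1:
            results[metabolite] = (None, None)
            continue
        p_value = float(test(values))
        results[metabolite] = (p_value, p_value >= alpha)

    flagged = [m for m, (_, ok) in results.items() if ok is False]
    if flagged:
        warnings.warn(
            f"{len(flagged)} metabolite(s) deviate from normality in the "
            "pool samples; CV values for these should be read with caution."
        )
    return results
```

The per-metabolite results could be merged into the output DF as extra columns, and the same function could be called from DMA with one call per condition.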

For the time being, I will add a comment to the vignette, so that the user is informed about the importance of data normality.

Metadata

Labels

Intermediate priority (Implementation needs to be prioritised)
documentation (Improvements or additions to documentation)
