Normalization for two bulk RNA-Seq samples to enable reliable fold-change estimation between genes

I have two bulk RNA-Seq samples, already tpm-normalized.

I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.

The distribution of the two samples using the common set of genes looks similar:

TPM distribution

However, the two samples have only been tpm-normalized, is it enough to guarantee reliable fold-change estimation? Should I use another normalization procedure, e.g. Quantile Normalization, before comparison?

My objective is to define a signature using the genes that are up-regulated in Sample1 with respect to Sample0, and vice versa. I'm using log fold-changes, but I'm concerned that their value may be affected by each sample distribution.

Do you also have suggestions for the definition of up-regulated genes with these data?

scatter

asked 3 hours ago

gc5

721216

add a comment |

I have two bulk RNA-Seq samples, already tpm-normalized.

I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.

The distribution of the two samples using the common set of genes looks similar:

TPM distribution

Do you also have suggestions for the definition of up-regulated genes with these data?

scatter

asked 3 hours ago

gc5

721216

add a comment |

I have two bulk RNA-Seq samples, already tpm-normalized.

I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.

The distribution of the two samples using the common set of genes looks similar:

TPM distribution

Do you also have suggestions for the definition of up-regulated genes with these data?

scatter

asked 3 hours ago

gc5

721216

I have two bulk RNA-Seq samples, already tpm-normalized.

I would like to know what is a reasonable normalization procedure to enable downstream log fold-change estimation.

The distribution of the two samples using the common set of genes looks similar:

TPM distribution

Do you also have suggestions for the definition of up-regulated genes with these data?

scatter

rna-seq normalization fold-change

asked 3 hours ago

gc5

721216

asked 3 hours ago

gc5

721216

asked 3 hours ago

gc5

721216

asked 3 hours ago

gc5

721216

asked 3 hours ago

gc5

721216

add a comment |

3 Answers
3

active

oldest

votes

It's not a good idea to do tpm normalisation prior to differential expression analysis, because the actual read counts are useful to determine shot noise and statistical significance. DESeq2 includes read normalisation as part of its methods for differential expression analysis.

answered 2 hours ago

gringer

7,79221049

$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
1 hour ago

add a comment |

What I have generally done in the past is to process the data using voom in the limma package for bulk RNASeq. Inside voom you can call for different normalization methods to be used - "TMM" works fine for me and, is advocated by many in the field.

voom will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.

Check out this link for more info on the package as well as normalization methods: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
It is a very thorough introduction to the package and all of its capabilities.

Good luck!

answered 2 hours ago

h3ab74

836

add a comment |

You have only two samples?

You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.

answered 1 hour ago

swbarnes2

47114

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\$","\$"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "676"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fbioinformatics.stackexchange.com%2fquestions%2f7142%2fnormalization-for-two-bulk-rna-seq-samples-to-enable-reliable-fold-change-estima%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

3 Answers
3

active

oldest

votes

3 Answers
3

active

oldest

votes

answered 2 hours ago

gringer

7,79221049

$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
1 hour ago

add a comment |

answered 2 hours ago

gringer

7,79221049

$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
1 hour ago

add a comment |

answered 2 hours ago

gringer

7,79221049

answered 2 hours ago

gringer

7,79221049

answered 2 hours ago

gringer

7,79221049

answered 2 hours ago

gringer

7,79221049

answered 2 hours ago

gringer

7,79221049

$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
1 hour ago

add a comment |

$begingroup$
I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks
$endgroup$
– gc5
1 hour ago

I agree with TPM for a lot of reasons, unfortunately the data was already in TPM. Can you explain more about how read counts are useful to determine shot noise and statistical significance? Thanks

– gc5
1 hour ago

add a comment |

voom will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.

Good luck!

answered 2 hours ago

h3ab74

836

add a comment |

voom will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.

Good luck!

answered 2 hours ago

h3ab74

836

add a comment |

voom will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.

Good luck!

answered 2 hours ago

h3ab74

836

voom will output an object containing the normalized expression values in a log2 scale, which, also in my experience, has worked out just fine for calculating log fold changes.

Good luck!

answered 2 hours ago

h3ab74

836

answered 2 hours ago

h3ab74

836

answered 2 hours ago

h3ab74

836

answered 2 hours ago

h3ab74

836

add a comment |

You have only two samples?

You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.

answered 1 hour ago

swbarnes2

47114

add a comment |

You have only two samples?

You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.

answered 1 hour ago

swbarnes2

47114

add a comment |

You have only two samples?

You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.

answered 1 hour ago

swbarnes2

47114

You have only two samples?

You aren't going to be able to draw strong conclusions from that no matter what you do. Clever statistics don't work without replicates.

answered 1 hour ago

swbarnes2

47114

answered 1 hour ago

swbarnes2

47114

answered 1 hour ago

swbarnes2

47114

answered 1 hour ago

swbarnes2

47114

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Bioinformatics Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Htydjtk