How to use pre-trained weights to initialize a custom CNN?

This paper (linked in the comment below) shows that a U-Net whose encoder is initialized with pre-trained VGG weights gets better results than the same network trained from scratch. I want to build a custom U-Net with channel widths [32, 64, 128, 256], which differ from those of vgg.features. (I trained both a U-Net with the VGG architecture, without pre-trained weights, and my custom U-Net; both reach the same IoU score, which suggests that the simpler architecture is sufficient for my data.)

How can I initialize my network from the VGG weights? As far as I know, knowledge distillation could give me a small network whose weights I could use to initialize a U-Net with widths [32, 64, 128, 256], but distillation on the ImageNet data set would take a lot of time. Is there another way to initialize the smaller U-Net?
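One inexpensive option to consider is copying a subset of the pre-trained filters into the narrower layers. The sketch below is only an illustration under stated assumptions: it uses NumPy, random arrays stand in for the first two VGG convolution kernels (shapes (64, 3, 3, 3) and (128, 64, 3, 3)), and the helper name slice_conv_weight and the choice of ranking filters by L1 norm are hypothetical, not taken from either paper. Whether weights sliced this way beat random initialization would have to be checked experimentally.

```python
import numpy as np

def slice_conv_weight(weight, in_idx, n_out):
    """Shrink a conv kernel of shape (out_ch, in_ch, kH, kW).

    in_idx: indices of the input channels to keep (the filters kept in
            the previous layer), or None to keep every input channel.
    n_out:  number of output filters to keep, ranked by L1 norm.
    Returns the smaller kernel and the indices of the kept filters.
    """
    if in_idx is not None:
        weight = weight[:, in_idx]
    # L1 norm of each output filter (assumed ranking criterion).
    norms = np.abs(weight).sum(axis=(1, 2, 3))
    out_idx = np.sort(np.argsort(norms)[-n_out:])
    return weight[out_idx], out_idx

# Random stand-ins for pre-trained kernels; real ones would come from
# the pre-trained model's weights.
rng = np.random.default_rng(0)
vgg_conv1 = rng.normal(size=(64, 3, 3, 3))
vgg_conv2 = rng.normal(size=(128, 64, 3, 3))

# Halve each layer, keeping the input channels consistent with the
# filters that survived in the previous layer.
small_conv1, kept1 = slice_conv_weight(vgg_conv1, None, 32)
small_conv2, kept2 = slice_conv_weight(vgg_conv2, kept1, 64)

print(small_conv1.shape, small_conv2.shape)
```

The biases of each layer would need to be sliced with the same out_idx. Because half of the input channels are dropped, the sliced layers no longer reproduce the original VGG activations, so this gives at most a starting point for fine-tuning.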

  • Here are the two papers I referred to above: arxiv.org/abs/1801.05746 and arxiv.org/abs/1503.02531
    – user66596, 22 hours ago

deep-learning cnn

asked 22 hours ago by user66596