Recommender system for next career step












3












$begingroup$


I want to build a recommender system that suggests the next step in your career.



About the dataset. About 50'000 Users with following informations:




  • Skills (tags, string value)

  • every job they did (embedded job titles)

  • every school they went


Now the recommender system should tell you what the next step in your career can be, and if you need a another skill or degree for that step.



How should you build such a system? (Supervised, Unsupervised (Recommender System,...), ...)










share|improve this question











$endgroup$




bumped to the homepage by Community 15 hours ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.















  • $begingroup$
    Hi.Please can you confirm how you solved this? I am interested in the solution. Thanks.
    $endgroup$
    – Z Z
    Dec 20 '18 at 9:15
















3












$begingroup$


I want to build a recommender system that suggests the next step in your career.



About the dataset. About 50'000 Users with following informations:




  • Skills (tags, string value)

  • every job they did (embedded job titles)

  • every school they went


Now the recommender system should tell you what the next step in your career can be, and if you need a another skill or degree for that step.



How should you build such a system? (Supervised, Unsupervised (Recommender System,...), ...)










share|improve this question











$endgroup$




bumped to the homepage by Community 15 hours ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.















  • $begingroup$
    Hi.Please can you confirm how you solved this? I am interested in the solution. Thanks.
    $endgroup$
    – Z Z
    Dec 20 '18 at 9:15














3












3








3


1



$begingroup$


I want to build a recommender system that suggests the next step in your career.



About the dataset. About 50'000 Users with following informations:




  • Skills (tags, string value)

  • every job they did (embedded job titles)

  • every school they went


Now the recommender system should tell you what the next step in your career can be, and if you need a another skill or degree for that step.



How should you build such a system? (Supervised, Unsupervised (Recommender System,...), ...)










share|improve this question











$endgroup$




I want to build a recommender system that suggests the next step in your career.



About the dataset. About 50'000 Users with following informations:




  • Skills (tags, string value)

  • every job they did (embedded job titles)

  • every school they went


Now the recommender system should tell you what the next step in your career can be, and if you need a another skill or degree for that step.



How should you build such a system? (Supervised, Unsupervised (Recommender System,...), ...)







recommender-system machine-learning-model






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Dec 19 '18 at 21:06









Z Z

1032




1032










asked Oct 19 '18 at 15:01









PeterPeter

161




161





bumped to the homepage by Community 15 hours ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.







bumped to the homepage by Community 15 hours ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.














  • $begingroup$
    Hi.Please can you confirm how you solved this? I am interested in the solution. Thanks.
    $endgroup$
    – Z Z
    Dec 20 '18 at 9:15


















  • $begingroup$
    Hi.Please can you confirm how you solved this? I am interested in the solution. Thanks.
    $endgroup$
    – Z Z
    Dec 20 '18 at 9:15
















$begingroup$
Hi.Please can you confirm how you solved this? I am interested in the solution. Thanks.
$endgroup$
– Z Z
Dec 20 '18 at 9:15




$begingroup$
Hi.Please can you confirm how you solved this? I am interested in the solution. Thanks.
$endgroup$
– Z Z
Dec 20 '18 at 9:15










1 Answer
1






active

oldest

votes


















0












$begingroup$

I have tried something similar once. I used an approach which you would not expect, but which gave some surprisingly good results.



I used the NMT (Neural Machine Translation) model in Tensorflow. There are some examples online for Vietnamese->English translation. I changed it to translate from "old job title" to "new job title", and trained it on my dataset. This is easy to do and you don't need to modify any code, except maybe tweak the model a little (e.g. length of document is now shorter than in the machine translation example).



The drawback of this approach is that only the most recent job title is taken into account when using it for inference. However you could try concatenating all previous job titles and other data (schools etc) as the input text, to produce a single job title as output, and this way you could make sure you are making use of their entire career path.



I would suggest tweaking aspects of the architecture (number of layers, number of previous jobs used, off-the-shelf word2vec vs domain specific trained word2vec etc) until you get the best performance against whatever evaluation metric you are using.



The other approach you could take would be to take the doc2vec of each title, and train some kind of RNN/LSTM to predict the next vec given all previous vecs. Then you need a postprocessing stage on the output to convert the output vec to a text. Unfortunately I don't know of a shortcut here to avoid building your model architecture from scratch, so this approach is more work.



Of course there are other ways of solving the problem but since no-one else has answered yet I thought I'd give my suggestion.






share|improve this answer









$endgroup$













    Your Answer





    StackExchange.ifUsing("editor", function () {
    return StackExchange.using("mathjaxEditing", function () {
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    });
    });
    }, "mathjax-editing");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "557"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f39919%2frecommender-system-for-next-career-step%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    0












    $begingroup$

    I have tried something similar once. I used an approach which you would not expect, but which gave some surprisingly good results.



    I used the NMT (Neural Machine Translation) model in Tensorflow. There are some examples online for Vietnamese->English translation. I changed it to translate from "old job title" to "new job title", and trained it on my dataset. This is easy to do and you don't need to modify any code, except maybe tweak the model a little (e.g. length of document is now shorter than in the machine translation example).



    The drawback of this approach is that only the most recent job title is taken into account when using it for inference. However you could try concatenating all previous job titles and other data (schools etc) as the input text, to produce a single job title as output, and this way you could make sure you are making use of their entire career path.



    I would suggest tweaking aspects of the architecture (number of layers, number of previous jobs used, off-the-shelf word2vec vs domain specific trained word2vec etc) until you get the best performance against whatever evaluation metric you are using.



    The other approach you could take would be to take the doc2vec of each title, and train some kind of RNN/LSTM to predict the next vec given all previous vecs. Then you need a postprocessing stage on the output to convert the output vec to a text. Unfortunately I don't know of a shortcut here to avoid building your model architecture from scratch, so this approach is more work.



    Of course there are other ways of solving the problem but since no-one else has answered yet I thought I'd give my suggestion.






    share|improve this answer









    $endgroup$


















      0












      $begingroup$

      I have tried something similar once. I used an approach which you would not expect, but which gave some surprisingly good results.



      I used the NMT (Neural Machine Translation) model in Tensorflow. There are some examples online for Vietnamese->English translation. I changed it to translate from "old job title" to "new job title", and trained it on my dataset. This is easy to do and you don't need to modify any code, except maybe tweak the model a little (e.g. length of document is now shorter than in the machine translation example).



      The drawback of this approach is that only the most recent job title is taken into account when using it for inference. However you could try concatenating all previous job titles and other data (schools etc) as the input text, to produce a single job title as output, and this way you could make sure you are making use of their entire career path.



      I would suggest tweaking aspects of the architecture (number of layers, number of previous jobs used, off-the-shelf word2vec vs domain specific trained word2vec etc) until you get the best performance against whatever evaluation metric you are using.



      The other approach you could take would be to take the doc2vec of each title, and train some kind of RNN/LSTM to predict the next vec given all previous vecs. Then you need a postprocessing stage on the output to convert the output vec to a text. Unfortunately I don't know of a shortcut here to avoid building your model architecture from scratch, so this approach is more work.



      Of course there are other ways of solving the problem but since no-one else has answered yet I thought I'd give my suggestion.






      share|improve this answer









      $endgroup$
















        0












        0








        0





        $begingroup$

        I have tried something similar once. I used an approach which you would not expect, but which gave some surprisingly good results.



        I used the NMT (Neural Machine Translation) model in Tensorflow. There are some examples online for Vietnamese->English translation. I changed it to translate from "old job title" to "new job title", and trained it on my dataset. This is easy to do and you don't need to modify any code, except maybe tweak the model a little (e.g. length of document is now shorter than in the machine translation example).



        The drawback of this approach is that only the most recent job title is taken into account when using it for inference. However you could try concatenating all previous job titles and other data (schools etc) as the input text, to produce a single job title as output, and this way you could make sure you are making use of their entire career path.



        I would suggest tweaking aspects of the architecture (number of layers, number of previous jobs used, off-the-shelf word2vec vs domain specific trained word2vec etc) until you get the best performance against whatever evaluation metric you are using.



        The other approach you could take would be to take the doc2vec of each title, and train some kind of RNN/LSTM to predict the next vec given all previous vecs. Then you need a postprocessing stage on the output to convert the output vec to a text. Unfortunately I don't know of a shortcut here to avoid building your model architecture from scratch, so this approach is more work.



        Of course there are other ways of solving the problem but since no-one else has answered yet I thought I'd give my suggestion.






        share|improve this answer









        $endgroup$



        I have tried something similar once. I used an approach which you would not expect, but which gave some surprisingly good results.



        I used the NMT (Neural Machine Translation) model in Tensorflow. There are some examples online for Vietnamese->English translation. I changed it to translate from "old job title" to "new job title", and trained it on my dataset. This is easy to do and you don't need to modify any code, except maybe tweak the model a little (e.g. length of document is now shorter than in the machine translation example).



        The drawback of this approach is that only the most recent job title is taken into account when using it for inference. However you could try concatenating all previous job titles and other data (schools etc) as the input text, to produce a single job title as output, and this way you could make sure you are making use of their entire career path.



        I would suggest tweaking aspects of the architecture (number of layers, number of previous jobs used, off-the-shelf word2vec vs domain specific trained word2vec etc) until you get the best performance against whatever evaluation metric you are using.



        The other approach you could take would be to take the doc2vec of each title, and train some kind of RNN/LSTM to predict the next vec given all previous vecs. Then you need a postprocessing stage on the output to convert the output vec to a text. Unfortunately I don't know of a shortcut here to avoid building your model architecture from scratch, so this approach is more work.



        Of course there are other ways of solving the problem but since no-one else has answered yet I thought I'd give my suggestion.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Oct 19 '18 at 15:09









        TomTom

        261




        261






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Data Science Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f39919%2frecommender-system-for-next-career-step%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            How to label and detect the document text images

            Tabula Rosettana

            Aureus (color)