How to predict entity (token) classes with a Keras neural network?

I am trying to build a neural network that predicts one of several classes for every token in a document. My dataset provides a class for every token, and the majority of tokens have the class 'None'. I have tried CNNs and LSTMs. When I treat the problem as sequence tagging with word embeddings, I do not get useful results, even though I partly reused tutorial code that works perfectly for POS tagging and similar tasks. The best solution so far is an LSTM-CNN network with character embeddings that receives all tokens of all documents in a single array, ignoring sentence boundaries.

What is the best (or the usual) way to classify tokens in documents? How can I make this work with word embeddings?
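
To make the setup concrete, here is a minimal sketch of the data preparation that sequence tagging with word embeddings usually relies on: one padded row per sentence instead of one flat token array, plus per-token weights that ignore padding and down-weight the dominant 'None' class. Nothing here comes from the question's actual code. The helper names (`build_vocab`, `encode_sentences`), the reserved ids 0 and 1, the toy tags, and the weight 0.2 are all assumptions chosen for illustration.

```python
from collections import Counter

PAD_ID = 0  # assumed: index 0 is reserved for padding
UNK_ID = 1  # assumed: index 1 is reserved for unknown words


def build_vocab(sentences):
    """Map each word to an integer id, starting at 2 (0 and 1 are reserved)."""
    counts = Counter(word for sent in sentences for word, _ in sent)
    return {word: i + 2 for i, word in enumerate(sorted(counts))}


def encode_sentences(sentences, vocab, tag_ids, max_len, none_weight=0.2):
    """Return padded word ids, padded tag ids and per-token weights.

    Padding positions get weight 0.0, 'None' tokens get `none_weight`
    (a hypothetical value that would need tuning), other tags get 1.0.
    """
    none_id = tag_ids["None"]
    X, y, w = [], [], []
    for sent in sentences:
        sent = sent[:max_len]  # truncate overly long sentences
        words = [vocab.get(word, UNK_ID) for word, _ in sent]
        tags = [tag_ids[tag] for _, tag in sent]
        weights = [none_weight if t == none_id else 1.0 for t in tags]
        pad = max_len - len(sent)
        X.append(words + [PAD_ID] * pad)
        y.append(tags + [none_id] * pad)  # padded labels are masked by weight 0.0
        w.append(weights + [0.0] * pad)
    return X, y, w


# Toy data: one list of (word, tag) pairs per sentence, not one flat array.
sentences = [
    [("Anna", "PERSON"), ("lives", "None"), ("in", "None"), ("Oslo", "LOCATION")],
    [("It", "None"), ("rains", "None")],
]
tag_ids = {"None": 0, "PERSON": 1, "LOCATION": 2}
vocab = build_vocab(sentences)
X, y, w = encode_sentences(sentences, vocab, tag_ids, max_len=5)
```

The three lists all have shape (number of sentences, `max_len`). `X` is the kind of input an embedding layer expects, `y` holds one label per position, and `w` could be passed as per-token sample weights so that padding contributes nothing and 'None' tokens contribute less to the loss. Whether this helps on the dataset in question is untested.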












  • What are you trying to do? Why do you need to classify each word of the text, and what are the possible classes?
    – Mark.F, 13 hours ago

python neural-network keras text-mining multiclass-classification





asked yesterday by junkmaster



