Using K-prototypes algorithm to cluster gdelt data by country
$begingroup$
I want to cluster GDLET data by country using the k-prototypes algorithm. GDELT returns a list of themes as seen in this SE post.
I was looking at the example data frame in this blog post as a reference. My idea is to sum theme occurrence by country and use the top themes as inputs, then my data frame would look like
t1 t2 t3 t4 t5 t1 t2 t3 t4 t5 ...
country_1 0 0 0 1 0 1 0 0 0 0 ...
.
.
.
|_____________| |____________|
#1 theme #2 theme
I would then be able to cluster the countries using this data frame. The above mentioned SE post says that clustering will not work well for that question, but I feel like what I am trying to do is different, but I have not used k-modes/k-prototypes and was hoping someone who had could let me know if this is a valid approach. There will be other data associated besides themes, but this is the basic approach I am planning for the majority of the data frame.
Thanks
clustering
New contributor
$endgroup$
add a comment |
$begingroup$
I want to cluster GDLET data by country using the k-prototypes algorithm. GDELT returns a list of themes as seen in this SE post.
I was looking at the example data frame in this blog post as a reference. My idea is to sum theme occurrence by country and use the top themes as inputs, then my data frame would look like
t1 t2 t3 t4 t5 t1 t2 t3 t4 t5 ...
country_1 0 0 0 1 0 1 0 0 0 0 ...
.
.
.
|_____________| |____________|
#1 theme #2 theme
I would then be able to cluster the countries using this data frame. The above mentioned SE post says that clustering will not work well for that question, but I feel like what I am trying to do is different, but I have not used k-modes/k-prototypes and was hoping someone who had could let me know if this is a valid approach. There will be other data associated besides themes, but this is the basic approach I am planning for the majority of the data frame.
Thanks
clustering
New contributor
$endgroup$
add a comment |
$begingroup$
I want to cluster GDLET data by country using the k-prototypes algorithm. GDELT returns a list of themes as seen in this SE post.
I was looking at the example data frame in this blog post as a reference. My idea is to sum theme occurrence by country and use the top themes as inputs, then my data frame would look like
t1 t2 t3 t4 t5 t1 t2 t3 t4 t5 ...
country_1 0 0 0 1 0 1 0 0 0 0 ...
.
.
.
|_____________| |____________|
#1 theme #2 theme
I would then be able to cluster the countries using this data frame. The above mentioned SE post says that clustering will not work well for that question, but I feel like what I am trying to do is different, but I have not used k-modes/k-prototypes and was hoping someone who had could let me know if this is a valid approach. There will be other data associated besides themes, but this is the basic approach I am planning for the majority of the data frame.
Thanks
clustering
New contributor
$endgroup$
I want to cluster GDLET data by country using the k-prototypes algorithm. GDELT returns a list of themes as seen in this SE post.
I was looking at the example data frame in this blog post as a reference. My idea is to sum theme occurrence by country and use the top themes as inputs, then my data frame would look like
t1 t2 t3 t4 t5 t1 t2 t3 t4 t5 ...
country_1 0 0 0 1 0 1 0 0 0 0 ...
.
.
.
|_____________| |____________|
#1 theme #2 theme
I would then be able to cluster the countries using this data frame. The above mentioned SE post says that clustering will not work well for that question, but I feel like what I am trying to do is different, but I have not used k-modes/k-prototypes and was hoping someone who had could let me know if this is a valid approach. There will be other data associated besides themes, but this is the basic approach I am planning for the majority of the data frame.
Thanks
clustering
clustering
New contributor
New contributor
New contributor
asked 2 mins ago
Jeff TiltonJeff Tilton
101
101
New contributor
New contributor
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "557"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Jeff Tilton is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48061%2fusing-k-prototypes-algorithm-to-cluster-gdelt-data-by-country%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Jeff Tilton is a new contributor. Be nice, and check out our Code of Conduct.
Jeff Tilton is a new contributor. Be nice, and check out our Code of Conduct.
Jeff Tilton is a new contributor. Be nice, and check out our Code of Conduct.
Jeff Tilton is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48061%2fusing-k-prototypes-algorithm-to-cluster-gdelt-data-by-country%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown