Kmeans clustering with multiple columns containing strings2019 Community Moderator ElectionScikit Learn: KMeans Clustering 3D data over a time period (dimentionality reduction?)Combining K-means clustering with Agglomerative clusteringKMeans clustering to help label Multi-class Supervised modelConfused about how to apply KMeans on my a dataset with features extractedImplementation of kmeans clustering using RClustering for multiple variableClustering with multiple distance measureshow to convert multiple columns into single columns in pandas?Accuracy for Kmeans clusteringHow can I perform clustering on a list of words and ratings as columns?
What is the fastest integer factorization to break RSA?
What Exploit Are These User Agents Trying to Use?
What's the meaning of "Sollensaussagen"?
Placement of More Information/Help Icon button for Radio Buttons
Send out email when Apex Queueable fails and test it
How seriously should I take size and weight limits of hand luggage?
Forgetting the musical notes while performing in concert
Can someone clarify Hamming's notion of important problems in relation to modern academia?
Are British MPs missing the point, with these 'Indicative Votes'?
Can a virus destroy the BIOS of a modern computer?
What exactly is ineptocracy?
Why were 5.25" floppy drives cheaper than 8"?
Does the Idaho Potato Commission associate potato skins with healthy eating?
Blending or harmonizing
How dangerous is XSS
In Bayesian inference, why are some terms dropped from the posterior predictive?
Why is the sentence "Das ist eine Nase" correct?
Is it "common practice in Fourier transform spectroscopy to multiply the measured interferogram by an apodizing function"? If so, why?
files created then deleted at every second in tmp directory
What is a Samsaran Word™?
How to prevent "they're falling in love" trope
Finitely generated matrix groups whose eigenvalues are all algebraic
Is there a hemisphere-neutral way of specifying a season?
Notepad++ delete until colon for every line with replace all
Kmeans clustering with multiple columns containing strings
2019 Community Moderator ElectionScikit Learn: KMeans Clustering 3D data over a time period (dimentionality reduction?)Combining K-means clustering with Agglomerative clusteringKMeans clustering to help label Multi-class Supervised modelConfused about how to apply KMeans on my a dataset with features extractedImplementation of kmeans clustering using RClustering for multiple variableClustering with multiple distance measureshow to convert multiple columns into single columns in pandas?Accuracy for Kmeans clusteringHow can I perform clustering on a list of words and ratings as columns?
$begingroup$
I have the following dataset:
https://www.kaggle.com/carolzhangdc/imdb-5000-movie-dataset
What I want to find is clusters based on imdb score per genre per country. I have created a pandas data frame that contains per country for every unique genre the average imdb rating.
The dataframe looks like this:
country object
genre object
avgRating float64
dtype: object
Since the columns country and genre contain strings, I can't use Kmeans for this.
Is there anyway I can achieve what I want?
Ps: This is the first question I have asked. Tips on how I can improve my question are appreciated.
python k-means unsupervised-learning
New contributor
$endgroup$
add a comment |
$begingroup$
I have the following dataset:
https://www.kaggle.com/carolzhangdc/imdb-5000-movie-dataset
What I want to find is clusters based on imdb score per genre per country. I have created a pandas data frame that contains per country for every unique genre the average imdb rating.
The dataframe looks like this:
country object
genre object
avgRating float64
dtype: object
Since the columns country and genre contain strings, I can't use Kmeans for this.
Is there anyway I can achieve what I want?
Ps: This is the first question I have asked. Tips on how I can improve my question are appreciated.
python k-means unsupervised-learning
New contributor
$endgroup$
add a comment |
$begingroup$
I have the following dataset:
https://www.kaggle.com/carolzhangdc/imdb-5000-movie-dataset
What I want to find is clusters based on imdb score per genre per country. I have created a pandas data frame that contains per country for every unique genre the average imdb rating.
The dataframe looks like this:
country object
genre object
avgRating float64
dtype: object
Since the columns country and genre contain strings, I can't use Kmeans for this.
Is there anyway I can achieve what I want?
Ps: This is the first question I have asked. Tips on how I can improve my question are appreciated.
python k-means unsupervised-learning
New contributor
$endgroup$
I have the following dataset:
https://www.kaggle.com/carolzhangdc/imdb-5000-movie-dataset
What I want to find is clusters based on imdb score per genre per country. I have created a pandas data frame that contains per country for every unique genre the average imdb rating.
The dataframe looks like this:
country object
genre object
avgRating float64
dtype: object
Since the columns country and genre contain strings, I can't use Kmeans for this.
Is there anyway I can achieve what I want?
Ps: This is the first question I have asked. Tips on how I can improve my question are appreciated.
python k-means unsupervised-learning
python k-means unsupervised-learning
New contributor
New contributor
New contributor
asked 1 hour ago
DonCappieDonCappie
12
12
New contributor
New contributor
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
DonCappie is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48473%2fkmeans-clustering-with-multiple-columns-containing-strings%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
DonCappie is a new contributor. Be nice, and check out our Code of Conduct.
DonCappie is a new contributor. Be nice, and check out our Code of Conduct.
DonCappie is a new contributor. Be nice, and check out our Code of Conduct.
DonCappie is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48473%2fkmeans-clustering-with-multiple-columns-containing-strings%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown