Looking for an algorithm which does Max Sum Clustering Unicorn Meta Zoo #1: Why another podcast? Announcing the arrival of Valued Associate #679: Cesar Manara 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsClustering not producing even clustersHow to create clusters of position data?Numerical data and different algorithmsHow PCA is different from SubSpace clustering ? how to extract variables responsible for PCA1 component?Distance Based Classification in PythonWhy use SOM for clustering?Datasets for Weighted graph clustering with detailed ground truthData model and algorithm for recommending “related” interestsMeasure of variety within list/clusterClustering algorithm for a distance matrix
English or Hindi translation of Vyasa Smriti
How to find if a column is referenced in a computed column?
Do I need to watch Ant-Man and the Wasp and Captain Marvel before watching Avengers: Endgame?
What is this word supposed to be?
Multiple fireplaces in an apartment building?
How to not starve gigantic beasts
"My boss was furious with me and I have been fired" vs. "My boss was furious with me and I was fired"
How do I check if a string is entirely made of the same substring?
What is the term for a person whose job is to place products on shelves in stores?
Why did C use the -> operator instead of reusing the . operator?
How bug prioritization works in agile projects vs non agile
How exactly does Hawking radiation decrease the mass of black holes?
Prove that the countable union of countable sets is also countable
How to have a sharp product image?
The weakest link
Unable to completely uninstall Zoom meeting app
Is Electric Central Heating worth it if using Solar Panels?
A faster way to compute the largest prime factor
Obeylines and gappto from etoolbox
Does Mathematica have an implementation of the Poisson binomial distribution?
Co-worker works way more than he should
A strange hotel
Suing a Police Officer Instead of the Police Department
Sharepoint Designer Discontinuation - software to modify existing workflows
Looking for an algorithm which does Max Sum Clustering
Unicorn Meta Zoo #1: Why another podcast?
Announcing the arrival of Valued Associate #679: Cesar Manara
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsClustering not producing even clustersHow to create clusters of position data?Numerical data and different algorithmsHow PCA is different from SubSpace clustering ? how to extract variables responsible for PCA1 component?Distance Based Classification in PythonWhy use SOM for clustering?Datasets for Weighted graph clustering with detailed ground truthData model and algorithm for recommending “related” interestsMeasure of variety within list/clusterClustering algorithm for a distance matrix
$begingroup$
I have a very limited background in data science and dataset processing and I was hoping I could get some help here. I am doing some work that requires clustering certain data points having (x, y) position values and associated weight Wi for each point.
I started with looking at k-means clustering since I know how many bins or k clusters the data needs to be divided into but I also have an additional criteria regarding the sum of weights Wi in each cluster being less than/equal to a value Wmax.
I need to essentially cluster the points along k different y=Ci lines on XY plane such that the sum of weights for points along each line is less than/equal to Wmax. Are there any constrained clustering algorithms that do this. It would a massive help if someone could point me to any open source C++ data science libraries that can help me achieve this.
Thank you for your time and help.
clustering
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
add a comment |
$begingroup$
I have a very limited background in data science and dataset processing and I was hoping I could get some help here. I am doing some work that requires clustering certain data points having (x, y) position values and associated weight Wi for each point.
I started with looking at k-means clustering since I know how many bins or k clusters the data needs to be divided into but I also have an additional criteria regarding the sum of weights Wi in each cluster being less than/equal to a value Wmax.
I need to essentially cluster the points along k different y=Ci lines on XY plane such that the sum of weights for points along each line is less than/equal to Wmax. Are there any constrained clustering algorithms that do this. It would a massive help if someone could point me to any open source C++ data science libraries that can help me achieve this.
Thank you for your time and help.
clustering
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
$begingroup$
This is really more of an optimization problem than a traditional clustering problem. Under the hood, kmeans is just moving points towards the nearest center, then recalculating the clusters after points have moved. Instead, treat this as a travelling salesman-type problem, where each Wmax is the "size of the truck" - figure out how to allocate your points effectively as if they were locations on a route. Alternatively, run a standard K-means algorithm, then figure out how to most effectively shift points to other clusters to meet your constraints.
$endgroup$
– flyingmeatball
2 mins ago
add a comment |
$begingroup$
I have a very limited background in data science and dataset processing and I was hoping I could get some help here. I am doing some work that requires clustering certain data points having (x, y) position values and associated weight Wi for each point.
I started with looking at k-means clustering since I know how many bins or k clusters the data needs to be divided into but I also have an additional criteria regarding the sum of weights Wi in each cluster being less than/equal to a value Wmax.
I need to essentially cluster the points along k different y=Ci lines on XY plane such that the sum of weights for points along each line is less than/equal to Wmax. Are there any constrained clustering algorithms that do this. It would a massive help if someone could point me to any open source C++ data science libraries that can help me achieve this.
Thank you for your time and help.
clustering
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
I have a very limited background in data science and dataset processing and I was hoping I could get some help here. I am doing some work that requires clustering certain data points having (x, y) position values and associated weight Wi for each point.
I started with looking at k-means clustering since I know how many bins or k clusters the data needs to be divided into but I also have an additional criteria regarding the sum of weights Wi in each cluster being less than/equal to a value Wmax.
I need to essentially cluster the points along k different y=Ci lines on XY plane such that the sum of weights for points along each line is less than/equal to Wmax. Are there any constrained clustering algorithms that do this. It would a massive help if someone could point me to any open source C++ data science libraries that can help me achieve this.
Thank you for your time and help.
clustering
clustering
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
asked 23 mins ago
confusdius730confusdius730
11
11
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
confusdius730 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$begingroup$
This is really more of an optimization problem than a traditional clustering problem. Under the hood, kmeans is just moving points towards the nearest center, then recalculating the clusters after points have moved. Instead, treat this as a travelling salesman-type problem, where each Wmax is the "size of the truck" - figure out how to allocate your points effectively as if they were locations on a route. Alternatively, run a standard K-means algorithm, then figure out how to most effectively shift points to other clusters to meet your constraints.
$endgroup$
– flyingmeatball
2 mins ago
add a comment |
$begingroup$
This is really more of an optimization problem than a traditional clustering problem. Under the hood, kmeans is just moving points towards the nearest center, then recalculating the clusters after points have moved. Instead, treat this as a travelling salesman-type problem, where each Wmax is the "size of the truck" - figure out how to allocate your points effectively as if they were locations on a route. Alternatively, run a standard K-means algorithm, then figure out how to most effectively shift points to other clusters to meet your constraints.
$endgroup$
– flyingmeatball
2 mins ago
$begingroup$
This is really more of an optimization problem than a traditional clustering problem. Under the hood, kmeans is just moving points towards the nearest center, then recalculating the clusters after points have moved. Instead, treat this as a travelling salesman-type problem, where each Wmax is the "size of the truck" - figure out how to allocate your points effectively as if they were locations on a route. Alternatively, run a standard K-means algorithm, then figure out how to most effectively shift points to other clusters to meet your constraints.
$endgroup$
– flyingmeatball
2 mins ago
$begingroup$
This is really more of an optimization problem than a traditional clustering problem. Under the hood, kmeans is just moving points towards the nearest center, then recalculating the clusters after points have moved. Instead, treat this as a travelling salesman-type problem, where each Wmax is the "size of the truck" - figure out how to allocate your points effectively as if they were locations on a route. Alternatively, run a standard K-means algorithm, then figure out how to most effectively shift points to other clusters to meet your constraints.
$endgroup$
– flyingmeatball
2 mins ago
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
confusdius730 is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f50933%2flooking-for-an-algorithm-which-does-max-sum-clustering%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
confusdius730 is a new contributor. Be nice, and check out our Code of Conduct.
confusdius730 is a new contributor. Be nice, and check out our Code of Conduct.
confusdius730 is a new contributor. Be nice, and check out our Code of Conduct.
confusdius730 is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f50933%2flooking-for-an-algorithm-which-does-max-sum-clustering%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
This is really more of an optimization problem than a traditional clustering problem. Under the hood, kmeans is just moving points towards the nearest center, then recalculating the clusters after points have moved. Instead, treat this as a travelling salesman-type problem, where each Wmax is the "size of the truck" - figure out how to allocate your points effectively as if they were locations on a route. Alternatively, run a standard K-means algorithm, then figure out how to most effectively shift points to other clusters to meet your constraints.
$endgroup$
– flyingmeatball
2 mins ago