Supervised learning on sources of information with different importance Unicorn Meta Zoo #1: Why another podcast? Announcing the arrival of Valued Associate #679: Cesar Manara 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsWhat algorithms should I use to perform job classification based on resume data?How to extract features and classify alert emails coming from monitoring tools into proper category?Question and Answer Chatbot for Customer SupportWhich approach for user classification on chat text (classifier, representation, features)?Best approach for image recognition/classification with few training dataData amount for a very simple chatbotUsing neural networks for classification in Hierarchical dataHow to set the parameters of a Hidden Markov Model that'll be used to correct the mistakes by a previous classifier?Visitor's probability to purchase on eCommerce site, based on aggregate historic dataautoencoder for features selection
How to keep bees out of canned beverages?
Protagonist's race is hidden - should I reveal it?
Why did Israel vote against lifting the American embargo on Cuba?
c++ diamond problem - How to call base method only once
What *exactly* is electrical current, voltage, and resistance?
Co-worker works way more than he should
Prove the alternating sum of a decreasing sequence converging to 0 is Cauchy.
Map material from china not allowed to leave the country
What is this word supposed to be?
Will I lose my paid in full property
Why did C use the -> operator instead of reusing the . operator?
Reattaching fallen shelf to wall?
How do I check if a string is entirely made of the same substring?
What is the best way to deal with NPC-NPC combat?
What to do with someone that cheated their way through university and a PhD program?
Did the Roman Empire have penal colonies?
What's parked in Mil Moscow helicopter plant?
Contradiction proof for inequality of P and NP?
All ASCII characters with a given bit count
Implementing 3DES algorithm in Java: is my code secure?
What is /etc/mtab in Linux?
Arriving in Atlanta after US Preclearance in Dublin. Will I go through TSA security in Atlanta to transfer to a connecting flight?
My bank got bought out, am I now going to have to start filing tax returns in a different state?
What is the ongoing value of the Kanban board to the developers as opposed to management
Supervised learning on sources of information with different importance
Unicorn Meta Zoo #1: Why another podcast?
Announcing the arrival of Valued Associate #679: Cesar Manara
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsWhat algorithms should I use to perform job classification based on resume data?How to extract features and classify alert emails coming from monitoring tools into proper category?Question and Answer Chatbot for Customer SupportWhich approach for user classification on chat text (classifier, representation, features)?Best approach for image recognition/classification with few training dataData amount for a very simple chatbotUsing neural networks for classification in Hierarchical dataHow to set the parameters of a Hidden Markov Model that'll be used to correct the mistakes by a previous classifier?Visitor's probability to purchase on eCommerce site, based on aggregate historic dataautoencoder for features selection
$begingroup$
I am trying to classify customer support sessions using supervised machine learning.
In each customer support session I have 3 bags of information.
1. The title of the customer's complaint
2. Information about the device the customer was using
3. Text of the chat session with the customer support agent
In each customer support session, there are 6 different classes. Is it better to:
1. Train a classifier on each bag of information and have them vote on which class the session belongs to?
2. Put all of the information into a single set of features, and train a single classifier to determine which class the session belongs to?
3. Other?
machine-learning nlp feature-selection aggregation
$endgroup$
bumped to the homepage by Community♦ 42 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
add a comment |
$begingroup$
I am trying to classify customer support sessions using supervised machine learning.
In each customer support session I have 3 bags of information.
1. The title of the customer's complaint
2. Information about the device the customer was using
3. Text of the chat session with the customer support agent
In each customer support session, there are 6 different classes. Is it better to:
1. Train a classifier on each bag of information and have them vote on which class the session belongs to?
2. Put all of the information into a single set of features, and train a single classifier to determine which class the session belongs to?
3. Other?
machine-learning nlp feature-selection aggregation
$endgroup$
bumped to the homepage by Community♦ 42 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
add a comment |
$begingroup$
I am trying to classify customer support sessions using supervised machine learning.
In each customer support session I have 3 bags of information.
1. The title of the customer's complaint
2. Information about the device the customer was using
3. Text of the chat session with the customer support agent
In each customer support session, there are 6 different classes. Is it better to:
1. Train a classifier on each bag of information and have them vote on which class the session belongs to?
2. Put all of the information into a single set of features, and train a single classifier to determine which class the session belongs to?
3. Other?
machine-learning nlp feature-selection aggregation
$endgroup$
I am trying to classify customer support sessions using supervised machine learning.
In each customer support session I have 3 bags of information.
1. The title of the customer's complaint
2. Information about the device the customer was using
3. Text of the chat session with the customer support agent
In each customer support session, there are 6 different classes. Is it better to:
1. Train a classifier on each bag of information and have them vote on which class the session belongs to?
2. Put all of the information into a single set of features, and train a single classifier to determine which class the session belongs to?
3. Other?
machine-learning nlp feature-selection aggregation
machine-learning nlp feature-selection aggregation
asked Dec 29 '17 at 6:11
TobyToby
462
462
bumped to the homepage by Community♦ 42 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
bumped to the homepage by Community♦ 42 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
Its better to do supervised classification then unsupervised clustering since you have the right labels for the response.
Now in supervised classification approach 1 is better than approach 2 because in approach 2 you will the reducing the already available information by combining features.
Approach 3 is advised where you will create more features from the given 3 features by use of NLP processing, sentiments, tags, chunking, time taken, text length etc and then selecting the features those provide more information gain using feature selection approaches.
$endgroup$
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f26102%2fsupervised-learning-on-sources-of-information-with-different-importance%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Its better to do supervised classification then unsupervised clustering since you have the right labels for the response.
Now in supervised classification approach 1 is better than approach 2 because in approach 2 you will the reducing the already available information by combining features.
Approach 3 is advised where you will create more features from the given 3 features by use of NLP processing, sentiments, tags, chunking, time taken, text length etc and then selecting the features those provide more information gain using feature selection approaches.
$endgroup$
add a comment |
$begingroup$
Its better to do supervised classification then unsupervised clustering since you have the right labels for the response.
Now in supervised classification approach 1 is better than approach 2 because in approach 2 you will the reducing the already available information by combining features.
Approach 3 is advised where you will create more features from the given 3 features by use of NLP processing, sentiments, tags, chunking, time taken, text length etc and then selecting the features those provide more information gain using feature selection approaches.
$endgroup$
add a comment |
$begingroup$
Its better to do supervised classification then unsupervised clustering since you have the right labels for the response.
Now in supervised classification approach 1 is better than approach 2 because in approach 2 you will the reducing the already available information by combining features.
Approach 3 is advised where you will create more features from the given 3 features by use of NLP processing, sentiments, tags, chunking, time taken, text length etc and then selecting the features those provide more information gain using feature selection approaches.
$endgroup$
Its better to do supervised classification then unsupervised clustering since you have the right labels for the response.
Now in supervised classification approach 1 is better than approach 2 because in approach 2 you will the reducing the already available information by combining features.
Approach 3 is advised where you will create more features from the given 3 features by use of NLP processing, sentiments, tags, chunking, time taken, text length etc and then selecting the features those provide more information gain using feature selection approaches.
answered Dec 29 '17 at 10:33
gopal krishna varshneygopal krishna varshney
541
541
add a comment |
add a comment |
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f26102%2fsupervised-learning-on-sources-of-information-with-different-importance%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown