Predicting Customer Activity Absence Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern) 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsFind threshold in rate to determine reason for lost customerPredicting with categorical dataPredicting customer conversion rates for number of advertisements shown over a time periodPredicting a new documentHow to use features when predicting aggregation?Predicting sequences newbie questionPredicting missing data. Looking for good data predicting techniquePredicting service dateWhat Machine Learning Algorithm could I use to determine some measure in a date?predicting loan default based on transaction history
Can anything be seen from the center of the Boötes void? How dark would it be?
What are the out-of-universe reasons for the references to Toby Maguire-era Spider-Man in ITSV
How do I stop a creek from eroding my steep embankment?
Is "Reachable Object" really an NP-complete problem?
Can melee weapons be used to deliver Contact Poisons?
How to answer "Have you ever been terminated?"
Quick way to create a symlink?
Is it fair for a professor to grade us on the possession of past papers?
Should I use a zero-interest credit card for a large one-time purchase?
Using audio cues to encourage good posture
Maximum summed powersets with non-adjacent items
Generate an RGB colour grid
An adverb for when you're not exaggerating
Is there such thing as an Availability Group failover trigger?
Would "destroying" Wurmcoil Engine prevent its tokens from being created?
Why do the resolve message appear first?
また usage in a dictionary
How to show element name in portuguese using elements package?
Why are there no cargo aircraft with "flying wing" design?
Is it ethical to give a final exam after the professor has quit before teaching the remaining chapters of the course?
Is it common practice to audition new musicians 1-2-1 before rehearsing with the entire band?
Does classifying an integer as a discrete log require it be part of a multiplicative group?
Can a new player join a group only when a new campaign starts?
What is the meaning of the simile “quick as silk”?
Predicting Customer Activity Absence
Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsFind threshold in rate to determine reason for lost customerPredicting with categorical dataPredicting customer conversion rates for number of advertisements shown over a time periodPredicting a new documentHow to use features when predicting aggregation?Predicting sequences newbie questionPredicting missing data. Looking for good data predicting techniquePredicting service dateWhat Machine Learning Algorithm could I use to determine some measure in a date?predicting loan default based on transaction history
$begingroup$
Could you please assist me with to following question?
I have a customer activity dataframe that looks like this:
It contains at least 500.000 customers and a "timeseries" of 42 months. The ones and zeroes represent customer activity. If a customer was active during a particular month then there will be a 1, if not - 0. I need determine those customers that most likely (+ probability) will not be active during the next 6 months (2018 July-December).
Could you please direct me what approach/models should i use in order to predict this? I use Python.
Thanks in advance!
python pandas prediction dataframe data-analysis
$endgroup$
bumped to the homepage by Community♦ 31 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
add a comment |
$begingroup$
Could you please assist me with to following question?
I have a customer activity dataframe that looks like this:
It contains at least 500.000 customers and a "timeseries" of 42 months. The ones and zeroes represent customer activity. If a customer was active during a particular month then there will be a 1, if not - 0. I need determine those customers that most likely (+ probability) will not be active during the next 6 months (2018 July-December).
Could you please direct me what approach/models should i use in order to predict this? I use Python.
Thanks in advance!
python pandas prediction dataframe data-analysis
$endgroup$
bumped to the homepage by Community♦ 31 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
$begingroup$
This question is way too broad. What have you tried already? Have you done any basic statistical analysis? Are the data any different from random 1s and 0s? Have you seen any patterns? Is your outcome "any activity (ie at least one "1") in the next six months"?
$endgroup$
– Spacedman
Jun 16 '18 at 9:44
$begingroup$
@Spacedman: Thanks for your input. To be honest i'm a bit lost (+ very new to this field) and haven't done anything yet. No, the data is only 1 and 0. The outcome would be, as you mentioned, "any activity". If there is at least one 1 in the next 6 months then I am not interested in this customer.
$endgroup$
– Andrei
Jun 16 '18 at 9:58
$begingroup$
Is there seasonality (a yearly pattern)? Is there serial correlation in the 1s (ie you are more likely to get 10 1s in a row than if they are completely random)? Do 1s "tail off" for all customers? Do some basic descriptive summaries before thinking all you need to do is throw it into a magic machine learning algorithm.
$endgroup$
– Spacedman
Jun 16 '18 at 15:40
$begingroup$
There pretty much might be a seasonality pattern, that's what i'm trying to figure out. Some of them might have had 1's for a few years and afterwards are just trailing 0's that pretty much mean (based on the data available) that the customer will not be active anymore. Some of them have only a few 1's in the middle of the selected daterange (these are, as well, not very likely to produce new activity). Some might bring up recurring activity based on seasonality (these are the one's id like to determine). There might not have been any sight from them recently but the might be active again soon
$endgroup$
– Andrei
Jun 16 '18 at 15:47
add a comment |
$begingroup$
Could you please assist me with to following question?
I have a customer activity dataframe that looks like this:
It contains at least 500.000 customers and a "timeseries" of 42 months. The ones and zeroes represent customer activity. If a customer was active during a particular month then there will be a 1, if not - 0. I need determine those customers that most likely (+ probability) will not be active during the next 6 months (2018 July-December).
Could you please direct me what approach/models should i use in order to predict this? I use Python.
Thanks in advance!
python pandas prediction dataframe data-analysis
$endgroup$
Could you please assist me with to following question?
I have a customer activity dataframe that looks like this:
It contains at least 500.000 customers and a "timeseries" of 42 months. The ones and zeroes represent customer activity. If a customer was active during a particular month then there will be a 1, if not - 0. I need determine those customers that most likely (+ probability) will not be active during the next 6 months (2018 July-December).
Could you please direct me what approach/models should i use in order to predict this? I use Python.
Thanks in advance!
python pandas prediction dataframe data-analysis
python pandas prediction dataframe data-analysis
asked Jun 16 '18 at 9:15
AndreiAndrei
61
61
bumped to the homepage by Community♦ 31 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
bumped to the homepage by Community♦ 31 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
$begingroup$
This question is way too broad. What have you tried already? Have you done any basic statistical analysis? Are the data any different from random 1s and 0s? Have you seen any patterns? Is your outcome "any activity (ie at least one "1") in the next six months"?
$endgroup$
– Spacedman
Jun 16 '18 at 9:44
$begingroup$
@Spacedman: Thanks for your input. To be honest i'm a bit lost (+ very new to this field) and haven't done anything yet. No, the data is only 1 and 0. The outcome would be, as you mentioned, "any activity". If there is at least one 1 in the next 6 months then I am not interested in this customer.
$endgroup$
– Andrei
Jun 16 '18 at 9:58
$begingroup$
Is there seasonality (a yearly pattern)? Is there serial correlation in the 1s (ie you are more likely to get 10 1s in a row than if they are completely random)? Do 1s "tail off" for all customers? Do some basic descriptive summaries before thinking all you need to do is throw it into a magic machine learning algorithm.
$endgroup$
– Spacedman
Jun 16 '18 at 15:40
$begingroup$
There pretty much might be a seasonality pattern, that's what i'm trying to figure out. Some of them might have had 1's for a few years and afterwards are just trailing 0's that pretty much mean (based on the data available) that the customer will not be active anymore. Some of them have only a few 1's in the middle of the selected daterange (these are, as well, not very likely to produce new activity). Some might bring up recurring activity based on seasonality (these are the one's id like to determine). There might not have been any sight from them recently but the might be active again soon
$endgroup$
– Andrei
Jun 16 '18 at 15:47
add a comment |
$begingroup$
This question is way too broad. What have you tried already? Have you done any basic statistical analysis? Are the data any different from random 1s and 0s? Have you seen any patterns? Is your outcome "any activity (ie at least one "1") in the next six months"?
$endgroup$
– Spacedman
Jun 16 '18 at 9:44
$begingroup$
@Spacedman: Thanks for your input. To be honest i'm a bit lost (+ very new to this field) and haven't done anything yet. No, the data is only 1 and 0. The outcome would be, as you mentioned, "any activity". If there is at least one 1 in the next 6 months then I am not interested in this customer.
$endgroup$
– Andrei
Jun 16 '18 at 9:58
$begingroup$
Is there seasonality (a yearly pattern)? Is there serial correlation in the 1s (ie you are more likely to get 10 1s in a row than if they are completely random)? Do 1s "tail off" for all customers? Do some basic descriptive summaries before thinking all you need to do is throw it into a magic machine learning algorithm.
$endgroup$
– Spacedman
Jun 16 '18 at 15:40
$begingroup$
There pretty much might be a seasonality pattern, that's what i'm trying to figure out. Some of them might have had 1's for a few years and afterwards are just trailing 0's that pretty much mean (based on the data available) that the customer will not be active anymore. Some of them have only a few 1's in the middle of the selected daterange (these are, as well, not very likely to produce new activity). Some might bring up recurring activity based on seasonality (these are the one's id like to determine). There might not have been any sight from them recently but the might be active again soon
$endgroup$
– Andrei
Jun 16 '18 at 15:47
$begingroup$
This question is way too broad. What have you tried already? Have you done any basic statistical analysis? Are the data any different from random 1s and 0s? Have you seen any patterns? Is your outcome "any activity (ie at least one "1") in the next six months"?
$endgroup$
– Spacedman
Jun 16 '18 at 9:44
$begingroup$
This question is way too broad. What have you tried already? Have you done any basic statistical analysis? Are the data any different from random 1s and 0s? Have you seen any patterns? Is your outcome "any activity (ie at least one "1") in the next six months"?
$endgroup$
– Spacedman
Jun 16 '18 at 9:44
$begingroup$
@Spacedman: Thanks for your input. To be honest i'm a bit lost (+ very new to this field) and haven't done anything yet. No, the data is only 1 and 0. The outcome would be, as you mentioned, "any activity". If there is at least one 1 in the next 6 months then I am not interested in this customer.
$endgroup$
– Andrei
Jun 16 '18 at 9:58
$begingroup$
@Spacedman: Thanks for your input. To be honest i'm a bit lost (+ very new to this field) and haven't done anything yet. No, the data is only 1 and 0. The outcome would be, as you mentioned, "any activity". If there is at least one 1 in the next 6 months then I am not interested in this customer.
$endgroup$
– Andrei
Jun 16 '18 at 9:58
$begingroup$
Is there seasonality (a yearly pattern)? Is there serial correlation in the 1s (ie you are more likely to get 10 1s in a row than if they are completely random)? Do 1s "tail off" for all customers? Do some basic descriptive summaries before thinking all you need to do is throw it into a magic machine learning algorithm.
$endgroup$
– Spacedman
Jun 16 '18 at 15:40
$begingroup$
Is there seasonality (a yearly pattern)? Is there serial correlation in the 1s (ie you are more likely to get 10 1s in a row than if they are completely random)? Do 1s "tail off" for all customers? Do some basic descriptive summaries before thinking all you need to do is throw it into a magic machine learning algorithm.
$endgroup$
– Spacedman
Jun 16 '18 at 15:40
$begingroup$
There pretty much might be a seasonality pattern, that's what i'm trying to figure out. Some of them might have had 1's for a few years and afterwards are just trailing 0's that pretty much mean (based on the data available) that the customer will not be active anymore. Some of them have only a few 1's in the middle of the selected daterange (these are, as well, not very likely to produce new activity). Some might bring up recurring activity based on seasonality (these are the one's id like to determine). There might not have been any sight from them recently but the might be active again soon
$endgroup$
– Andrei
Jun 16 '18 at 15:47
$begingroup$
There pretty much might be a seasonality pattern, that's what i'm trying to figure out. Some of them might have had 1's for a few years and afterwards are just trailing 0's that pretty much mean (based on the data available) that the customer will not be active anymore. Some of them have only a few 1's in the middle of the selected daterange (these are, as well, not very likely to produce new activity). Some might bring up recurring activity based on seasonality (these are the one's id like to determine). There might not have been any sight from them recently but the might be active again soon
$endgroup$
– Andrei
Jun 16 '18 at 15:47
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
First issue in your model building method is that you have data in the binary form for each month and you are trying to predict into a 6 month time period. You gotta define what will be considered active in the next 6 months?
- Active in 6 months = Active in each individual months?
or - Active in 6 months = Active in any given month?
If you want to first predict for individual months, you can use logistic regression. It is useful for a binary classification of this kind.
Use LogisticRegression from sklearn.linear_model to train and fit the data.
Then, use confusion matrix and classification_report from sklearn.metrics to test the performance of your model.
After having predictions for next 6 months, you can create a new column that checks if there are any 1's in the last 6 months and stores 1 else stores 0
$endgroup$
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f33246%2fpredicting-customer-activity-absence%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
First issue in your model building method is that you have data in the binary form for each month and you are trying to predict into a 6 month time period. You gotta define what will be considered active in the next 6 months?
- Active in 6 months = Active in each individual months?
or - Active in 6 months = Active in any given month?
If you want to first predict for individual months, you can use logistic regression. It is useful for a binary classification of this kind.
Use LogisticRegression from sklearn.linear_model to train and fit the data.
Then, use confusion matrix and classification_report from sklearn.metrics to test the performance of your model.
After having predictions for next 6 months, you can create a new column that checks if there are any 1's in the last 6 months and stores 1 else stores 0
$endgroup$
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
add a comment |
$begingroup$
First issue in your model building method is that you have data in the binary form for each month and you are trying to predict into a 6 month time period. You gotta define what will be considered active in the next 6 months?
- Active in 6 months = Active in each individual months?
or - Active in 6 months = Active in any given month?
If you want to first predict for individual months, you can use logistic regression. It is useful for a binary classification of this kind.
Use LogisticRegression from sklearn.linear_model to train and fit the data.
Then, use confusion matrix and classification_report from sklearn.metrics to test the performance of your model.
After having predictions for next 6 months, you can create a new column that checks if there are any 1's in the last 6 months and stores 1 else stores 0
$endgroup$
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
add a comment |
$begingroup$
First issue in your model building method is that you have data in the binary form for each month and you are trying to predict into a 6 month time period. You gotta define what will be considered active in the next 6 months?
- Active in 6 months = Active in each individual months?
or - Active in 6 months = Active in any given month?
If you want to first predict for individual months, you can use logistic regression. It is useful for a binary classification of this kind.
Use LogisticRegression from sklearn.linear_model to train and fit the data.
Then, use confusion matrix and classification_report from sklearn.metrics to test the performance of your model.
After having predictions for next 6 months, you can create a new column that checks if there are any 1's in the last 6 months and stores 1 else stores 0
$endgroup$
First issue in your model building method is that you have data in the binary form for each month and you are trying to predict into a 6 month time period. You gotta define what will be considered active in the next 6 months?
- Active in 6 months = Active in each individual months?
or - Active in 6 months = Active in any given month?
If you want to first predict for individual months, you can use logistic regression. It is useful for a binary classification of this kind.
Use LogisticRegression from sklearn.linear_model to train and fit the data.
Then, use confusion matrix and classification_report from sklearn.metrics to test the performance of your model.
After having predictions for next 6 months, you can create a new column that checks if there are any 1's in the last 6 months and stores 1 else stores 0
edited Jun 16 '18 at 13:35
answered Jun 16 '18 at 13:26
Sharvari GcSharvari Gc
262
262
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
add a comment |
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
A logistic regression model has no other covariates for this data, so it can only predict a mean for each customer, so you may as well count the fraction of 1s for each customer and use that as a percentage probability of a 1 in any month. Then combinatorics and probability theory gets you the P(>0 1s in the next six months). Basic probability theory, no need to "learn" anything. The next level would be to assume some kind of time trend, but that requires a model as a function of time, and an assumed correlation structure and so on.
$endgroup$
– Spacedman
Jun 16 '18 at 14:57
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
Individual months actually do not matter. I'm interested in the 6-month period a s whole. I think seasonality plays a big role here.
$endgroup$
– Andrei
Jun 16 '18 at 15:51
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
$begingroup$
@Spacedman You are right. Should delete this answer?
$endgroup$
– Sharvari Gc
Jun 17 '18 at 14:12
add a comment |
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f33246%2fpredicting-customer-activity-absence%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
This question is way too broad. What have you tried already? Have you done any basic statistical analysis? Are the data any different from random 1s and 0s? Have you seen any patterns? Is your outcome "any activity (ie at least one "1") in the next six months"?
$endgroup$
– Spacedman
Jun 16 '18 at 9:44
$begingroup$
@Spacedman: Thanks for your input. To be honest i'm a bit lost (+ very new to this field) and haven't done anything yet. No, the data is only 1 and 0. The outcome would be, as you mentioned, "any activity". If there is at least one 1 in the next 6 months then I am not interested in this customer.
$endgroup$
– Andrei
Jun 16 '18 at 9:58
$begingroup$
Is there seasonality (a yearly pattern)? Is there serial correlation in the 1s (ie you are more likely to get 10 1s in a row than if they are completely random)? Do 1s "tail off" for all customers? Do some basic descriptive summaries before thinking all you need to do is throw it into a magic machine learning algorithm.
$endgroup$
– Spacedman
Jun 16 '18 at 15:40
$begingroup$
There pretty much might be a seasonality pattern, that's what i'm trying to figure out. Some of them might have had 1's for a few years and afterwards are just trailing 0's that pretty much mean (based on the data available) that the customer will not be active anymore. Some of them have only a few 1's in the middle of the selected daterange (these are, as well, not very likely to produce new activity). Some might bring up recurring activity based on seasonality (these are the one's id like to determine). There might not have been any sight from them recently but the might be active again soon
$endgroup$
– Andrei
Jun 16 '18 at 15:47