Forecasting ticket sales and cityForecasting an individual based on a representative groupBest regression model to use for sales predictionRegression (and Forecasting) on RentSales Prediction for Fashion Retail DataArtificially Increasing Training dataTime series prediction of discontinuous dataForecast vs Prediction: What is the difference?How to decide which forecasting model to use?Forecasting energy consumption with no historical data when there is a trend
gerund and noun applications
What should I install to correct "ld: cannot find -lgbm and -linput" so that I can compile a Rust program?
Brake pads destroying wheels
What does Deadpool mean by "left the house in that shirt"?
How to terminate ping <dest> &
I got the following comment from a reputed math journal. What does it mean?
Does the attack bonus from a Masterwork weapon stack with the attack bonus from Masterwork ammunition?
Four married couples attend a party. Each person shakes hands with every other person, except their own spouse, exactly once. How many handshakes?
What is the significance behind "40 days" that often appears in the Bible?
Is there a hypothetical scenario that would make Earth uninhabitable for humans, but not for (the majority of) other animals?
The average age of first marriage in Russia
Could Sinn Fein swing any Brexit vote in Parliament?
When to use snap-off blade knife and when to use trapezoid blade knife?
Does .bashrc contain syntax errors?
What is the relationship between relativity and the Doppler effect?
Writing in a Christian voice
Is honey really a supersaturated solution? Does heating to un-crystalize redissolve it or melt it?
How could an airship be repaired midflight?
Calculate the frequency of characters in a string
My friend is being a hypocrite
What exactly term 'companion plants' means?
Is this an example of a Neapolitan chord?
Does multi-classing into Fighter give you heavy armor proficiency?
Print a physical multiplication table
Forecasting ticket sales and city
Forecasting an individual based on a representative groupBest regression model to use for sales predictionRegression (and Forecasting) on RentSales Prediction for Fashion Retail DataArtificially Increasing Training dataTime series prediction of discontinuous dataForecast vs Prediction: What is the difference?How to decide which forecasting model to use?Forecasting energy consumption with no historical data when there is a trend
$begingroup$
I am learning data science. I have the following dataset for train tickets:
1. order_date_meduim
order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,
2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]
order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3
3. order_ticketcount,ticketclass
order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax
I need to forecast the ticket sales for a week and also the ordercity with medium of booking.
As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.
I code in Python.
python predictive-modeling prediction data-science-model
$endgroup$
This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.
This question has not received enough attention.
add a comment |
$begingroup$
I am learning data science. I have the following dataset for train tickets:
1. order_date_meduim
order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,
2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]
order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3
3. order_ticketcount,ticketclass
order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax
I need to forecast the ticket sales for a week and also the ordercity with medium of booking.
As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.
I code in Python.
python predictive-modeling prediction data-science-model
$endgroup$
This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.
This question has not received enough attention.
$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53
add a comment |
$begingroup$
I am learning data science. I have the following dataset for train tickets:
1. order_date_meduim
order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,
2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]
order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3
3. order_ticketcount,ticketclass
order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax
I need to forecast the ticket sales for a week and also the ordercity with medium of booking.
As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.
I code in Python.
python predictive-modeling prediction data-science-model
$endgroup$
I am learning data science. I have the following dataset for train tickets:
1. order_date_meduim
order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,
2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]
order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3
3. order_ticketcount,ticketclass
order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax
I need to forecast the ticket sales for a week and also the ordercity with medium of booking.
As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.
I code in Python.
python predictive-modeling prediction data-science-model
python predictive-modeling prediction data-science-model
edited Mar 9 at 23:40
Roshan Mehta
asked Mar 9 at 23:27
Roshan MehtaRoshan Mehta
666
666
This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.
This question has not received enough attention.
This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.
This question has not received enough attention.
$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53
add a comment |
$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53
$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53
$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53
add a comment |
3 Answers
3
active
oldest
votes
$begingroup$
You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.
What Is Time Series Forecasting?
You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )
After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:
- Classical, Statistical
- ARMA for stationary data
- ARIMA for data with a trend - Refer
- SARIMA for data with seasonality
- Holt-Winters Forecasting - Refer
- Theta method - Refer
- Fourier Transformation - Refer
- Machine Learning
- Quantile Regression Forest(QRF)
- Support Vector Regression(SVR)
- Recurrent Neural Networks(RNNs) (LSTM)
If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer
$endgroup$
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
1
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
add a comment |
$begingroup$
Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.
Then, sort the data by date.
Next, visualize the data to see general trends and outliers.
The prophet package can estimate forecasts for time series data. A quick-start notebook is here.
$endgroup$
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
add a comment |
$begingroup$
The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?
You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.
If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47014%2fforecasting-ticket-sales-and-city%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.
What Is Time Series Forecasting?
You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )
After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:
- Classical, Statistical
- ARMA for stationary data
- ARIMA for data with a trend - Refer
- SARIMA for data with seasonality
- Holt-Winters Forecasting - Refer
- Theta method - Refer
- Fourier Transformation - Refer
- Machine Learning
- Quantile Regression Forest(QRF)
- Support Vector Regression(SVR)
- Recurrent Neural Networks(RNNs) (LSTM)
If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer
$endgroup$
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
1
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
add a comment |
$begingroup$
You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.
What Is Time Series Forecasting?
You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )
After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:
- Classical, Statistical
- ARMA for stationary data
- ARIMA for data with a trend - Refer
- SARIMA for data with seasonality
- Holt-Winters Forecasting - Refer
- Theta method - Refer
- Fourier Transformation - Refer
- Machine Learning
- Quantile Regression Forest(QRF)
- Support Vector Regression(SVR)
- Recurrent Neural Networks(RNNs) (LSTM)
If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer
$endgroup$
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
1
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
add a comment |
$begingroup$
You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.
What Is Time Series Forecasting?
You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )
After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:
- Classical, Statistical
- ARMA for stationary data
- ARIMA for data with a trend - Refer
- SARIMA for data with seasonality
- Holt-Winters Forecasting - Refer
- Theta method - Refer
- Fourier Transformation - Refer
- Machine Learning
- Quantile Regression Forest(QRF)
- Support Vector Regression(SVR)
- Recurrent Neural Networks(RNNs) (LSTM)
If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer
$endgroup$
You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.
What Is Time Series Forecasting?
You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )
After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:
- Classical, Statistical
- ARMA for stationary data
- ARIMA for data with a trend - Refer
- SARIMA for data with seasonality
- Holt-Winters Forecasting - Refer
- Theta method - Refer
- Fourier Transformation - Refer
- Machine Learning
- Quantile Regression Forest(QRF)
- Support Vector Regression(SVR)
- Recurrent Neural Networks(RNNs) (LSTM)
If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer
answered 2 days ago
PreetPreet
3335
3335
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
1
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
add a comment |
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
1
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago
1
1
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen♦
yesterday
add a comment |
$begingroup$
Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.
Then, sort the data by date.
Next, visualize the data to see general trends and outliers.
The prophet package can estimate forecasts for time series data. A quick-start notebook is here.
$endgroup$
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
add a comment |
$begingroup$
Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.
Then, sort the data by date.
Next, visualize the data to see general trends and outliers.
The prophet package can estimate forecasts for time series data. A quick-start notebook is here.
$endgroup$
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
add a comment |
$begingroup$
Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.
Then, sort the data by date.
Next, visualize the data to see general trends and outliers.
The prophet package can estimate forecasts for time series data. A quick-start notebook is here.
$endgroup$
Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.
Then, sort the data by date.
Next, visualize the data to see general trends and outliers.
The prophet package can estimate forecasts for time series data. A quick-start notebook is here.
edited Mar 12 at 17:43
answered Mar 12 at 17:05
Brian SpieringBrian Spiering
4,1281029
4,1281029
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
add a comment |
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42
add a comment |
$begingroup$
The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?
You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.
If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
add a comment |
$begingroup$
The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?
You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.
If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
add a comment |
$begingroup$
The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?
You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.
If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?
You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.
If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
answered 9 hours ago
SympaSympa
1062
1062
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
add a comment |
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago
add a comment |
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47014%2fforecasting-ticket-sales-and-city%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53