Forecasting ticket sales and cityForecasting an individual based on a representative groupBest regression model to use for sales predictionRegression (and Forecasting) on RentSales Prediction for Fashion Retail DataArtificially Increasing Training dataTime series prediction of discontinuous dataForecast vs Prediction: What is the difference?How to decide which forecasting model to use?Forecasting energy consumption with no historical data when there is a trend

gerund and noun applications

What should I install to correct "ld: cannot find -lgbm and -linput" so that I can compile a Rust program?

Brake pads destroying wheels

What does Deadpool mean by "left the house in that shirt"?

How to terminate ping <dest> &

I got the following comment from a reputed math journal. What does it mean?

Does the attack bonus from a Masterwork weapon stack with the attack bonus from Masterwork ammunition?

Four married couples attend a party. Each person shakes hands with every other person, except their own spouse, exactly once. How many handshakes?

What is the significance behind "40 days" that often appears in the Bible?

Is there a hypothetical scenario that would make Earth uninhabitable for humans, but not for (the majority of) other animals?

The average age of first marriage in Russia

Could Sinn Fein swing any Brexit vote in Parliament?

When to use snap-off blade knife and when to use trapezoid blade knife?

Does .bashrc contain syntax errors?

What is the relationship between relativity and the Doppler effect?

Writing in a Christian voice

Is honey really a supersaturated solution? Does heating to un-crystalize redissolve it or melt it?

How could an airship be repaired midflight?

Calculate the frequency of characters in a string

My friend is being a hypocrite

What exactly term 'companion plants' means?

Is this an example of a Neapolitan chord?

Does multi-classing into Fighter give you heavy armor proficiency?

Print a physical multiplication table



Forecasting ticket sales and city


Forecasting an individual based on a representative groupBest regression model to use for sales predictionRegression (and Forecasting) on RentSales Prediction for Fashion Retail DataArtificially Increasing Training dataTime series prediction of discontinuous dataForecast vs Prediction: What is the difference?How to decide which forecasting model to use?Forecasting energy consumption with no historical data when there is a trend













1












$begingroup$


I am learning data science. I have the following dataset for train tickets:



1. order_date_meduim

order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,

2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]

order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3

3. order_ticketcount,ticketclass

order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax


I need to forecast the ticket sales for a week and also the ordercity with medium of booking.



As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.



I code in Python.










share|improve this question











$endgroup$





This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.


This question has not received enough attention.















  • $begingroup$
    Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
    $endgroup$
    – Aditya
    Mar 10 at 2:53
















1












$begingroup$


I am learning data science. I have the following dataset for train tickets:



1. order_date_meduim

order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,

2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]

order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3

3. order_ticketcount,ticketclass

order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax


I need to forecast the ticket sales for a week and also the ordercity with medium of booking.



As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.



I code in Python.










share|improve this question











$endgroup$





This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.


This question has not received enough attention.















  • $begingroup$
    Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
    $endgroup$
    – Aditya
    Mar 10 at 2:53














1












1








1


2



$begingroup$


I am learning data science. I have the following dataset for train tickets:



1. order_date_meduim

order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,

2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]

order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3

3. order_ticketcount,ticketclass

order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax


I need to forecast the ticket sales for a week and also the ordercity with medium of booking.



As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.



I code in Python.










share|improve this question











$endgroup$




I am learning data science. I have the following dataset for train tickets:



1. order_date_meduim

order,date,medium
95062,2017-09-11,35
171081,2017-07-05,39
122867,2017-08-18,39
107186,2017-11-23,
171085,2017-09-02,

2. order_ordercityA_ordercityB [some order has only 1 ordercity, I think ordercity means here which city is something like source and destination]

order,ordercityA,ordercityB
81773,4,11
105838,4,
76153,24,18
93058,12,
11623,24,3
3070,24,3

3. order_ticketcount,ticketclass

order,ticketcount,ticketclass
246783,1,pax
1693998,2,pax
1958576,1,other
673681,1,pax
1593899,1,pax
194035,1,pax


I need to forecast the ticket sales for a week and also the ordercity with medium of booking.



As I am new, could someone give a possible answer about how to create a prediction model that could predict the sales for 1 week? Also, I doubt the data is time-series data.



I code in Python.







python predictive-modeling prediction data-science-model






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Mar 9 at 23:40







Roshan Mehta

















asked Mar 9 at 23:27









Roshan MehtaRoshan Mehta

666




666






This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.


This question has not received enough attention.








This question has an open bounty worth +50
reputation from Roshan Mehta ending ending at 2019-03-19 07:15:53Z">tomorrow.


This question has not received enough attention.













  • $begingroup$
    Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
    $endgroup$
    – Aditya
    Mar 10 at 2:53

















  • $begingroup$
    Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
    $endgroup$
    – Aditya
    Mar 10 at 2:53
















$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53





$begingroup$
Yes it's timeseries; work on adding more features to the data which you think could help the model to strengthen its preds.add various statistics and groupby cols, check for holidays and all, weekends, hour etc, etc and I truly recommend not to jump into Time series first without having proper aspect of ML or hands on experience on various datasets/comps/hacks
$endgroup$
– Aditya
Mar 10 at 2:53











3 Answers
3






active

oldest

votes


















2












$begingroup$

You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.



What Is Time Series Forecasting?



You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )



After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:



  • Classical, Statistical

    • ARMA for stationary data

    • ARIMA for data with a trend - Refer

    • SARIMA for data with seasonality

    • Holt-Winters Forecasting - Refer

    • Theta method - Refer

    • Fourier Transformation - Refer


  • Machine Learning

    • Quantile Regression Forest(QRF)

    • Support Vector Regression(SVR)

    • Recurrent Neural Networks(RNNs) (LSTM)


If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer






share|improve this answer









$endgroup$












  • $begingroup$
    How to find out the ticketclass or so called categorical column using timeseries forecasting
    $endgroup$
    – Roshan Mehta
    2 days ago






  • 1




    $begingroup$
    This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
    $endgroup$
    – Sean Owen
    yesterday


















0












$begingroup$

Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.



Then, sort the data by date.



Next, visualize the data to see general trends and outliers.



The prophet package can estimate forecasts for time series data. A quick-start notebook is here.






share|improve this answer











$endgroup$












  • $begingroup$
    Which join, outer, inner or cartesian product?
    $endgroup$
    – Roshan Mehta
    Mar 12 at 17:25










  • $begingroup$
    Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
    $endgroup$
    – Brian Spiering
    Mar 12 at 17:42


















0












$begingroup$

The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?



You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.



If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.






share|improve this answer








New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$












  • $begingroup$
    I only added the 6 first rows. The dataset is quite big.
    $endgroup$
    – Roshan Mehta
    8 hours ago










Your Answer





StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47014%2fforecasting-ticket-sales-and-city%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























3 Answers
3






active

oldest

votes








3 Answers
3






active

oldest

votes









active

oldest

votes






active

oldest

votes









2












$begingroup$

You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.



What Is Time Series Forecasting?



You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )



After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:



  • Classical, Statistical

    • ARMA for stationary data

    • ARIMA for data with a trend - Refer

    • SARIMA for data with seasonality

    • Holt-Winters Forecasting - Refer

    • Theta method - Refer

    • Fourier Transformation - Refer


  • Machine Learning

    • Quantile Regression Forest(QRF)

    • Support Vector Regression(SVR)

    • Recurrent Neural Networks(RNNs) (LSTM)


If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer






share|improve this answer









$endgroup$












  • $begingroup$
    How to find out the ticketclass or so called categorical column using timeseries forecasting
    $endgroup$
    – Roshan Mehta
    2 days ago






  • 1




    $begingroup$
    This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
    $endgroup$
    – Sean Owen
    yesterday















2












$begingroup$

You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.



What Is Time Series Forecasting?



You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )



After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:



  • Classical, Statistical

    • ARMA for stationary data

    • ARIMA for data with a trend - Refer

    • SARIMA for data with seasonality

    • Holt-Winters Forecasting - Refer

    • Theta method - Refer

    • Fourier Transformation - Refer


  • Machine Learning

    • Quantile Regression Forest(QRF)

    • Support Vector Regression(SVR)

    • Recurrent Neural Networks(RNNs) (LSTM)


If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer






share|improve this answer









$endgroup$












  • $begingroup$
    How to find out the ticketclass or so called categorical column using timeseries forecasting
    $endgroup$
    – Roshan Mehta
    2 days ago






  • 1




    $begingroup$
    This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
    $endgroup$
    – Sean Owen
    yesterday













2












2








2





$begingroup$

You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.



What Is Time Series Forecasting?



You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )



After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:



  • Classical, Statistical

    • ARMA for stationary data

    • ARIMA for data with a trend - Refer

    • SARIMA for data with seasonality

    • Holt-Winters Forecasting - Refer

    • Theta method - Refer

    • Fourier Transformation - Refer


  • Machine Learning

    • Quantile Regression Forest(QRF)

    • Support Vector Regression(SVR)

    • Recurrent Neural Networks(RNNs) (LSTM)


If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer






share|improve this answer









$endgroup$



You have got yourself a time series forecasting problem. And with multiple input variables it is called multivariate time series forecasting.



What Is Time Series Forecasting?



You can start with EDA on your data and find out if you can see any trend or seasonality. ( You might need to add or update your current features to get underlying trend/seasonality )



After EDA, you can start looking into following models, all of them are the go-to for time series prediction problem:



  • Classical, Statistical

    • ARMA for stationary data

    • ARIMA for data with a trend - Refer

    • SARIMA for data with seasonality

    • Holt-Winters Forecasting - Refer

    • Theta method - Refer

    • Fourier Transformation - Refer


  • Machine Learning

    • Quantile Regression Forest(QRF)

    • Support Vector Regression(SVR)

    • Recurrent Neural Networks(RNNs) (LSTM)


If you are not comfortable with Statistics then I would advise you to start with LSTMs for forecasting - Refer







share|improve this answer












share|improve this answer



share|improve this answer










answered 2 days ago









PreetPreet

3335




3335











  • $begingroup$
    How to find out the ticketclass or so called categorical column using timeseries forecasting
    $endgroup$
    – Roshan Mehta
    2 days ago






  • 1




    $begingroup$
    This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
    $endgroup$
    – Sean Owen
    yesterday
















  • $begingroup$
    How to find out the ticketclass or so called categorical column using timeseries forecasting
    $endgroup$
    – Roshan Mehta
    2 days ago






  • 1




    $begingroup$
    This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
    $endgroup$
    – Sean Owen
    yesterday















$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago




$begingroup$
How to find out the ticketclass or so called categorical column using timeseries forecasting
$endgroup$
– Roshan Mehta
2 days ago




1




1




$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen
yesterday




$begingroup$
This is an exact duplicate of another answer you posted. Please customize the answer to the actual question. This is pretty broad as advice.
$endgroup$
– Sean Owen
yesterday











0












$begingroup$

Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.



Then, sort the data by date.



Next, visualize the data to see general trends and outliers.



The prophet package can estimate forecasts for time series data. A quick-start notebook is here.






share|improve this answer











$endgroup$












  • $begingroup$
    Which join, outer, inner or cartesian product?
    $endgroup$
    – Roshan Mehta
    Mar 12 at 17:25










  • $begingroup$
    Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
    $endgroup$
    – Brian Spiering
    Mar 12 at 17:42















0












$begingroup$

Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.



Then, sort the data by date.



Next, visualize the data to see general trends and outliers.



The prophet package can estimate forecasts for time series data. A quick-start notebook is here.






share|improve this answer











$endgroup$












  • $begingroup$
    Which join, outer, inner or cartesian product?
    $endgroup$
    – Roshan Mehta
    Mar 12 at 17:25










  • $begingroup$
    Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
    $endgroup$
    – Brian Spiering
    Mar 12 at 17:42













0












0








0





$begingroup$

Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.



Then, sort the data by date.



Next, visualize the data to see general trends and outliers.



The prophet package can estimate forecasts for time series data. A quick-start notebook is here.






share|improve this answer











$endgroup$



Generally, the first step in modeling is merging all separate datasets into a single dataset. It looks like the data can be joined on order as a key.



Then, sort the data by date.



Next, visualize the data to see general trends and outliers.



The prophet package can estimate forecasts for time series data. A quick-start notebook is here.







share|improve this answer














share|improve this answer



share|improve this answer








edited Mar 12 at 17:43

























answered Mar 12 at 17:05









Brian SpieringBrian Spiering

4,1281029




4,1281029











  • $begingroup$
    Which join, outer, inner or cartesian product?
    $endgroup$
    – Roshan Mehta
    Mar 12 at 17:25










  • $begingroup$
    Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
    $endgroup$
    – Brian Spiering
    Mar 12 at 17:42
















  • $begingroup$
    Which join, outer, inner or cartesian product?
    $endgroup$
    – Roshan Mehta
    Mar 12 at 17:25










  • $begingroup$
    Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
    $endgroup$
    – Brian Spiering
    Mar 12 at 17:42















$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25




$begingroup$
Which join, outer, inner or cartesian product?
$endgroup$
– Roshan Mehta
Mar 12 at 17:25












$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42




$begingroup$
Depends on how you want to handle duplicate and missing data. Here is more details jakevdp.github.io/PythonDataScienceHandbook/…
$endgroup$
– Brian Spiering
Mar 12 at 17:42











0












$begingroup$

The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?



You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.



If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.






share|improve this answer








New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$












  • $begingroup$
    I only added the 6 first rows. The dataset is quite big.
    $endgroup$
    – Roshan Mehta
    8 hours ago















0












$begingroup$

The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?



You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.



If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.






share|improve this answer








New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$












  • $begingroup$
    I only added the 6 first rows. The dataset is quite big.
    $endgroup$
    – Roshan Mehta
    8 hours ago













0












0








0





$begingroup$

The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?



You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.



If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.






share|improve this answer








New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






$endgroup$



The first step is to place all your variables on the same dataframe, so a date row would have a complete set of information for that specific date. The second step is to understand the data and make sure the data make sense. At this stage you have three different data set with a different number of rows. The exact same variable "order" has three different values across the three tables. Does that make sense?



You also have only 5 somewhat random data points. I think with just 5 random data points, you won't have enough information to develop a model that is anymore predictive than a simple average. Train ticket sales is most probably very highly seasonal (vacation, holidays, workweek, commuters, etc.). To do a good model I would venture you need at least a few years of data so you can observe the seasonal pattern from year to year.



If you are just learning data science, I may suggest start with another data set that is a heck of a lot larger than this. Here you have literally nothing to work with.







share|improve this answer








New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|improve this answer



share|improve this answer






New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









answered 9 hours ago









SympaSympa

1062




1062




New contributor




Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






Sympa is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











  • $begingroup$
    I only added the 6 first rows. The dataset is quite big.
    $endgroup$
    – Roshan Mehta
    8 hours ago
















  • $begingroup$
    I only added the 6 first rows. The dataset is quite big.
    $endgroup$
    – Roshan Mehta
    8 hours ago















$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago




$begingroup$
I only added the 6 first rows. The dataset is quite big.
$endgroup$
– Roshan Mehta
8 hours ago

















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47014%2fforecasting-ticket-sales-and-city%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Ружовы пелікан Змест Знешні выгляд | Пашырэнне | Асаблівасці біялогіі | Літаратура | НавігацыяДагледжаная версіяправерана1 зменаДагледжаная версіяправерана1 змена/ 22697590 Сістэматыкана ВіківідахВыявына Вікісховішчы174693363011049382

ValueError: Error when checking input: expected conv2d_13_input to have shape (3, 150, 150) but got array with shape (150, 150, 3)2019 Community Moderator ElectionError when checking : expected dense_1_input to have shape (None, 5) but got array with shape (200, 1)Error 'Expected 2D array, got 1D array instead:'ValueError: Error when checking input: expected lstm_41_input to have 3 dimensions, but got array with shape (40000,100)ValueError: Error when checking target: expected dense_1 to have shape (7,) but got array with shape (1,)ValueError: Error when checking target: expected dense_2 to have shape (1,) but got array with shape (0,)Keras exception: ValueError: Error when checking input: expected conv2d_1_input to have shape (150, 150, 3) but got array with shape (256, 256, 3)Steps taking too long to completewhen checking input: expected dense_1_input to have shape (13328,) but got array with shape (317,)ValueError: Error when checking target: expected dense_3 to have shape (None, 1) but got array with shape (7715, 40000)Keras exception: Error when checking input: expected dense_input to have shape (2,) but got array with shape (1,)

Illegal assignment from SObject to ContactFetching String, Id from Map - Illegal Assignment Id to Field / ObjectError: Compile Error: Illegal assignment from String to BooleanError: List has no rows for assignment to SObjectError on Test Class - System.QueryException: List has no rows for assignment to SObjectRemote action problemDML requires SObject or SObject list type error“Illegal assignment from List to List”Test Class Fail: Batch Class: System.QueryException: List has no rows for assignment to SObjectMapping to a user'List has no rows for assignment to SObject' Mystery