Dealing with time series data which is not continuous












$begingroup$


I have time series data in Python 3 as follows:



Date        Weekly_Sales
2010-05-02  3400
2010-05-02  5600
2010-05-02  4590
2010-05-02  5800
2010-05-12  2380
2010-05-12  6700
2010-05-12  3700


The time series is not continuous because there are multiple observations for the same date. I'm trying to forecast sales in Python using ARIMA, but my ACF and PACF plots show no correlation between the lags. Also, if I run the Dickey-Fuller test to check for stationarity, my system freezes.



How can I fix this?










$endgroup$












  • $begingroup$
    Possible duplicate of Forecasting non-negative sparse time-series data
    $endgroup$
    – Louis T
    16 hours ago
















python time-series






asked 18 hours ago









deathcode 666

1 Answer
$begingroup$

It looks like you have lost a bit of information in that dataset. You shouldn't have four measurements for one time step for one variable. How do you know which of the first four rows to use for 2010-05-02?
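A quick way to confirm the problem is to count how many rows share each date. The snippet below is only a minimal sketch using the sample rows from the question and the standard library; the variable names `rows`, `counts` and `duplicated_dates` are illustrative and not from the original post.

```python
from collections import Counter

# Sample rows copied from the question: (date, weekly sales)
rows = [
    ("2010-05-02", 3400),
    ("2010-05-02", 5600),
    ("2010-05-02", 4590),
    ("2010-05-02", 5800),
    ("2010-05-12", 2380),
    ("2010-05-12", 6700),
    ("2010-05-12", 3700),
]

# Count how many observations share each date
counts = Counter(date for date, _ in rows)

# Any date that appears more than once cannot act as a unique time index
duplicated_dates = {date: n for date, n in counts.items() if n > 1}
print(duplicated_dates)
```

If this dictionary is not empty, the date column alone does not identify a single observation per time step.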



I would suggest checking your data source, or else working out what the four values mean. Is there other information that distinguishes them?



How are you even creating lags on that Date index? Could you take the average over each day?
Depending on the package you use for your Dickey-Fuller test (and other methods), it might not be designed to handle identical time steps as input, which could explain why the session crashes.
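If averaging per date is acceptable for your problem, one possible way to do it is with a pandas `groupby`. This is a sketch under the assumption that the data sits in a DataFrame with `Date` and `Weekly_Sales` columns, as the question's listing suggests; whether the mean (rather than, say, the sum) is the right aggregate depends on what the repeated rows represent.

```python
import pandas as pd

# Sample rows copied from the question
df = pd.DataFrame({
    "Date": pd.to_datetime([
        "2010-05-02", "2010-05-02", "2010-05-02", "2010-05-02",
        "2010-05-12", "2010-05-12", "2010-05-12",
    ]),
    "Weekly_Sales": [3400, 5600, 4590, 5800, 2380, 6700, 3700],
})

# Collapse repeated dates into one value per date.
# Assumption: the mean is a sensible summary; .sum() is the obvious alternative.
series = df.groupby("Date")["Weekly_Sales"].mean().sort_index()
print(series)

# The index is now unique, so lags are well defined
print(series.index.is_unique)
```

A series with one value per date can then be handed to a stationarity test such as `adfuller` from `statsmodels.tsa.stattools`, although the two dates in this sample are far too few for the test to be meaningful.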






$endgroup$













  • $begingroup$
    It's cross-sectional panel data. When I run the Dickey-Fuller test, the system freezes, and the same thing happens when I run ARIMA.
    $endgroup$
    – deathcode 666
    16 hours ago
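If the data really is a panel, each unit needs its own series before ARIMA or a Dickey-Fuller test makes sense. The sketch below assumes there is an identifier column for the unit; the name `Store` and the labels A to D are hypothetical and made up for illustration, because the question does not show such a column.

```python
import pandas as pd

# "Store" is a HYPOTHETICAL identifier column with made-up labels;
# the dates and sales values are the sample rows from the question.
df = pd.DataFrame({
    "Store": ["A", "B", "C", "D", "A", "B", "C"],
    "Date": pd.to_datetime(["2010-05-02"] * 4 + ["2010-05-12"] * 3),
    "Weekly_Sales": [3400, 5600, 4590, 5800, 2380, 6700, 3700],
})

# One row per date, one column per unit: each column is its own time series
wide = df.pivot(index="Date", columns="Store", values="Weekly_Sales")
print(wide)

# Each column can now be analysed separately, dropping dates a unit lacks
for store, series in wide.items():
    print(store, series.dropna().tolist())
```

With this layout every column has a unique date index, so a model or test can be run per unit instead of on the stacked rows.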











answered 16 hours ago









n1k31t4
