admin管理员组

文章数量:1346060

For a project I am using a sales dataset available here to make an ARIMA model to predict seasonal sales of their top 5 items. I am currently using a group by to build a table to plug into the auto_arima function but I'm having trouble getting it to work. I know what I want the result to look like but I'm having trouble getting there.


## df_comb is a union of the combined datasets on kaggle w/ year month added in as date

total_sales = df_comb.groupby(['Description','YearMonth'])['Quantity'].sum()

This returns

Description                          YearMonth
  DOORMAT UNION JACK GUNS AND ROSES  2010-01      150
                                     2010-02        5
                                     2010-03       14
 3 STRIPEY MICE FELTCRAFT            2009-12      261
                                     2010-01       64
                                                 ... 
ZINC WIRE SWEETHEART LETTER TRAY     2011-08        1
                                     2011-09        2
                                     2011-10        8
                                     2011-11        9
                                     2011-12       13
Name: Quantity, Length: 61768, dtype: int64

I have no idea where to go from here to get the top 5 and then for my ARIMA model do I plot each individually?

Am I thinking about this all wrong?

本文标签: pythonARIMA model on a group byStack Overflow