mmc2015

Time Series Forecasting with the Long Short-Term Memory Network in Python

http://machinelearningmastery.com/time-series-forecasting-long-short-term-memory-network-python/

by Jason Brownlee on April 7, 2017 in Deep Learning

The Long Short-Term Memory recurrent neural network has the promise of learning long sequences of observations.

It seems a perfect match for time series forecasting, and in fact, it may be.

In this tutorial, you will discover how to develop an LSTM forecast model for a one-step univariate time series forecasting problem.

After completing this tutorial, you will know:

How to develop a baseline of performance for a forecast problem.
How to design a robust test harness for one-step time series forecasting.
How to prepare data, develop, and evaluate an LSTM recurrent neural network for time series forecasting.

Let’s get started.

Time Series Forecasting with the Long Short-Term Memory Network in Python
Photo by Matt MacGillivray, some rights reserved.

Tutorial Overview

This is a big topic and we are going to cover a lot of ground. Strap in.

This tutorial is broken down into 9 parts; they are:

Shampoo Sales Dataset
Test Setup
Persistence Model Forecast
LSTM Data Preparation
LSTM Model Development
LSTM Forecast
Complete LSTM Example
Develop a Robust Result
Tutorial Extensions

Python Environment

This tutorial assumes you have a Python SciPy environment installed. You can use either Python 2 or 3 with this tutorial.

You must have Keras (2.0 or higher) installed with either the TensorFlow or Theano backend.

The tutorial also assumes you have scikit-learn, Pandas, NumPy and Matplotlib installed.

If you need help with your environment, see this post:

How to Setup a Python Environment for Machine Learning and Deep Learning with Anaconda

Shampoo Sales Dataset

This dataset describes the monthly number of sales of shampoo over a 3-year period.

The units are a sales count and there are 36 observations. The original dataset is credited to Makridakis, Wheelwright, and Hyndman (1998).

You can download and learn more about the dataset here.

Download the dataset to your current working directory with the name “shampoo-sales.csv“. Note that you may need to delete the footer information added by DataMarket.

The example below loads and creates a plot of the loaded dataset.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
           9 
         
           10 
         
           11 
         
           12 
         
           13 
         
          # load and plot dataset 
         
          from  
          pandas  
          import  
          read_csv 
         
          from  
          pandas  
          import  
          datetime 
         
          from  
          matplotlib  
          import  
          pyplot 
         
          # load dataset 
         
          def  
          parser 
          ( 
          x 
          ) 
          : 
         
          return 
            
          datetime 
          . 
          strptime 
          ( 
          '190' 
          + 
          x 
          , 
            
          '%Y-%m' 
          ) 
         
          series 
            
          = 
            
          read_csv 
          ( 
          'shampoo-sales.csv' 
          , 
            
          header 
          = 
          0 
          , 
            
          parse_dates 
          = 
          [ 
          0 
          ] 
          , 
            
          index_col 
          = 
          0 
          , 
            
          squeeze 
          = 
          True 
          , 
            
          date_parser 
          = 
          parser 
          ) 
         
          # summarize first few rows 
         
          print 
          ( 
          series 
          . 
          head 
          ( 
          ) 
          ) 
         
          # line plot 
         
          series 
          . 
          plot 
          ( 
          ) 
         
          pyplot 
          . 
          show 
          ( 
          )

Running the example loads the dataset as a Pandas Series and prints the first 5 rows.

A line plot of the series is then created showing a clear increasing trend.

Line Plot of Monthly Shampoo Sales Dataset

Experimental Test Setup

We will split the Shampoo Sales dataset into two parts: a training and a test set.

The first two years of data will be taken for the training dataset and the remaining one year of data will be used for the test set.

For example:

 
           1 
         
           2 
         
           3 
         
          # split data into train and test 
         
          X 
            
          = 
            
          series 
          . 
          values 
         
          train 
          , 
            
          test 
            
          = 
            
          X 
          [ 
          0 
          : 
          - 
          12 
          ] 
          , 
            
          X 
          [ 
          - 
          12 
          : 
          ]

Models will be developed using the training dataset and will make predictions on the test dataset.

A rolling forecast scenario will be used, also called walk-forward model validation.

Each time step of the test dataset will be walked one at a time. A model will be used to make a forecast for the time step, then the actual expected value from the test set will be taken and made available to the model for the forecast on the next time step.

For example:

This mimics a real-world scenario where new Shampoo Sales observations would be available each month and used in the forecasting of the following month.

Finally, all forecasts on the test dataset will be collected and an error score calculated to summarize the skill of the model. The root mean squared error (RMSE) will be used as it punishes large errors and results in a score that is in the same units as the forecast data, namely monthly shampoo sales.

For example:

 
           1 
         
           2 
         
           3 
         
          from  
          sklearn 
          . 
          metrics  
          import  
          mean_squared_error 
         
          rmse 
            
          = 
            
          sqrt 
          ( 
          mean_squared_error 
          ( 
          test 
          , 
            
          predictions 
          ) 
          ) 
         
          print 
          ( 
          'RMSE: %.3f' 
            
          % 
            
          rmse 
          )

Persistence Model Forecast

A good baseline forecast for a time series with a linear increasing trend is a persistence forecast.

The persistence forecast is where the observation from the prior time step (t-1) is used to predict the observation at the current time step (t).

We can implement this by taking the last observation from the training data and history accumulated by walk-forward validation and using that to predict the current time step.

For example:

We will accumulate all predictions in an array so that they can be directly compared to the test dataset.

The complete example of the persistence forecast model on the Shampoo Sales dataset is listed below.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
           9 
         
           10 
         
           11 
         
           12 
         
           13 
         
           14 
         
           15 
         
           16 
         
           17 
         
           18 
         
           19 
         
           20 
         
           21 
         
           22 
         
           23 
         
           24 
         
           25 
         
           26 
         
           27 
         
          from  
          pandas  
          import  
          read_csv 
         
          from  
          pandas  
          import  
          datetime 
         
          from  
          sklearn 
          . 
          metrics  
          import  
          mean_squared_error 
         
          from  
          math  
          import  
          sqrt 
         
          from  
          matplotlib  
          import  
          pyplot 
         
          # load dataset 
         
          def  
          parser 
          ( 
          x 
          ) 
          : 
         
          return 
            
          datetime 
          . 
          strptime 
          ( 
          '190' 
          + 
          x 
          , 
            
          '%Y-%m' 
          ) 
         
          series 
            
          = 
            
          read_csv 
          ( 
          'shampoo-sales.csv' 
          , 
            
          header 
          = 
          0 
          , 
            
          parse_dates 
          = 
          [ 
          0 
          ] 
          , 
            
          index_col 
          = 
          0 
          , 
            
          squeeze 
          = 
          True 
          , 
            
          date_parser 
          = 
          parser 
          ) 
         
          # split data into train and test 
         
          X 
            
          = 
            
          series 
          . 
          values 
         
          train 
          , 
            
          test 
            
          = 
            
          X 
          [ 
          0 
          : 
          - 
          12 
          ] 
          , 
            
          X 
          [ 
          - 
          12 
          : 
          ] 
         
          # walk-forward validation 
         
          history 
            
          = 
            
          [ 
          x 
            
          for 
            
          x 
            
          in 
            
          train 
          ] 
         
          predictions 
            
          = 
            
          list 
          ( 
          ) 
         
          for 
            
          i 
            
          in 
            
          range 
          ( 
          len 
          ( 
          test 
          ) 
          ) 
          : 
         
          # make prediction 
         
          predictions 
          . 
          append 
          ( 
          history 
          [ 
          - 
          1 
          ] 
          ) 
         
          # observation 
         
          history 
          . 
          append 
          ( 
          test 
          [ 
          i 
          ] 
          ) 
         
          # report performance 
         
          rmse 
            
          = 
            
          sqrt 
          ( 
          mean_squared_error 
          ( 
          test 
          , 
            
          predictions 
          ) 
          ) 
         
          print 
          ( 
          'RMSE: %.3f' 
            
          % 
            
          rmse 
          ) 
         
          # line plot of observed vs predicted 
         
          pyplot 
          . 
          plot 
          ( 
          test 
          ) 
         
          pyplot 
          . 
          plot 
          ( 
          predictions 
          ) 
         
          pyplot 
          . 
          show 
          ( 
          )

Running the example prints the RMSE of about 136 monthly shampoo sales for the forecasts on the test dataset.

A line plot of the test dataset (blue) compared to the predicted values (orange) is also created showing the persistence model forecast in context.

Persistence Forecast of Observed vs Predicted for Shampoo Sales Dataset

For more on the persistence model for time series forecasting, see this post:

How to Make Baseline Predictions for Time Series Forecasting with Python

Now that we have a baseline of performance on the dataset, we can get started developing an LSTM model for the data.

LSTM Data Preparation

Before we can fit an LSTM model to the dataset, we must transform the data.

This section is broken down into three steps:

Transform the time series into a supervised learning problem
Transform the time series data so that it is stationary.
Transform the observations to have a specific scale.

Transform Time Series to Supervised Learning

The LSTM model in Keras assumes that your data is divided into input (X) and output (y) components.

For a time series problem, we can achieve this by using the observation from the last time step (t-1) as the input and the observation at the current time step (t) as the output.

We can achieve this using the shift() function in Pandas that will push all values in a series down by a specified number places. We require a shift of 1 place, which will become the input variables. The time series as it stands will be the output variables.

We can then concatenate these two series together to create a DataFrame ready for supervised learning. The pushed-down series will have a new position at the top with no value. A NaN (not a number) value will be used in this position. We will replace these NaN values with 0 values, which the LSTM model will have to learn as “the start of the series” or “I have no data here,” as a month with zero sales on this dataset has not been observed.

The code below defines a helper function to do this called timeseries_to_supervised(). It takes a NumPy array of the raw time series data and a lag or number of shifted series to create and use as inputs.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
          # frame a sequence as a supervised learning problem 
         
          def  
          timeseries_to_supervised 
          ( 
          data 
          , 
            
          lag 
          = 
          1 
          ) 
          : 
         
          df 
            
          = 
            
          DataFrame 
          ( 
          data 
          ) 
         
          columns 
            
          = 
            
          [ 
          df 
          . 
          shift 
          ( 
          i 
          ) 
            
          for 
            
          i 
            
          in 
            
          range 
          ( 
          1 
          , 
            
          lag 
          + 
          1 
          ) 
          ] 
         
          columns 
          . 
          append 
          ( 
          df 
          ) 
         
          df 
            
          = 
            
          concat 
          ( 
          columns 
          , 
            
          axis 
          = 
          1 
          ) 
         
          df 
          . 
          fillna 
          ( 
          0 
          , 
            
          inplace 
          = 
          True 
          ) 
         
          return 
            
          df

We can test this function with our loaded Shampoo Sales dataset and convert it into a supervised learning problem.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
           9 
         
           10 
         
           11 
         
           12 
         
           13 
         
           14 
         
           15 
         
           16 
         
           17 
         
           18 
         
           19 
         
           20 
         
           21 
         
           22 
         
          from  
          pandas  
          import  
          read_csv 
         
          from  
          pandas  
          import  
          datetime 
         
          from  
          pandas  
          import  
          DataFrame 
         
          from  
          pandas  
          import  
          concat 
         
          # frame a sequence as a supervised learning problem 
         
          def  
          timeseries_to_supervised 
          ( 
          data 
          , 
            
          lag 
          = 
          1 
          ) 
          : 
         
          df 
            
          = 
            
          DataFrame 
          ( 
          data 
          ) 
         
          columns 
            
          = 
            
          [ 
          df 
          . 
          shift 
          ( 
          i 
          ) 
            
          for 
            
          i 
            
          in 
            
          range 
          ( 
          1 
          , 
            
          lag 
          + 
          1 
          ) 
          ] 
         
          columns 
          . 
          append 
          ( 
          df 
          ) 
         
          df 
            
          = 
            
          concat 
          ( 
          columns 
          , 
            
          axis 
          = 
          1 
          ) 
         
          df 
          . 
          fillna 
          ( 
          0 
          , 
            
          inplace 
          = 
          True 
          ) 
         
          return 
            
          df 
         
          # load dataset 
         
          def  
          parser 
          ( 
          x 
          ) 
          : 
         
          return 
            
          datetime 
          . 
          strptime 
          ( 
          '190' 
          + 
          x 
          , 
            
          '%Y-%m' 
          ) 
         
          series 
            
          = 
            
          read_csv 
          ( 
          'shampoo-sales.csv' 
          , 
            
          header 
          = 
          0 
          , 
            
          parse_dates 
          = 
          [ 
          0 
          ] 
          , 
            
          index_col 
          = 
          0 
          , 
            
          squeeze 
          = 
          True 
          , 
            
          date_parser 
          = 
          parser 
          ) 
         
          # transform to supervised learning 
         
          X 
            
          = 
            
          series 
          . 
          values 
         
          supervised 
            
          = 
            
          timeseries_to_supervised 
          ( 
          X 
          , 
            
          1 
          ) 
         
          print 
          ( 
          supervised 
          . 
          head 
          ( 
          ) 
          )

Running the example prints the first 5 rows of the new supervised learning problem.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
                       0           0 
         
           0    0.000000  266.000000 
         
           1  266.000000  145.899994 
         
           2  145.899994  183.100006 
         
           3  183.100006  119.300003 
         
           4  119.300003  180.300003

For more information on transforming a time series problem into a supervised learning problem, see the post:

Time Series Forecasting as Supervised Learning

Transform Time Series to Stationary

The Shampoo Sales dataset is not stationary.

This means that there is a structure in the data that is dependent on the time. Specifically, there is an increasing trend in the data.

Stationary data is easier to model and will very likely result in more skillful forecasts.

The trend can be removed from the observations, then added back to forecasts later to return the prediction to the original scale and calculate a comparable error score.

A standard way to remove a trend is by differencing the data. That is the observation from the previous time step (t-1) is subtracted from the current observation (t). This removes the trend and we are left with a difference series, or the changes to the observations from one time step to the next.

We can achieve this automatically using the diff() function in pandas. Alternatively, we can get finer grained control and write our own function to do this, which is preferred for its flexibility in this case.

Below is a function called difference() that calculates a differenced series. Note that the first observation in the series is skipped as there is no prior observation with which to calculate a differenced value.

We also need to invert this process in order to take forecasts made on the differenced series back into their original scale.

The function below, called inverse_difference(), inverts this operation.

 
           1 
         
           2 
         
           3 
         
          # invert differenced value 
         
          def  
          inverse_difference 
          ( 
          history 
          , 
            
          yhat 
          , 
            
          interval 
          = 
          1 
          ) 
          : 
         
          return 
            
          yhat 
            
          + 
            
          history 
          [ 
          - 
          interval 
          ]

We can test out these functions by differencing the whole series, then returning it to the original scale, as follows:

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
           9 
         
           10 
         
           11 
         
           12 
         
           13 
         
           14 
         
           15 
         
           16 
         
           17 
         
           18 
         
           19 
         
           20 
         
           21 
         
           22 
         
           23 
         
           24 
         
           25 
         
           26 
         
           27 
         
           28 
         
           29 
         
           30 
         
           31 
         
          from  
          pandas  
          import  
          read_csv 
         
          from  
          pandas  
          import  
          datetime 
         
          from  
          pandas  
          import  
          Series 
         
          # create a differenced series 
         
          def  
          difference 
          ( 
          dataset 
          , 
            
          interval 
          = 
          1 
          ) 
          : 
         
          diff 
            
          = 
            
          list 
          ( 
          ) 
         
          for 
            
          i 
            
          in 
            
          range 
          ( 
          interval 
          , 
            
          len 
          ( 
          dataset 
          ) 
          ) 
          : 
         
          value 
            
          = 
            
          dataset 
          [ 
          i 
          ] 
            
          - 
            
          dataset 
          [ 
          i 
            
          - 
            
          interval 
          ] 
         
          diff 
          . 
          append 
          ( 
          value 
          ) 
         
          return 
            
          Series 
          ( 
          diff 
          ) 
         
          # invert differenced value 
         
          def  
          inverse_difference 
          ( 
          history 
          , 
            
          yhat 
          , 
            
          interval 
          = 
          1 
          ) 
          : 
         
          return 
            
          yhat 
            
          + 
            
          history 
          [ 
          - 
          interval 
          ] 
         
          # load dataset 
         
          def  
          parser 
          ( 
          x 
          ) 
          : 
         
          return 
            
          datetime 
          . 
          strptime 
          ( 
          '190' 
          + 
          x 
          , 
            
          '%Y-%m' 
          ) 
         
          series 
            
          = 
            
          read_csv 
          ( 
          'shampoo-sales.csv' 
          , 
            
          header 
          = 
          0 
          , 
            
          parse_dates 
          = 
          [ 
          0 
          ] 
          , 
            
          index_col 
          = 
          0 
          , 
            
          squeeze 
          = 
          True 
          , 
            
          date_parser 
          = 
          parser 
          ) 
         
          print 
          ( 
          series 
          . 
          head 
          ( 
          ) 
          ) 
         
          # transform to be stationary 
         
          differenced 
            
          = 
            
          difference 
          ( 
          series 
          , 
            
          1 
          ) 
         
          print 
          ( 
          differenced 
          . 
          head 
          ( 
          ) 
          ) 
         
          # invert transform 
         
          inverted 
            
          = 
            
          list 
          ( 
          ) 
         
          for 
            
          i 
            
          in 
            
          range 
          ( 
          len 
          ( 
          differenced 
          ) 
          ) 
          : 
         
          value 
            
          = 
            
          inverse_difference 
          ( 
          series 
          , 
            
          differenced 
          [ 
          i 
          ] 
          , 
            
          len 
          ( 
          series 
          ) 
          - 
          i 
          ) 
         
          inverted 
          . 
          append 
          ( 
          value 
          ) 
         
          inverted 
            
          = 
            
          Series 
          ( 
          inverted 
          ) 
         
          print 
          ( 
          inverted 
          . 
          head 
          ( 
          ) 
          )

Running the example prints the first 5 rows of the loaded data, then the first 5 rows of the differenced series, then finally the first 5 rows with the difference operation inverted.

Note that the first observation in the original dataset was removed from the inverted difference data. Besides that, the last set of data matches the first as expected.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
           9 
         
           10 
         
           11 
         
           12 
         
           13 
         
           14 
         
           15 
         
           16 
         
           17 
         
           18 
         
           19 
         
           20 
         
           21 
         
           Month 
         
           1901-01-01    266.0 
         
           1901-02-01    145.9 
         
           1901-03-01    183.1 
         
           1901-04-01    119.3 
         
           1901-05-01    180.3 
         
           Name: Sales, dtype: float64 
         
           0   -120.1 
         
           1     37.2 
         
           2    -63.8 
         
           3     61.0 
         
           4    -11.8 
         
           dtype: float64 
         
           0    145.9 
         
           1    183.1 
         
           2    119.3 
         
           3    180.3 
         
           4    168.5 
         
           dtype: float64

For more information on making the time series stationary and differencing, see the posts:

How to Check if Time Series Data is Stationary with Python
How to Difference a Time Series Dataset with Python

Transform Time Series to Scale

Like other neural networks, LSTMs expect data to be within the scale of the activation function used by the network.

The default activation function for LSTMs is the hyperbolic tangent (tanh), which outputs values between -1 and 1. This is the preferred range for the time series data.

To make the experiment fair, the scaling coefficients (min and max) values must be calculated on the training dataset and applied to scale the test dataset and any forecasts. This is to avoid contaminating the experiment with knowledge from the test dataset, which might give the model a small edge.

We can transform the dataset to the range [-1, 1] using the MinMaxScaler class. Like other scikit-learn transform classes, it requires data provided in a matrix format with rows and columns. Therefore, we must reshape our NumPy arrays before transforming.

For example:

Again, we must invert the scale on forecasts to return the values back to the original scale so that the results can be interpreted and a comparable error score can be calculated.

 
           1 
         
           2 
         
          # invert transform 
         
          inverted_X 
            
          = 
            
          scaler 
          . 
          inverse_transform 
          ( 
          scaled_X 
          )

Putting all of this together, the example below transforms the scale of the Shampoo Sales data.

 
   
 
     
      
       
           1 
         

           2 
         

           3 
         

           4 
         

           5 
         

           6 
         

           7 
         

           8 
         

           9 
         

           10 
         

           11 
         

           12 
         

           13 
         

           14 
         

           15 
         

           16 
         

           17 
         

           18 
         

           19 
         

           20 
         

           21 
         
 
        
          from  
          pandas  
          import  
          read_csv 
         
 
          from  
          pandas  
          import  
          datetime 
         
 
          from  
          pandas  
          import  
          Series 
         
 
          from  
          sklearn 
          . 
          preprocessing  
          import  
          MinMaxScaler 
         
 
          # load dataset 
         
 
          def  
          parser 
          ( 
          x 
          ) 
          : 
         
 
           
          return 
            
          datetime 
          . 
          strptime 
          ( 
          '190' 
          + 
          x 
          , 
            
          '%Y-%m' 
          ) 
         
 
          series 
            
          = 
            
          read_csv 
          ( 
          'shampoo-sales.csv' 
          , 
            
          header 
          = 
          0 
          , 
            
          parse_dates 
          = 
          [ 
          0 
          ] 
          , 
            
          index_col 
          = 
          0 
          , 
            
          squeeze 
          = 
          True 
          , 
            
          date_parser 
          = 
          parser 
          ) 
         
 
          print 
          ( 
          series 
          . 
          head 
          ( 
          ) 
          ) 
         
 
          # transform scale 
         
 
          X 
            
          = 
            
          series 
          . 
          values 
         
 
          X 
            
          = 
            
          X 
          . 
          reshape 
          ( 
          len 
          ( 
          X 
          ) 
          , 
            
          1 
          ) 
         
 
          scaler 
            
          = 
            
          MinMaxScaler 
          ( 
          feature_range 
          = 
          ( 
          - 
          1 
          , 
            
          1 
          ) 
          ) 
         
 
          scaler 
            
          = 
            
          scaler 
          . 
          fit 
          ( 
          X 
          ) 
         
 
          scaled_X 
            
          = 
            
          scaler 
          . 
          transform 
          ( 
          X 
          ) 
         
 
          scaled_series 
            
          = 
            
          Series 
          ( 
          scaled_X 
          [ 
          : 
          , 
            
          0 
          ] 
          ) 
         
 
          print 
          ( 
          scaled_series 
          . 
          head 
          ( 
          ) 
          ) 
         
 
          # invert transform 
         
 
          inverted_X 
            
          = 
            
          scaler 
          . 
          inverse_transform 
          ( 
          scaled_X 
          ) 
         
 
          inverted_series 
            
          = 
            
          Series 
          ( 
          inverted_X 
          [ 
          : 
          , 
            
          0 
          ] 
          ) 
         
 
          print 
          ( 
          inverted_series 
          . 
          head 
          ( 
          ) 
          ) 
         
 
      
 
     
   

Running the example first prints the first 5 rows of the loaded data, then the first 5 rows of the scaled data, then the first 5 rows with the scale transform inverted, matching the original data.

 
           1 
         
           2 
         
           3 
         
           4 
         
           5 
         
           6 
         
           7 
         
           8 
         
           9 
         
           10 
         
           11 
         
           12 
         
           13 
         
           14 
         
           15 
         
           16 
         
           17 
         
           18 
         
           19 
         
           20 
         
           21 
         
           Month 
         
           1901-01-01    266.0 
         
           1901-02-01    145.9 
         
           1901-03-01    183.1 
         
           1901-04-01    119.3 
         
           1901-05-01    180.3 
         
           Name: Sales, dtype: float64 
         
           0   -0.478585 
         
           1   -0.905456 
         
           2   -0.773236 
         
           3   -1.000000 
         
           4   -0.783188 
         
           dtype: float64 
         
           0    266.0 
         
           1    145.9 
         
           2    183.1 
         
           3    119.3 
         
           4    180.3 
         
           dtype: float64

Now that we know how to prepare data for the LSTM network, we can start developing our model.

LSTM Model Development

The Long Short-Term Memory network (LSTM) is a type of Recurrent Neural Network (RNN).

A benefit of this type of network is that it can learn and remember over long sequences and does not rely on a pre-specified window lagged observation as input.

In Keras, this is referred to as stateful, and involves setting the “stateful” argument to “True” when defining an LSTM layer.

By default, an LSTM layer in Keras maintains state between data within one batch. A batch of data is a fixed-sized number of rows from the training dataset that defines how many patterns to process before updating the weights of the network. State in the LSTM layer between batches is cleared by default, therefore we must make the LSTM stateful. This gives us fine-grained control over when state of the LSTM layer is cleared, by calling the reset_states() function.

The LSTM layer expects input to be in a matrix with the dimensions: [samples, time steps, features].

Samples: These are independent observations from the domain, typically rows of data.
Time steps: These are separate time steps of a given variable for a given observation.
Features: These are separate measures observed at the time of observation.

We have some flexibility in how the Shampoo Sales dataset is framed for the network. We will keep it simple and frame the problem as each time step in the original sequence is one separate sample, with one timestep and one feature.

Given that the training dataset is defined as X inputs and y outputs, it must be reshaped into the Samples/TimeSteps/Features format, for example:

The shape of the input data must be specified in the LSTM layer using the “batch_input_shape” argument as a tuple that specifies the expected number of observations to read each batch, the number of time steps, and the number of features.

The batch size is often much smaller than the total number of samples. It, along with the number of epochs, defines how quickly the network learns the data (how often the weights are updated).

The final import parameter in defining the LSTM layer is the number of neurons, also called the number of memory units or blocks. This is a reasonably simple problem and a number between 1 and 5 should be sufficient.

The line below creates a single LSTM hidden layer that also specifies the expectations of the input layer via the “batch_input_shape” argument.

 
   
 
     
      
       
           1 
         
 
        
          layer 
            
          = 
            
          LSTM 
          ( 
          neurons 
          , 
            
          batch_input_shape 
          = 
          ( 
          batch_size 
          , 
            
          X 
          . 
          shape 
          [ 
          1 
          ] 
          , 
            
          X 
          . 
          shape 
          [ 
          2 
          ] 
          ) 
          , 
            
          stateful 
          = 
          True 
          ) 
         
 
      
 
     
   

The network requires a single neuron in the output layer with a linear activation to predict the number of shampoo sales at the next time step.

Once the network is specified, it must be compiled into an efficient symbolic representation using a backend mathematical library, such as TensorFlow or Theano.

In compiling the network, we must specify a loss function and optimization algorithm. We will use “mean_squared_error” as the loss function as it closely matches RMSE that we will are interested in, and the efficient ADAM optimization algorithm.

Using the Sequential Keras API to define the network, the below snippet creates and compiles the network.

Once compiled, it can be fit to the training data. Because the network is stateful, we must control when the internal state is reset. Therefore, we must manually manage the training process one epoch at a time across the desired number of epochs.

By default, the samples within an epoch are shuffled prior to being exposed to the network. Again, this is undesirable for the LSTM because we want the network to build up state as it learns across the sequence of observations. We can disable the shuffling of samples by setting “shuffle” to “False“.

Also by default, the network reports a lot of debug information about the learning progress and skill of the model at the end of each epoch. We can disable this by setting the “verbose” argument to the level of “0“.

We can then reset the internal state at the end of the training epoch, ready for the next training iteration.

Below is a loop that manually fits the network to the training data.

 
           1 
         
           2 
         
           3 
         
          for 
            
          i 
            
          in 
            
          range 
          ( 
          nb_epoch 
          ) 
          : 
         
          model 
          . 
          fit 
          ( 
          X 
          , 
            
          y 
          , 
            
          epochs 
          = 
          1 
          , 
            
          batch_size 
          = 
          batch_size 
          , 
            
          verbose 
          = 
          0 
          , 
            
          shuffle 
          = 
          False 
          ) 
         
          model 
          . 
          reset_states 
          ( 
          )

Putting this all together, we can define a function called fit_lstm() that trains and returns an LSTM model. As arguments, it takes the training dataset in a supervised learning format, a batch size, a number of epochs, and a number of neurons.

 
   
 
     
      
       
           1 
         

           2 
         

           3 
         

           4 
         

           5 
         

           6 
         

           7 
         

           8 
         

           9 
         

           10 
         

           11 
         
 
        
          def  
          fit_lstm 
          ( 
          train 
          , 
            
          batch_size 
          , 
            
          nb_epoch 
          , 
            
          neurons 
          ) 
          : 
         
 
           
          X 
          , 
            
          y 
            
          = 
            
          train 
          [ 
          : 
          , 
            
          0 
          : 
          - 
          1 
          ] 
          , 
            
          train 
          [ 
          : 
          , 
            
          - 
          1 
          ] 
         
 
           
          X 
            
          = 
            
          X 
          . 
          reshape 
          ( 
          X 
          . 
          shape 
          [ 
          0 
          ] 
          , 
            
          1 
          , 
            
          X 
          . 
          shape 
          [ 
          1 
          ] 
          ) 
         
 
           
          model 
            
          = 
            
          Sequential 
          ( 
          ) 
         
 
           
          model 
          . 
          add 
          ( 
          LSTM 
          ( 
          neurons 
          , 
            
          batch_input_shape 
          = 
          ( 
          batch_size 
          , 
            
          X 
          . 
          shape 
          [ 
          1 
          ] 
          , 
            
          X 
          . 
          shape 
          [ 
          2 
          ] 
          ) 
          , 
            
          stateful 
          = 
          True 
          ) 
          ) 
         
 
           
          model 
          . 
          add 
          ( 
          Dense 
          ( 
          1 
          ) 
          ) 
         
 
           
          model 
          . 
          compile 
          ( 
          loss 
          = 
          'mean_squared_error' 
          , 
            
          optimizer 
          = 
          'adam' 
          ) 
         
 
           
          for 
            
          i 
            
          in 
            
          range 
          ( 
          nb_epoch 
          ) 
          : 
         
 
           
          model 
          . 
          fit 
          ( 
          X 
          , 
            
          y 
          , 
            
          epochs 
          = 
          1 
          , 
            
          batch_size 
          = 
          batch_size 
          , 
            
          verbose 
          = 
          0 
          , 
            
          shuffle 
          = 
          False 
          ) 
         
 
           
          model 
          . 
          reset_states 
          ( 
          ) 
         
 
           
          return 
            
          model 
         
 
      
 
     
   

The batch_size must be set to 1. This is because it must be a factor of the size of the training and test datasets.

The predict() function on the model is also constrained by the batch size; there it must be set to 1 because we are interested in making one-step forecasts on the test data.

We will not tune the network parameters in this tutorial; instead we will use the following configuration, found with a little trial and error:

Batch Size: 1
Epochs: 3000
Neurons: 4

As an extension to this tutorial, you might like to explore different model parameters and see if you can improve performance.

Update: Consider trying 1500 epochs and 1 neuron, the performance may be better!

Next, we will look at how we can use a fit LSTM model to make a one-step forecast.

LSTM Forecast

Once the LSTM model is fit to the training data, it can be used to make forecasts.

Again, we have some flexibility. We can decide to fit the model once on all of the training data, then predict each new time step one at a time from the test data (we’ll call this the fixed approach), or we can re-fit the model or update the model each time step of the test data as new observations from the test data are made available (we’ll call this the dynamic approach).

In this tutorial, we will go with the fixed approach for its simplicity, although, we would expect the dynamic approach to result in better model skill.

To make a forecast, we can call the predict() function on the model. This requires a 3D NumPy array input as an argument. In this case, it will be an array of one value, the observation at the previous time step.

The predict() function returns an array of predictions, one for each input row provided. Because we are providing a single input, the output will be a 2D NumPy array with one value.

We can capture this behavior in a function named forecast() listed below. Given a fit model, a batch-size used when fitting the model (e.g. 1), and a row from the test data, the function will separate out the input data from the test row, reshape it, and return the prediction as a single floating point value.

During training, the internal state is reset after each epoch. While forecasting, we will not want to reset the internal state between forecasts. In fact, we would like the model to build up state as we forecast each time step in the test dataset.

This raises the question as to what would be a good initial state for the network prior to forecasting the test dataset.

In this tutorial, we will seed the state by making a prediction on all samples in the training dataset. In theory, the internal state should be set up ready to forecast the next time step.

We now have all of the pieces to fit an LSTM Network model for the Shampoo Sales dataset and evaluate its performance.

In the next section, we will put all of these pieces together.

Complete LSTM Example

In this section, we will fit an LSTM to the Shampoo Sales dataset and evaluate the model.

This will involve drawing together all of the elements from the prior sections. There are a lot of them, so let’s review:

Load the dataset from CSV file.
Transform the dataset to make it suitable for the LSTM model, including:
1. Transforming the data to a supervised learning problem.
2. Transforming the data to be stationary.
3. Transforming the data so that it has the scale -1 to 1.
Fitting a stateful LSTM network model to the training data.
Evaluating the static LSTM model on the test data.
Report the performance of the forecasts.

Some things to note about the example:

The scaling and inverse scaling behaviors have been moved to the functions scale() and invert_scale() for brevity.
The test data is scaled using the fit of the scaler on the training data, as is required to ensure the min/max values of the test data do not influence the model.
The order of data transforms was adjusted for convenience to first make the data stationary, then a supervised learning problem, then scaled.
Differencing was performed on the entire dataset prior to splitting into train and test sets for convenience. We could just as easily collect observations during the walk-forward validation and difference them as we go. I decided against it for readability.

The complete example is listed below.

你可能感兴趣的:(ML,in,coding,深度学习)

localStorage在上面位置？数据存放文件名是什么？ 2301_79698214 html java
在上述代码中，数据并不是以传统文件的形式存放在某个具体的文件里，而是存储在浏览器的localStorage中。localStorage是HTML5新增的一个会话存储对象，它用于临时保存同一窗口（或标签页）的数据，在关闭窗口或标签页后数据仍然存在。数据存储位置和文件名存储位置：localStorage是浏览器提供的一个存储机制，数据存储在浏览器的本地存储区域，不同的浏览器存储位置不同，例如：Chro
Promise 原理与实战：从基础到高级的完整教程 D.eL 前端工程化从无 -通前端 javascript
一、前言：为什么会出现Promise?Promise的重要性我认为没有必要多说，概括起来就是五个字：必！须！得！掌！握！。而且还要掌握透彻，在实际的使用中，有非常多的应用场景我们不能立即知道应该如何继续往下执行。最常见的一个场景就是ajax请求，通俗来说，由于网速的不同，可能你得到返回值的时间也是不同的，这个时候我们就需要等待，结果出来了之后才知道怎么样继续下去。letxhr=newXMLHttp
NSSCTF_crypto_[HGAME 2022 week3]RSA attack 3 岁岁的O泡奶 python 开发语言密码学 crypto NSSCTF 维纳攻击
[HGAME2022week3]RSAattack3题目:太多了自己去看，提示:维纳攻击首先在做这题之前你得先懂得维纳攻击的原理https://www.cnblogs.com/wandervogel/p/16805992.htmlok啊看懂了维纳攻击的原理就来开始写脚本吧fromCrypto.Util.numberimportlong_to_bytesimportgmpy2#已知参数n=50741
网页大屏适配使用css的scale方法缺点是两边会有留白；无足鸟丶 css css3 html javascript 前端
网页大屏适配使用css的scale方法缺点是两边会有留白；Document*{margin:0;padding:0;}html,body{width:100vw;height:100vh;background-color:blue;}#container{width:100%;height:100%;}.box{width:1920px;height:1080px;background-color
Browser-Use WebUI项目启动指南思考在马桶上人工智能 chatgpt 经验分享 python
摘要此前发布《Browser-UseWebUI使用体验》博文后，鉴于部分朋友运行时出现问题，重新运行并整理相关内容。本文详细记录WebUI项目启动全过程，涵盖Python3.11+、Chrome浏览器及APIKeys等环境要求，Python环境检查、依赖安装等环境配置步骤，.env文件中环境变量的设置方法。同时，针对启动中如lxml.html.clean依赖缺失、连接被拒等问题给出解决方案，介绍启
网络安全入门教程（非常详细）从零基础入门到精通，看完这一篇就够了白帽黑客坤哥 web安全网络安全 python windows
href="https://csdnimg.cn/release/blogv2/dist/mdeditor/css/editerView/kdoc_html_views-1a98987dfd.css"rel="stylesheet"/>href="https://csdnimg.cn/release/blogv2/dist/mdeditor/css/editerView/ck_htmledit_v
10 分钟学会SpringValidation数据校验和全局异常处理 ohn.yu spring spring boot java
以下是一个使用Spring开发的简单RESTAPI小程序，通过对一张user表进行操作，代码演示如何RestAPI开发中实现数据校验、全局异常处理和返回Json格式数据。使用的核心框架包括SpringBootSpringWebSpringDataJPABeanValidation（JSR-303）Lombok1.项目依赖（pom.xml）创建一个Maven项目，添加以下依赖："xmlns:xsi=
移动端IOS的H5页面被键盘顶起后，底部有一大片空白区域的解决方法不怕麻烦的鹿丸浏览器 HTML5 JavaScript 前端 html5 javascript
在移动端开发中，当使用HTML5(特别是在Vue.js框架下)构建应用时，经常会遇到键盘弹出导致页面内容被顶起的问题。当键盘收起后，页面未能自动恢复到原来的位置。当键盘弹出时，你可以通过JavaScript监听键盘的显示和隐藏事件，并相应地调整页面的滚动位置。exportdefault{mounted(){window.addEventListener('focusin',this.handleF
爬虫基础--request库详解 amo的代码园_毕设 Java基础爬虫 java spring boot vue.js python 开发语言
爬虫基础–request库详解1.requests模块介绍request库中文文档：https://docs.python-requests.org/zh_CN/latest/user/quickstart.htmlrequests是一个非常流行的PythonHTTP第三方库，它允许你发送各种HTTP请求，处理cookies、会话、连接池、重定向、多种认证方式等，使得处理HTTP请求变得非常便捷，
Selenium实战-模拟登录淘宝并爬取商品信息_使用selenium模拟真实登录行为,并爬取商品评论数据。 2401_84009899 程序员 selenium python 测试工具
模拟淘宝登录deflogin_taobao():print(‘开始登录…’)try:login_url=‘https://login.taobao.com/member/login.jhtml’driver.get(login_url)input_login_id=wait.until(EC.presence_of_element_located((By.ID,‘fm-login-id’)))in
uniapp中使用webview并与原页面通信数学分析分析什么？ uni-app
uniapp中使用webview并与原页面通信1.接收数据主要使用@message与@onPostMessage接收原页面数据，且两个方法只能在APP中使用，其他平台均不支持。/***接收页面返回参数*@param{Object}item*/htmlMessage(item){console.log('收到的消息',item)letdata=item.detail...},2.发送数据（调用原页面
`fetch` 和 `axios`的前端使用区别 Studying_swz blog 前端
欢迎访问的个人博客：https://swzbk.site/，加好友，拉你入福利群fetch和axios`是前端常用的两种HTTP客户端，以下是它们的核心区别及适用场景：一、本质区别特性fetchaxios类型浏览器原生API（部分环境需polyfill）第三方库（需通过npm/yarn安装）底层实现基于Promise基于Promise，封装了XMLHttpRequest二、核心功能对比1.请求与响
uniapp工程中解析markdown文件 pvfhv uni-app
在uniapp中如何导入markdown文件，同时在页面中解析成html，请参考以下配置：1.安装以下3个依赖包npminstallmarkedhighlight.jsvite-plugin-markdown2.创建vite.config.js配置文件//vite.config.jsimport{defineConfig}from'vite';importunifrom'@dcloudio/vit
智慧城市道路防护栏破损缺陷检测数据集VOC+YOLO格式6939张3类别 FL1623863129 数据集 YOLO 深度学习机器学习
数据集格式：PascalVOC格式+YOLO格式(不包含分割路径的txt文件，仅仅包含jpg图片以及对应的VOC格式xml文件和yolo格式txt文件)图片数量(jpg文件个数)：6939标注数量(xml文件个数)：6939标注数量(txt文件个数)：6939标注类别数：3标注类别名称(注意yolo格式类别顺序不和这个对应，而以labels文件夹classes.txt为准):["body","cr
五、AIGC大模型_09手动实现ReAct_Agent 学不会lostfound AI 人工智能 react_agent LangGraph Multi-Agent PlanAndExecute AIGC
0、前言在上一章节中，我们了解到：create_react_agent是LangGraph提供的一个预构建方法（fromlanggraph.prebuiltimportcreate_react_agent），它可以将语言模型（LLM）和一组工具（Tools）结合起来，创建一个能够根据用户输入自动调用工具的智能代理，这个代理可以根据用户的请求，决定是否需要调用某个工具，并将工具的输出反馈给用户这个函
详解小程序多端框架全面测评前端可乐老师前端
现在流行的多端框架可以大致分为三类：1.全包型这类框架最大的特点就是从底层的渲染引擎、布局引擎，到中层的DSL，再到上层的框架全部由自己开发，代表框架是Qt和Flutter。这类框架优点非常明显：性能（的上限）高；各平台渲染结果一致。缺点也非常明显：需要完全重新学习DSL（QML/Dart），以及难以适配中国特色的端：小程序。这类框架是最原始也是最纯正的的多端开发框架，由于底层到上层每个环节都掌握
Springboot启动失败：解决「org.yaml.snakeyaml.error.YAMLException」报错全记录 -天凉好秋- spring boot java idea visual studio code
##关键字Java、Springboot、vscode、idea、nacos启动失败、YAMLException、字符集配置---##背景环境###项目架构-**框架**：SSM（Spring+SpringMVC+MyBatis）-**中间件**：Nacos（配置管理+服务发现）-**配置存储**：Nacos中存储了Springboot的配置，包括：数据库连接信息、Redis连接信息、服务配置等。
PDF转图片 JAVA JAVA派派 java PDF
前言以下是一个使用ApachePDFBox将PDF文件转换为图片的封装方法。这个方法将会把PDF的每一页转换为一张图片，并保存到指定的目录中。1.添加依赖首先，你需要在项目中添加PDFBox的依赖。如果你使用的是Maven，可以在pom.xml中添加以下依赖：org.apache.pdfboxpdfbox2.0.292.转换方法importorg.apache.pdfbox.pdmodel.PDD
将 VOC 格式 XML 转换为 YOLO 格式 TXT JeJe同学 xml YOLO
目录1.导入必要的模块2.定义类别名称3.设置文件路径完整代码1.导入必要的模块importosimportxml.etree.ElementTreeasETos：用于文件和目录操作，例如创建目录、遍历文件等。xml.etree.ElementTree：用于解析XML文件，从中提取信息。2.定义类别名称class_names=['nest','balloon','kite','trash']这是一
使用Tiktoken进行文本分割：优化大语言模型的输入 bhawfgrcbtwny 语言模型 python 人工智能
引言在处理大语言模型时，因其对输入的token数量有限制，文本分割成为一个至关重要的任务。为了确保生成的文本块不会超过模型的token限制，我们需要使用与模型相同的tokenizer来计数和分割文本。在本文中，我们将探讨如何使用Tiktoken和其他工具来实现有效的文本分割。主要内容1.Tiktoken介绍Tiktoken是由OpenAI创建的一个快速BPE（BytePairEncoding）to
python 输入一行字符串删除其中所有大写字母后输出_Python练习题3.17删除字符 weixin_39624873 python 输入一行字符串删除其中所有大写字母后输出
输入一个字符串str，再输入要删除字符c，大小写不区分，将字符串str中出现的所有字符c删除。输入格式:在第一行中输入一行字符在第二行输入待删除的字符输出格式:在一行中输出删除后的字符串输入样例:在这里给出一组输入。例如：beee输出样例:在这里给出相应的输出。例如：result:b代码如下：#!/usr/bin/python#-*-coding:utf-8-*-s=input().strip()
图像处理篇---图像预处理 Ronin-Lotus 图像处理篇深度学习篇程序代码篇图像处理人工智能 opencv python 深度学习计算机视觉
文章目录前言一、通用目的1.1数据标准化目的实现1.2噪声抑制目的实现高斯滤波中值滤波双边滤波1.3尺寸统一化目的实现1.4数据增强目的实现1.5特征增强目的实现：边缘检测直方图均衡化锐化二、分领域预处理2.1传统机器学习（如SVM、随机森林）2.1.1特点2.1.2预处理重点灰度化二值化形态学操作特征工程2.2深度学习（如CNN、Transformer）2.2.1特点2.2.2预处理重点通道顺序
目前市场上主流的机器视觉的框架有哪些？他们的特点及优劣 yuanpan 机器学习计算机视觉
目前市场上主流的机器视觉框架和工具可以分为商业软件、开源工具和深度学习框架三大类。以下是它们的总结及特点对比：1.商业软件(1)Halcon(MVTec)特点：专注于工业机器视觉，提供高精度、高效率的算法。支持复杂的工业应用，如缺陷检测、3D视觉、深度学习等。提供图形化开发工具HDevelop和多种编程接口。优势：算法优化好，适合实时工业应用。硬件兼容性强，支持多种工业相机和设备。劣势：商业软件，
1.1PaddleTS_环境配置：一个易用的深度时序建模的Python库 pythonQA python paddlepaddle
PaddleTS是一个易用的深度时序建模的Python库，它基于飞桨深度学习框架PaddlePaddle，专注业界领先的深度模型，旨在为领域专家和行业用户提供可扩展的时序建模能力和便捷易用的用户体验。PaddleTS的主要特性包括：设计统一数据结构，实现对多样化时序数据的表达，支持单目标与多目标变量，支持多类型协变量封装基础模型功能，如数据加载、回调设置、损失函数、训练过程控制等公共方法，帮助开发
【大模型科普】AIGC技术发展与应用实践（一文读懂AIGC）人工智能
【专栏介绍】⌈⌈⌈人工智能与大模型应用⌋⌋⌋人工智能（AI）通过算法模拟人类智能，利用机器学习、深度学习等技术驱动医疗、金融等领域的智能化。大模型是千亿参数的深度神经网络（如ChatGPT），经海量数据训练后能完成文本生成、图像创作等复杂任务，显著提升效率，但面临算力消耗、数据偏见等挑战。当前正加速与教育、科研融合，未来需平衡技术创新与伦理风险，推动可持续发展。文章目录一、AIGC概述（一）什么是
代码逐行解析 | 教你在C++中使用深度学习提取特征点 3Ｄ视觉工坊 3D视觉从入门到精通 c++深度学习开发语言人工智能
点击下方卡片，关注「3D视觉工坊」公众号选择星标，干货第一时间送达扫描下方二维码，加入3D视觉技术星球，星球内汇集了众多3D视觉实战问题，以及各个模块的学习资料：最新顶会论文、书籍、源码、视频（近20门系统课程[星球成员可免费学习]）等。想要入门3D视觉、做项目、搞科研，就加入我们吧。作者：泡椒味的口香糖|来源：3DCV添加微信：dddvision
3DXML 与 SOLIDWORKS 格式转换：技术协同及迪威模型方案 3D小将迪威模型联讯软件 SolidWorks模型 UG模型 Rhino模型 SketchUp模型 catia模型 stl模型 stp模型
一、引言在产品设计的前沿领域，3DXML与SOLIDWORKS作为主流格式，虽各有所长，但因格式差异，常成为数据流通与协作的阻碍。对于技术人员和学生党而言，掌握二者间的转换技术，不仅能提升设计效率，更是参与复杂项目协作的必备技能。迪威模型在线转换功能，凭借其先进技术，为这一转换难题提供了高效解决方案。二、3DXML与SOLIDWORKS格式基础（一）3DXML3DXML由达索系统精心打造，其核心压
工作记录 2017-01-20 月巴月巴白勺合鸟月半医疗行业开发技术分享 Microsoft Visual Studio开发技术分享健康医疗 C#
工作记录2017-01-20序号工作相关人员1修改从AmazingChart导出的数据的程序。处理AmazingChart的数据的导入，预计下周一可以提交。修改EDI837的生成。更新RD服务器。郝更新的问题1、更新了DataExport。1.1增加了BillingJobInfo\ProblemList、PatVisit\ProviderInfo\ProviderList、PatMas\Probl
深度学习-130-RAG技术之基于Anything LLM搭建本地私人知识库的应用策略问题总结(一) 皮皮冰燃深度学习深度学习人工智能 RAG
文章目录1AnythingLLM的本地知识库1.1本地知识库应用场景1.2效果对比及思考1.3本地体现在哪些方面1.3.1知识在本地1.3.2分割后的文档在本地1.3.3大模型部署运行在本地2问错问题带来的问题2.1常见的问题2.2原因分析3为什么LLM不使用我的文件？3.1LLM不是万能的【omnipotent】3.2LLM不会自省【introspect】3.3AnythingLLM是如何工作的
设备树学习（二十三、番外篇-中断子系统之softirq）奔跑的小刺猬设备树设备树原理和实现
既然开始学了，那么还是一次把中断的所有知识都系统的学一下。刚好有蜗窝大神的博客做指引。http://www.wowotech.net/irq_subsystem/soft-irq.html一、前言对于中断处理而言，linux将其分成了两个部分，一个叫做中断handler（tophalf），是全程关闭中断的，另外一部分是deferabletask（bottomhalf），属于不那么紧急需要处理的事情
Maven Array_06 eclipse jdk maven
Maven Maven是基于项目对象模型(POM)，信息来管理项目的构建，报告和文档的软件项目管理工具。 Maven 除了以程序构建能力为特色之外，还提供高级项目管理工具。由于 Maven 的缺省构建规则有较高的可重用性，所以常常用两三行 Maven 构建脚本就可以构建简单的项目。由于 Maven 的面向项目的方法，许多 Apache Jakarta 项目发文时使用 Maven，而且公司
ibatis的queyrForList和queryForMap区别 bijian1013 java ibatis
一.说明 iBatis的返回值参数类型也有种：resultMap与resultClass，这两种类型的选择可以用两句话说明之： 1.当结果集列名和类的属性名完全相对应的时候，则可直接用resultClass直接指定查询结果类
LeetCode[位运算] - #191 计算汉明权重 Cwind java 位运算 LeetCode Algorithm 题解
原题链接：#191 Number of 1 Bits 要求：写一个函数，以一个无符号整数为参数，返回其汉明权重。例如，‘11’的二进制表示为'00000000000000000000000000001011', 故函数应当返回3。汉明权重：指一个字符串中非零字符的个数；对于二进制串，即其中‘1’的个数。难度：简单分析：将十进制参数转换为二进制，然后计算其中1的个数即可。 “
浅谈java类与对象 15700786134 java
java是一门面向对象的编程语言，类与对象是其最基本的概念。所谓对象，就是一个个具体的物体，一个人，一台电脑，都是对象。而类，就是对象的一种抽象，是多个对象具有的共性的一种集合，其中包含了属性与方法，就是属于该类的对象所具有的共性。当一个类创建了对象，这个对象就拥有了该类全部的属性，方法。相比于结构化的编程思路，面向对象更适用于人的思维
linux下双网卡同一个IP 被触发 linux
转自： http://q2482696735.blog.163.com/blog/static/250606077201569029441/ 由于需要一台机器有两个网卡，开始时设置在同一个网段的IP，发现数据总是从一个网卡发出，而另一个网卡上没有数据流动。网上找了下，发现相同的问题不少：一、关于双网卡设置同一网段IP然后连接交换机的时候出现的奇怪现象。当时没有怎么思考、以为是生成树
安卓按主页键隐藏程序之后无法再次打开肆无忌惮_ 安卓
遇到一个奇怪的问题，当SplashActivity跳转到MainActivity之后，按主页键，再去打开程序，程序没法再打开（闪一下），结束任务再开也是这样，只能卸载了再重装。而且每次在Log里都打印了这句话"进入主程序"。后来发现是必须跳转之后再finish掉SplashActivity 本来代码： // 销毁这个Activity fin
通过cookie保存并读取用户登录信息实例知了ing JavaScript html
通过cookie的getCookies()方法可获取所有cookie对象的集合；通过getName()方法可以获取指定的名称的cookie；通过getValue()方法获取到cookie对象的值。另外，将一个cookie对象发送到客户端，使用response对象的addCookie()方法。下面通过cookie保存并读取用户登录信息的例子加深一下理解。（1）创建index.jsp文件。在改
JAVA 对象池矮蛋蛋 java ObjectPool
原文地址： http://www.blogjava.net/baoyaer/articles/218460.html Jakarta对象池 ☆为什么使用对象池恰当地使用对象池化技术，可以有效地减少对象生成和初始化时的消耗，提高系统的运行效率。Jakarta Commons Pool组件提供了一整套用于实现对象池化
ArrayList根据条件+for循环批量删除的方法 alleni123 java
场景如下： ArrayList<Obj> list Obj-> createTime, sid. 现在要根据obj的createTime来进行定期清理。（释放内存） ------------------------- 首先想到的方法就是 for(Obj o:list){ if(o.createTime-currentT>xxx){
阿里巴巴“耕地宝”大战各种宝百合不是茶平台战略
“耕地保”平台是阿里巴巴和安徽农民共同推出的一个 “首个互联网定制私人农场”，“耕地宝”由阿里巴巴投入一亿，主要是用来进行农业方面，将农民手中的散地集中起来不仅加大农民集体在土地上面的话语权，还增加了土地的流通与利用率，提高了土地的产量，有利于大规模的产业化的高科技农业的发展，阿里在农业上的探索将会引起新一轮的产业调整，但是集体化之后农民的个体的话语权将更少，国家应出台相应的法律法规保护
Spring注入有继承关系的类（1） bijian1013 java spring
一个类一个类的注入 1.AClass类 package com.bijian.spring.test2; public class AClass { String a; String b; public String getA() { return a; } public void setA(Strin
30岁转型期你能否成为成功人士 bijian1013 成功
很多人由于年轻时走了弯路，到了30岁一事无成，这样的例子大有人在。但同样也有一些人，整个职业生涯都发展得很优秀，到了30岁已经成为职场的精英阶层。由于做猎头的原因，我们接触很多30岁左右的经理人，发现他们在职业发展道路上往往有很多致命的问题。在30岁之前，他们的职业生涯表现很优秀，但从30岁到40岁这一段，很多人
[Velocity三]基于Servlet+Velocity的web应用 bit1129 velocity
什么是VelocityViewServlet 使用org.apache.velocity.tools.view.VelocityViewServlet可以将Velocity集成到基于Servlet的web应用中，以Servlet+Velocity的方式实现web应用 Servlet + Velocity的一般步骤 1.自定义Servlet，实现VelocityViewServl
【Kafka十二】关于Kafka是一个Commit Log Service bit1129 service
Kafka is a distributed, partitioned, replicated commit log service.这里的commit log如何理解？ A message is considered "committed" when all in sync replicas for that partition have applied i
NGINX + LUA实现复杂的控制 ronin47 lua nginx 控制
安装lua_nginx_module 模块 lua_nginx_module 可以一步步的安装，也可以直接用淘宝的OpenResty Centos和debian的安装就简单了。。这里说下freebsd的安装： fetch http://www.lua.org/ftp/lua-5.1.4.tar.gz tar zxvf lua-5.1.4.tar.gz cd lua-5.1.4 ma
java-14.输入一个已经按升序排序过的数组和一个数字，在数组中查找两个数，使得它们的和正好是输入的那个数字 bylijinnan java
public class TwoElementEqualSum { /** * 第 14 题：题目：输入一个已经按升序排序过的数组和一个数字，在数组中查找两个数，使得它们的和正好是输入的那个数字。要求时间复杂度是 O(n) 。如果有多对数字的和等于输入的数字，输出任意一对即可。例如输入数组 1 、 2 、 4 、 7 、 11 、 15 和数字 15 。由于
Netty源码学习-HttpChunkAggregator-HttpRequestEncoder-HttpResponseDecoder bylijinnan java netty
今天看Netty如何实现一个Http Server org.jboss.netty.example.http.file.HttpStaticFileServerPipelineFactory： pipeline.addLast("decoder", new HttpRequestDecoder()); pipeline.addLast(&quo
java敏感词过虑-基于多叉树原理 cngolon 违禁词过虑替换违禁词敏感词过虑多叉树
基于多叉树的敏感词、关键词过滤的工具包，用于java中的敏感词过滤 1、工具包自带敏感词词库，第一次调用时读入词库，故第一次调用时间可能较长，在类加载后普通pc机上html过滤5000字在80毫秒左右，纯文本35毫秒左右。 2、如需自定义词库，将jar包考入WEB-INF工程的lib目录，在WEB-INF/classes目录下建一个 utf-8的words.dict文本文件，
多线程知识 cuishikuan 多线程
T1，T2，T3三个线程工作顺序，按照T1，T2，T3依次进行 public class T1 implements Runnable{ @Override
spring整合activemq dalan_123 java spring jms
整合spring和activemq需要搞清楚如下的东东1、ConnectionFactory分： a、spring管理连接到activemq服务器的管理ConnectionFactory也即是所谓产生到jms服务器的链接 b、真正产生到JMS服务器链接的ConnectionFactory还得
MySQL时间字段究竟使用INT还是DateTime？ dcj3sjt126com mysql
环境：Windows XPPHP Version 5.2.9MySQL Server 5.1 第一步、创建一个表date_test（非定长、int时间） CREATE TABLE `test`.`date_test` (`id` INT NOT NULL AUTO_INCREMENT ,`start_time` INT NOT NULL ,`some_content`
Parcel: unable to marshal value dcj3sjt126com marshal
在两个activity直接传递List<xxInfo>时，出现Parcel: unable to marshal value异常。在MainActivity页面（MainActivity页面向NextActivity页面传递一个List<xxInfo>）： Intent intent = new Intent(this, Next
linux进程的查看上（ps） eksliang linux ps linux ps -l linux ps aux
ps:将某个时间点的进程运行情况选取下来转载请出自出处：http://eksliang.iteye.com/admin/blogs/2119469 http://eksliang.iteye.com ps 这个命令的man page 不是很好查阅，因为很多不同的Unix都使用这儿ps来查阅进程的状态，为了要符合不同版本的需求，所以这个
为什么第三方应用能早于System的app启动 gqdy365 System
Android应用的启动顺序网上有一大堆资料可以查阅了，这里就不细述了，这里不阐述ROM启动还有bootloader，软件启动的大致流程应该是启动kernel -> 运行servicemanager 把一些native的服务用命令启动起来（包括wifi, power, rild, surfaceflinger, mediaserver等等）-> 启动Dalivk中的第一个进程Zygot
App Framework发送JSONP请求(3) hw1287789687 jsonp 跨域请求发送jsonp ajax请求越狱请求
App Framework 中如何发送JSONP请求呢? 使用jsonp,详情请参考:http://json-p.org/ 如何发送Ajax请求呢? (1)登录 /*** * 会员登录 * @param username * @param password */ var user_login=function(username,password){ // aler
发福利，整理了一份关于“资源汇总”的汇总 justjavac 资源
觉得有用的话，可以去github关注：https://github.com/justjavac/awesome-awesomeness-zh_CN 通用 free-programming-books-zh_CN 免费的计算机编程类中文书籍精彩博客集合 hacke2/hacke2.github.io#2 ResumeSample 程序员简历
用 Java 技术创建 RESTful Web 服务 macroli java 编程 Web REST
转载：http://www.ibm.com/developerworks/cn/web/wa-jaxrs/ JAX-RS (JSR-311) 【 Java API for RESTful Web Services 】是一种 Java™ API，可使 Java Restful 服务的开发变得迅速而轻松。这个 API 提供了一种基于注释的模型来描述分布式资源。注释被用来提供资源的位
CentOS6.5-x86_64位下oracle11g的安装详细步骤及注意事项超声波 oracle linux
前言：这两天项目要上线了，由我负责往服务器部署整个项目，因此首先要往服务器安装oracle，服务器本身是CentOS6.5的64位系统，安装的数据库版本是11g，在整个的安装过程中碰到很多的坑，不过最后还是通过各种途径解决并成功装上了。转别写篇博客来记录完整的安装过程以及在整个过程中的注意事项。希望对以后那些刚刚接触的菜鸟们能起到一定的帮助作用。安装过程中可能遇到的问题（注
HttpClient 4.3 设置keeplive 和 timeout 的方法 supben httpclient
ConnectionKeepAliveStrategy kaStrategy = new DefaultConnectionKeepAliveStrategy() { @Override public long getKeepAliveDuration(HttpResponse response, HttpContext context) { long keepAlive
Spring 4.2新特性-@Import注解的升级 wiselyman spring 4
3.1 @Import @Import注解在4.2之前只支持导入配置类在4.2,@Import注解支持导入普通的java类,并将其声明成一个bean 3.2 示例演示java类 package com.wisely.spring4_2.imp; public class DemoService { public void doSomethin