How to Get Reproducible Results with Keras

阿新 • • 發佈：2019-01-12

Neural network algorithms are stochastic.

This means they make use of randomness, such as initializing to random weights, and in turn the same network trained on the same data can produce different results.

This can be confusing to beginners as the algorithm appears unstable, and in fact they are by design. The random initialization allows the network to learn a good approximation for the function being learned.

Nevertheless, there are times when you need the exact same result every time the same network is trained on the same data. Such as for a tutorial, or perhaps operationally.

In this tutorial, you will discover how you can seed the random number generator so that you can get the same results from the same network on the same data, every time.

Let’s get started.

How to Get Reproducible Results from Neural Networks with Keras
Photo by Samuel John, some rights reserved.

Tutorial Overview

This tutorial is broken down into 6 parts. They are:

Why do I Get Different Results Every Time?
Demonstration of Different Results
The Solutions
Seed Random Numbers with the Theano Backend

Seed Random Numbers with the TensorFlow Backend
What if I Am Still Getting Different Results?

Environment

This tutorial assumes you have a Python SciPy environment installed. You can use either Python 2 or 3 with this example.

This tutorial assumes you have Keras (v2.0.3+) installed with either the TensorFlow (v1.1.0+) or Theano (v0.9+) backend.

This tutorial also assumes you have scikit-learn, Pandas, NumPy, and Matplotlib installed.

If you need help setting up your Python environment, see this post:

Why do I Get Different Results Every Time?

This is a common question I see from beginners to the field of neural networks and deep learning.

This misunderstanding may also come in the form of questions like:

How do I get stable results?
How do I get repeatable results?
What seed should I use?

Neural networks use randomness by design to ensure they effectively learn the function being approximated for the problem. Randomness is used because this class of machine learning algorithm performs better with it than without.

The most common form of randomness used in neural networks is the random initialization of the network weights. Although randomness can be used in other areas, here is just a short list:

Randomness in Initialization, such as weights.
Randomness in Regularization, such as dropout.
Randomness in Layers, such as word embedding.
Randomness in Optimization, such as stochastic optimization.

These sources of randomness, and more, mean that when you run the exact same neural network algorithm on the exact same data, you are guaranteed to get different results.

For more on the why behind stochastic algorithms, see the post:

Demonstration of Different Results

We can demonstrate the stochastic nature of neural networks with a small example.

In this section, we will develop a Multilayer Perceptron model to learn a short sequence of numbers increasing by 0.1 from 0.0 to 0.9. Given 0.0, the model must predict 0.1; given 0.1, the model must output 0.2; and so on.

The code to prepare the data is listed below.

# create sequence
length = 10
sequence = [i/float(length) for i in range(length)]
# create X/y pairs
df = DataFrame(sequence)
df = concat([df.shift(1), df], axis=1)
df.dropna(inplace=True)
# convert to MLPfriendly format
values = df.values
X, y = values[:,0], values[:,1]

12345678910

# create sequencelength=10sequence=[i/float(length)foriinrange(length)]# create X/y pairsdf=DataFrame(sequence)df=concat([df.shift(1),df],axis=1)df.dropna(inplace=True)# convert to MLPfriendly formatvalues=df.valuesX,y=values[:,0],values[:,1]

We will use a network with 1 input, 10 neurons in the hidden layer, and 1 output. The network will use a mean squared error loss function and will be trained using the efficient ADAM algorithm.

The network needs about 1,000 epochs to solve this problem effectively, but we will only train it for 100 epochs. This is to ensure we get a model that makes errors when making predictions.

After the network is trained, we will make predictions on the dataset and print the mean squared error.

The code for the network is listed below.

# design network
model = Sequential()
model.add(Dense(10, input_dim=1))
model.add(Dense(1))
model.compile(loss='mean_squared_error', optimizer='adam')
# fit network
model.fit(X, y, epochs=100, batch_size=len(X), verbose=0)
# forecast
yhat = model.predict(X, verbose=0)
print(mean_squared_error(y, yhat[:,0]))

12345678910

# design networkmodel=Sequential()model.add(Dense(10,input_dim=1))model.add(Dense(1))model.compile(loss='mean_squared_error',optimizer='adam')# fit networkmodel.fit(X,y,epochs=100,batch_size=len(X),verbose=0)# forecastyhat=model.predict(X,verbose=0)print(mean_squared_error(y,yhat[:,0]))

In the example, we will create the network 10 times and print 10 different network scores.

The complete code listing is provided below.

from pandas import DataFrame
from pandas import concat
from keras.models import Sequential
from keras.layers import Dense
from sklearn.metrics import mean_squared_error

# fit MLP to dataset and print error
def fit_model(X, y):
	# design network
	model = Sequential()
	model.add(Dense(10, input_dim=1))
	model.add(Dense(1))
	model.compile(loss='mean_squared_error', optimizer='adam')
	# fit network
	model.fit(X, y, epochs=100, batch_size=len(X), verbose=0)
	# forecast
	yhat = model.predict(X, verbose=0)
	print(mean_squared_error(y, yhat[:,0]))

# create sequence
length = 10
sequence = [i/float(length) for i in range(length)]
# create X/y pairs
df = DataFrame(sequence)
df = concat([df.shift(1), df], axis=1)
df.dropna(inplace=True)
# convert to MLP friendly format
values = df.values
X, y = values[:,0], values[:,1]
# repeat experiment
repeats = 10
for _ in range(repeats):
	fit_model(X, y)

123456789101112131415161718192021222324252627282930313233

from pandas import DataFramefrom pandas import concatfrom keras.models import Sequentialfrom keras.layers import Densefrom sklearn.metrics import mean_squared_error# fit MLP to dataset and print errordef fit_model(X,y):# design networkmodel=Sequential()model.add(Dense(10,input_dim=1))model.add(Dense(1))model.compile(loss='mean_squared_error',optimizer='adam')# fit networkmodel.fit(X,y,epochs=100,batch_size=len(X),verbose=0)# forecastyhat=model.predict(X,verbose=0)print(mean_squared_error(y,yhat[:,0]))# create sequencelength=10sequence=[i/float(length)foriinrange(length)]# create X/y pairsdf=DataFrame(sequence)df=concat([df.shift(1),df],axis=1)df.dropna(inplace=True)# convert to MLP friendly formatvalues=df.valuesX,y=values[:,0],values[:,1]# repeat experimentrepeats=10for_inrange(repeats):fit_model(X,y)

Running the example will print a different accuracy in each line.

Your specific results will differ. A sample output is provided below.

0.0282584265697
0.0457025913022
0.145698137198
0.0873461454407
0.0309397604521
0.046649185173
0.0958450337178
0.0130660263779
0.00625176026631
0.00296055161492

12345678910

0.02825842656970.04570259130220.1456981371980.08734614544070.03093976045210.0466491851730.09584503371780.01306602637790.006251760266310.00296055161492

The Solutions

The are two main solutions.

Solution #1: Repeat Your Experiment

The traditional and practical way to address this problem is to run your network many times (30+) and use statistics to summarize the performance of your model, and compare your model to other models.

I strongly recommend this approach, but it is not always possible due to the very long training times of some models.

For more on this approach, see:

Solution #2: Seed the Random Number Generator

Alternately, another solution is to use a fixed seed for the random number generator.

Random numbers are generated using a pseudo-random number generator. A random number generator is a mathematical function that will generate a long sequence of numbers that are random enough for general purpose use, such as in machine learning algorithms.

Random number generators require a seed to kick off the process, and it is common to use the current time in milliseconds as the default in most implementations. This is to ensure different sequences of random numbers are generated each time the code is run, by default.

This seed can also be specified with a specific number, such as “1”, to ensure that the same sequence of random numbers is generated each time the code is run.

The specific seed value does not matter as long as it stays the same for each run of your code.

The specific way to set the random number generator differs depending on the backend, and we will look at how to do this in Theano and TensorFlow.

Seed Random Numbers with the Theano Backend

Generally, Keras gets its source of randomness from the NumPy random number generator.

For the most part, so does the Theano backend.

We can seed the NumPy random number generator by calling the seed() function from the random module, as follows:

from numpy.random import seed
seed(1)

12	from numpy.random import seedseed(1)

The importing and calling of the seed function is best done at the top of your code file.

This is a best practice because it is possible that some randomness is used when various Keras or Theano (or other) libraries are imported as part of their initialization, even before they are directly used.

We can add the two lines to the top of our example above and run it two times.

You should see the same list of mean squared error values each time you run the code (perhaps with some minor variation due to precision on different machines), as follows:

0.169326527063
2.75750621228e-05
0.0183287291562
1.93553737255e-07
0.0549871087449
0.0906326807824
0.00337575114075
0.00414857518259
8.14587362008e-08
0.0522927019639

12345678910

0.1693265270632.75750621228e-050.01832872915621.93553737255e-070.05498710874490.09063268078240.003375751140750.004148575182598.14587362008e-080.0522927019639

Your results should match mine (ignoring minor differences of precision).

Seed Random Numbers with the TensorFlow Backend

Keras does get its source of randomness from the NumPy random number generator, so this must be seeded regardless of whether you are using a Theano or TensorFlow backend.

It must be seeded by calling the seed() function at the top of the file before any other imports or other code.

from numpy.random import seed
seed(1)

12	from numpy.random import seedseed(1)

In addition, TensorFlow has its own random number generator that must also be seeded by calling the set_random_seed() function immediately after the NumPy random number generator, as follows:

from tensorflow import set_random_seed
set_random_seed(2)

12	from tensorflow import set_random_seedset_random_seed(2)

To be crystal clear, the top of your code file must have the following 4 lines before any others;

from numpy.random import seed
seed(1)
from tensorflow import set_random_seed
set_random_seed(2)

1234	from numpy.random import seedseed(1)from tensorflow import set_random_seedset_random_seed(2)

You can use the same seed for both, or different seeds. I don’t think it makes much difference as the sources of randomness feed into different processes.

Adding these 4 lines to the above example will allow the code to produce the same results every time it is run. You should see the same mean squared error values as those listed below (perhaps with some minor variation due to precision on different machines):

0.224045112999
0.00154879478823
0.00387589994044
0.0292376881968
0.00945528404353
0.013305765525
0.0206255228201
0.0359538356108
0.00441943512128
0.298706569397

12345678910

0.2240451129990.001548794788230.003875899940440.02923768819680.009455284043530.0133057655250.02062552282010.03595383561080.004419435121280.298706569397

Your results should match mine (ignoring minor differences of precision).

What if I Am Still Getting Different Results?

To re-iterate, the most robust way to report results and compare models is to repeat your experiment many times (30+) and use summary statistics.

If this is not possible, you can get 100% repeatable results by seeding the random number generators used by your code. The solutions above should cover most situations, but not all.

What if you have followed the above instructions and still get different results from the same algorithm on the same data?

It is possible that there are other sources of randomness that you have not accounted for.

Randomness from a Third-Party Library

Perhaps your code is using an additional library that uses a different random number generator that too must be seeded.

Try cutting your code back to the minimum required (e.g. one data sample, one training epoch, etc.) and carefully read the API documentation in an effort to narrow down additional third-party libraries introducing randomness.

Randomness from Using the GPU

All of the above examples assume the code was run on a CPU.

It is possible that when using the GPU to train your models, the backend may be configured to use a sophisticated stack of GPU libraries, and that some of these may introduce their own source of randomness that you may or may not be able to account for.

For example, there is some evidence that if you are using Nvidia cuDNN in your stack, that this may introduce additional sources of randomness and prevent the exact reproducibility of your results.

Randomness from a Sophisticated Model

It is possible that because of the sophistication of your model and the parallel nature of training, that you are getting unreproducible results.

This is very likely caused by efficiencies made by the backend library and perhaps the inability to use the sequence of random numbers across cores.

I have not seen this myself, but see signs in some GitHub issues and StackOverflow questions.

You can try to reduce the complexity of your model to see if this affects the reproducibility of results, if only to narrow down the cause.

I would suggest reading up on how your backend uses randomness and see if there are any options open to you.

In Theano, see:

In TensorFlow, see:

Also, consider searching for other people with the same issue for further insight. Some great places to search include:

Summary

In this tutorial, you discovered how to get reproducible results for neural network models in Keras.

Specifically, you learned:

That neural networks are stochastic by design and that the source of randomness can be fixed to make results reproducible.
That you can seed the random number generators in NumPy and TensorFlow and this will make most Keras code 100% reproducible.
That there are some cases where there are additional sources of randomness and you have ideas on how to seek them out and perhaps fix them too.

Did this tutorial help?
Share your experience in the comments.

Are you still getting unreproducible results with Keras?
Share your experience; perhaps someone else here can help.

Frustrated With Your Progress In Deep Learning?

What If You Could Develop A Network in Minutes

…with just a few lines of Python

It covers self-study tutorials and end-to-end projects on topics like:
Multilayer Perceptrons, Convolutional Nets and Recurrent Neural Nets, and more…

Finally Bring Deep Learning To
Your Own Projects

Skip the Academics. Just Results.

How to Get Reproducible Results with Keras

Tutorial Overview

Environment

Why do I Get Different Results Every Time?

Demonstration of Different Results

The Solutions

Solution #1: Repeat Your Experiment

Solution #2: Seed the Random Number Generator

Seed Random Numbers with the Theano Backend

Seed Random Numbers with the TensorFlow Backend

What if I Am Still Getting Different Results?

Randomness from a Third-Party Library

Randomness from Using the GPU

Randomness from a Sophisticated Model

Summary

Frustrated With Your Progress In Deep Learning?

What If You Could Develop A Network in Minutes

Finally Bring Deep Learning To
Your Own Projects

How to Get Reproducible Results with Keras

How to Get Good Results Fast with Deep Learning for Time Series Forecasting

How to get bitting code with SEC-E9 key cutting machine

How To Get Baseline Results And Why They Matter

How To Get Started With Machine Learning in R (get results in one weekend)

question 002: dev c++ 當中如何調整字體大小？How to get the first program with C++? c++屬於什麽軟件？

AR# 71416 ZynqMP: How to get baremetal DPDMA example to work with single lane DisplayPort configured

How to Get Started With Conversational AI

How to Get AXU with the Argentas Wallet?

How To Get Started With Machine Learning Algorithms in R

How to Get Started with Deep Learning for Natural Language Processing (7

How to Get Started with Machine Learning in Python

How to Get Your First Data Science Job: Interview with Michael Galarnyk

How to create own operator with python in mxnet?

TED - How To Get Better At The Things You Care About

How to Get the Length of File in C

How to Get What You Want 如何得到你想要的

How to get Pycharm

轉載 -- How To Optimize Your Site With GZIP Compression

轉載 -- How To Optimize Your Site With HTTP Caching

How to Get Reproducible Results with Keras

Tutorial Overview

Environment

Why do I Get Different Results Every Time?

Demonstration of Different Results

The Solutions

Solution #1: Repeat Your Experiment

Solution #2: Seed the Random Number Generator

Seed Random Numbers with the Theano Backend

Seed Random Numbers with the TensorFlow Backend

What if I Am Still Getting Different Results?

Randomness from a Third-Party Library

Randomness from Using the GPU

Randomness from a Sophisticated Model

Summary

Frustrated With Your Progress In Deep Learning?

What If You Could Develop A Network in Minutes

Finally Bring Deep Learning To Your Own Projects

相關推薦

Finally Bring Deep Learning To
Your Own Projects