ALGOGENE | Guidelines to re-use model results for backtests

admin

Guidelines to re-use model results for backtests

Programming

When developing a trading algorithm, it may be the case where we have trained/built a particular model for a financial instrument and want to apply it on other financial sectors/markets. For such case, re-using a pre-trained model for strategy backtests could largely reduce the time in repeated model calibration process. However, if we handle it inproperly, it is likely for us to fall into the trap of "Look Ahead Bias"!

What is "Look Ahead Bias"?

Look-ahead bias refers to making trade decisions based on data/information that would only be available in the future.

For backtesting, it is crucial that we only use information that would have been available at the time of the trade. For example, using a yearly earnings figure that would be released a quarter later will potentially bias the results in favor of the desired outcome. The accuracy of this strategy performance is also doubtful.

Here are some considerations for identifying a potential look-ahead bias.

When was the data released?
At what time the data observable and available to us?

Load/Dump model on ALGOGENE Web IDE

As an event streaming backtest enginee, ALGOGENE primarily eliminated the look-ahead bias where data feed into our strategy script according to chronological order, as if replaying a recorder. On the other hand, ALGOGENE provides feasibility to save and re-use customized models for backtesting on the Web IDE. Such feasibility, however, might expose us to potential look-ahead bias that we might inappropriately introduce. Thus, the questions above would be a guide to justify whether our re-used model is logically sound good.

Now, let's see how to load/dump a model on ALGOGENE IDE. Upon account registration on the platform, each user has automatically been assigned with a partition on ALGOGENE's cloud environment. All we need to do is simply to save the model to our assigned cloud directory (i.e. self.evt.path_lib), and then retrieve it for other backtest process. In the following example, it is presented how to implement programmatically, but it may not be logically sound good in terms of strategy backtesting.

Suppose we want to find out the mathematical relationship between 3 financial instruments, defined as Y = f(X1, X2). We based on 'keras' (https://www.tensorflow.org/guide/keras/save_and_serialize) library to derive the fitted model using the past 100 daily observations. In the first example below, we create 'model_1' directory and dump the results there. In the second example, 'model_1' is retrieved for another backtest.

Source Codes

Save a model refers to line #52

from AlgoAPI import AlgoAPIUtil, AlgoAPI_Backtest
from datetime import datetime, timedelta
import tensorflow as tf
from tensorflow import keras
import numpy as np


def get_model():
    # Create a simple model with 2 input variables
    inputs = keras.Input(shape=(2,))
    outputs = keras.layers.Dense(1)(inputs)
    model = keras.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="mean_squared_error")
    return model

class AlgoEvent:
    def __init__(self):
        self.lasttime = datetime(2000,1,1)
        self.isSaved = False
        self.numOfObs = 100 
        self.arr_Y, self.arr_X1, self.arr_X2 = [], [], []
        self.model = get_model()

    def start(self, mEvt):
        # get my selected financial instruments
        self.myinstrument_Y = mEvt['subscribeList'][0]
        self.myinstrument_X1 = mEvt['subscribeList'][1]
        self.myinstrument_X2 = mEvt['subscribeList'][2]

        # start backtest
        self.evt = AlgoAPI_Backtest.AlgoEvtHandler(self, mEvt)
        self.evt.start()

    def on_bulkdatafeed(self, isSync, bd, ab):
        if isSync and not self.isSaved:
            # get new day price
            if bd[self.myinstrument_Y]['timestamp'] > self.lasttime + timedelta(hours=24):
                self.lasttime = bd[self.myinstrument_Y]['timestamp']

                # append observation
                self.arr_Y.append(bd[self.myinstrument_Y]['lastPrice'])
                self.arr_X1.append(bd[self.myinstrument_X1]['lastPrice'])
                self.arr_X2.append(bd[self.myinstrument_X2]['lastPrice'])

                if len(self.arr_Y) >= self.numOfObs:
                    # Train the model
                    test_input = np.array([[self.arr_X1[i], self.arr_X2[i]] for i in range(0,self.numOfObs)])
                    test_target = np.array([[self.arr_Y[i]] for i in range(0,self.numOfObs)])
                    self.model.fit(test_input, test_target)

                    # save the model
                    self.model.save(self.evt.path_lib+"model_1")

                    self.isSaved = True

Load a model refers to line #24

from AlgoAPI import AlgoAPIUtil, AlgoAPI_Backtest
from datetime import datetime, timedelta
import tensorflow as tf
from tensorflow import keras


def get_model():
    # Create a simple model with 2 variables
    inputs = keras.Input(shape=(2,))
    outputs = keras.layers.Dense(1)(inputs)
    model = keras.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="mean_squared_error")
    return model

class AlgoEvent:
    def __init__(self):
        pass

    def start(self, mEvt):
        # start backtest
        self.evt = AlgoAPI_Backtest.AlgoEvtHandler(self, mEvt)

        # load my model_1
        self.model = keras.models.load_model(self.evt.path_lib+"model_1")

        self.evt.start()

    def on_marketdatafeed(self, md, ab):
        # use self.model for new market data feed ...
        pass

3 0

Posted on : 2020-12-02 14:56:53.079000

admin

Since the system release in 2021.04, users can now interact with your assigned cloud directory.

The self.evt.path_lib mentioned above will be equivalent to /lib in cloud server.

From [My History] > [Custom File Viewer], you can directly upload, download, edit, and delete your data and models.

0 0

Posted on : 2021-04-30 15:33:50.725000

Bee Bee

Original Posted by - admin: Since the system release in 2021.04, users can now interact with your assigned cloud directory.
The self.evt.path_lib mentioned above will be equivalent to /lib in cloud server.
From [My History] > [Custom File Viewer], you can directly upload, download, edit, and delete your data and models.

Hi, I have trained a model in my local machine. Can I include it for backtest?

0 0

Posted on : 2021-05-27 11:51:46.770000

admin

Original Posted by - Bee Bee: Hi, I have trained a model in my local machine. Can I include it for backtest?

Yes, you can firstly upload the trained model and library under cloud directory /lib, say /lib/m1

Then, in backtest script, the model can be imported or loaded from self.evt.path_lib+"m1".

2 0

Posted on : 2021-05-30 13:43:04.279000

Hot Topic
投資必看莊家出貨與派發的四大警示信號
中美晶片博弈進入「過路費」時代
商業思維掌握財富成長的關鍵策略
The Truth Behind That $10k to $60 Million Dream
30年來最高利率！日本加息對市場的隱藏機會
认知决定命运掌握富人思维的20个关键
Bitcoin Under Pressure for Japan Upcoming Rate Hike?
賺錢新風口 AI微短劇的崛起與未來
From Theory to Practice: A Comprehensive Investment Guide for Every Trader
Python Project: Build an AI Face Detection Alarm System with Your Webcam

Guidelines to re-use model results for backtests

Programming

What is "Look Ahead Bias"?

Load/Dump model on ALGOGENE Web IDE

Source Codes

Categories