Folks, I need clarification on where the data should exist for a multi-year Backtest or Optimization.
You'll see in a previous support thread about doing a multi-quarter playback I posted, I came to determine all of the playback data needs to reside within the last quarter's data, to do a full run. For example, if I running a backtest on ES from Jan1 2023 to Dec31 2023 within a single run, that means I need to copy of the ES 03-23, ES 06-23, and ES 09-23 playback data into the ES 12-23 data. I'm wondering if the Backtest/Optimization data schema is equally bent?
For my test, I have the April 2033 and Nov 23 data for GC only. (Screen shot foo6). That is, I have only the GC 06-23 and GC 12-23 directories with corresponding data under my /tick directory.
When I run a test on GC 06-23 from April 1 to April 30 2023 on 06-23 GC and run only April23 I get a total net profit of $2640 (screen shot foo1)
When I run a test on GC 12-23 from Nov 1 to Nov 30 2023 on 12-23 GC and run only Nov23, I get a next profit of $820 (screen shot foo2)
Now, when I run a test from April 1 to Nov 30, I should get a profit of $3460, right? Well, that's not the case. When I run both months under one run, I instead get a profit of $3740 (screen shot foo3)
If I remove the data from GC 06-23 and place the April and Nov 23 data into the 12-23 GC directory, and run both months under one run, I get a total of $4500 (screen shot foo4)
If I remove the data from the GC 12-23 and place April and Nov 23 data into 06-23 GC directory, and run both months under one run, I get a total of ($740) (screen shot foo5)
So, all of this leaves me confused:
- I'm using the exact same data, but my results are significantly different as to where the data is placed.
- If I want to run a backtest over several years in a single run, should each quarter's directory contain data only for that quarter? Or, just like when I run a multiple year playback, I have to shove all of the data into the last quarter's directory?
- What should my backtest data placement be under a multi-year time period under a single run, to make sure I have the most accurate results?
Thanks for your clarity.

Comment