
Unit Testing in TorQ

Introduction

Unit testing is a software validation methodology where individual modules or 'units' of source code are tested to determine whether they work as intended. TorQ's unit testing framework has been designed to be user-friendly and includes useful features such as debugging tools and integration testing support. We shall see that, using this framework, it is straightforward to test both small units of code and larger pieces of application functionality.

Test Basics

My First Test

Tests are written in the style of Kx's k4unit: each test is a row in a CSV file, which is loaded into a q process along with the testing code and run line by line, with the results stored in a table. To write and run a basic test, create a CSV file containing some basic tests in a new folder, like the following:

action,ms,bytes,lang,code,repeat,minver,comment
comment,0,0,,this will be ignored,1,,""
before,0,0,q,aa:22,1,,"This sets a variable before the tests begin"
true,0,0,q,2=sum 1 1,1,,"A basic arithmetic check"
fail,0,0,q,2=`aa,1,,"This ought to fail"
after,0,0,q,bb:33,1,,"This code executes after the tests have run"

As in k4unit, the action column determines how each row is treated: before and after rows run setup and teardown code, run rows simply execute, true rows assert that the code returns true, and fail rows assert that the code signals an error (here 2=`aa fails because a long cannot be compared with a symbol). The ms and bytes columns set optional execution limits (0 disables them), repeat controls how many times the line is run, and minver sets a minimum q version.

Then use the torq.q script to start up a TorQ process with the -test flag pointing to the directory where the CSV is written, and the -debug flag so that the process outputs its logs to the console and stays alive:

$ q ${TORQHOME}/torq.q -proctype test -procname test1 -test /path/to/my/tests -debug

This will automatically load the test code and all the CSVs in that directory into a test TorQ process, run the tests in those files, and output the results to the screen.
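
Once the run completes, the results can also be queried from the q prompt. A minimal sketch, assuming the standard k4unit result tables (KUT holds the loaded tests, KUTR the results):

q)select action,code,ok from KUTR    / one row per executed test; ok=1b is a pass
q)select from KUTR where not ok      / show only the failures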

Simple RDB Test

A similar scenario is when the test code is loaded into a 'production' TorQ process, i.e. an RDB or a tickerplant, in order to test the functioning of that process. A simple RDB example can be seen here:

action,ms,bytes,lang,code,repeat,minver,comment
true,0,0,q,`segmentedtickerplant~.rdb.tickerplanttypes,1,,"Check the TP type"
true,0,0,q,`~.rdb.subscribeto,1,,"Check RDB is subscribed to null symbol (all tables)"
true,0,0,q,0~count heartbeat,1,,"Check heartbeat table is empty"
run,0,0,q,.rdb.upd[`heartbeat;(.z.p;`test;`rdb1;1;1i;`testhost;1i)],1,,"Call RDB UPD function"
true,0,0,q,1~count heartbeat,1,,"Check that a row has been added to the heartbeat table"

These tests simply check that certain process variables have been set properly and exercise the upd function. They are triggered similarly to the previous set, except that an RDB process is used to load the tests rather than a 'blank' test process. Here the -proctype is rdb, which loads the code in the ${KDBCODE}/rdb folder as well as any common code, so that the process logic can be tested. The procname can be anything you want:

$ q ${TORQHOME}/torq.q -proctype rdb -procname rdb1 -test /path/to/my/tests -debug
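
Once the process is up, the same logic can be exercised by hand from the q prompt. An illustrative session mirroring the run line above, assuming the heartbeat table starts empty:

q).rdb.upd[`heartbeat;(.z.p;`test;`rdb1;1;1i;`testhost;1i)]
q)count heartbeat
1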

Integration Testing: WDB Subscription Tests

Once the basic concepts from before have been understood, it is possible to build much more complex testing scenarios. Often it is necessary to test interactions between different processes, in which case you will need more than just the one testing process. In this example we examine the interaction between a Segmented Tickerplant (STP) and three WDBs: one subscribed to everything, one to a subset of syms and one to a subset of tables. To facilitate this we need to make some additions to our test directory. As well as the CSVs containing the tests themselves, we now add three more files, process.csv, run.sh and settings.q, and we shall examine these in more detail.

process.csv

An easy way to spin up multiple custom processes is to pass a custom version of process.csv to the TorQ start script. Our file contains a discovery process, three WDBs and an STP, and can be seen here:

host,port,proctype,procname,U,localtime,g,T,w,load,startwithall,extras,qcmd
localhost,{KDBBASEPORT}+100,discovery,discovery1,${TORQHOME}/appconfig/passwords/accesslist.txt,1,0,,,${KDBCODE}/processes/discovery.q,1,,q
localhost,{KDBBASEPORT}+101,wdb,wdball,${TORQHOME}/appconfig/passwords/accesslist.txt,1,1,,,${KDBCODE}/processes/wdb.q,1,-.wdb.tickerplanttypes segmentedtickerplant,q
localhost,{KDBBASEPORT}+102,wdb,wdbsymfilt,${TORQHOME}/appconfig/passwords/accesslist.txt,1,1,180,,${KDBCODE}/processes/wdb.q,1,-.wdb.tickerplanttypes segmentedtickerplant -.wdb.subsyms GOOG,q
localhost,{KDBBASEPORT}+103,wdb,wdbtabfilt,${TORQHOME}/appconfig/passwords/accesslist.txt,1,1,180,,${KDBCODE}/processes/wdb.q,1,-.wdb.tickerplanttypes segmentedtickerplant -.wdb.subtabs quote,q
localhost,{KDBBASEPORT}+104,segmentedtickerplant,stp1,${TORQHOME}/appconfig/passwords/accesslist.txt,1,0,,,${KDBCODE}/processes/segmentedtickerplant.q,1,-schemafile ${TORQHOME}/database.q,q

As we can see, three WDBs are started up with different parameters in the 'extras' column, meaning they will have different subscription behaviour.
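
These subscription parameters can be verified by hand once the processes are up. An illustrative check over an IPC handle (gethandle is the helper used in the tests below; the variable names follow the extras column):

q)h:gethandle[`wdbsymfilt]
q)h".wdb.subsyms"    / the sym-filtered WDB should only be subscribed to GOOG
,`GOOG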

run.sh

Since we need to run multiple shell commands to bring up all the processes and run the tests, it makes sense to move them to a file:

#!/bin/bash

# Path to test directory
testpath=${KDBTESTS}/demo

# Start procs
${TORQHOME}/torq.sh start all -csv ${testpath}/process.csv

# Start test proc
/usr/bin/rlwrap q ${TORQHOME}/torq.q \
  -proctype test -procname test1 \
  -test ${testpath} \
  -load ${KDBTESTS}/helperfunctions.q ${testpath}/settings.q \
  -procfile ${testpath}/process.csv -debug

# Shut down procs
${TORQHOME}/torq.sh stop all -csv ${testpath}/process.csv

The first piece of code in this script simply sets a variable with the path to the test directory. Then the discovery, STP and WDB processes are started, with the custom CSV flag pointing to the file we just created. If only a subset of the processes in the file are needed, they can be named individually instead of using 'all'. Next, our test process is started with some extra flags. The -load flag allows other q files to be loaded; here we load a file of helper functions defined in the KDBTESTS directory and the settings.q file, which will be explored in more depth shortly. The -procfile flag again points to our custom process CSV, which makes it easier to use TorQ connection management in the tests. Finally, once all the tests have run and the test process has exited, the other processes are brought down. The tests can now be run from the command line with a simple /path/to/tests/run.sh.

settings.q

This file, while not strictly necessary, is a very handy place to store variables and functions that will be used in the tests, rather than having to declare them in the CSV itself. In this example we store TorQ connection parameters and test updates for the trade and quote tables:

// IPC connection parameters
.servers.CONNECTIONS:`wdb`segmentedtickerplant`tickerplant;
.servers.USERPASS:`admin:admin;

// Test updates
testtrade:((5#`GOOG),5?`4;10?100.0;10?100i;10#0b;10?.Q.A;10?.Q.A;10#`buy);
testquote:(10?`4;(5?50.0),50+5?50.0;10?100.0;10?100i;10?100i;10?.Q.A;10?.Q.A;10#`3);
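
Each test update is a list of column vectors that must line up with the corresponding table schema (the real schemas live in the schemafile passed to the STP, here ${TORQHOME}/database.q). A hypothetical trade schema that the seven testtrade vectors would map onto, assuming the STP supplies the time column:

trade:([]time:`timestamp$();sym:`symbol$();price:`float$();size:`int$();
  stop:`boolean$();cond:`char$();ex:`char$();side:`symbol$())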

WDB Tests

The following CSV shows a fairly straightforward use of this setup:

action,ms,bytes,lang,code,repeat,minver,comment
before,0,0,q,.servers.startup[],1,,"Start TorQ connection management"
before,0,0,q,.proc.sys "sleep 2",1,,"Wait for proc to start"
before,0,0,q,stpHandle:gethandle[`stp1],1,,"Open STP handle"
before,0,0,q,wdbHandles:`all`sym`tab!gethandle each `wdball`wdbsymfilt`wdbtabfilt,1,,"Open WDB handles"
before,0,0,q,t1:wdbHandles[`all`sym] @\: "count trade",1,,"Get initial trade table counts"
before,0,0,q,q1:(value wdbHandles) @\: "count quote",1,,"Get initial quote table counts"
run,0,0,q,"stpHandle @/: `.u.upd ,/: ((`trade;testtrade);(`quote;testquote))",1,,"Send trade and quote updates to STP"
run,0,0,q,.proc.sys "sleep 2",1,,"Wait for updates to publish"
true,0,0,q,(t1+10 5)~wdbHandles[`all`sym] @\: "count trade",1,,"Check trade update was correctly published"
true,0,0,q,(q1+10 0 10)~(value wdbHandles) @\: "count quote",1,,"Check quote update was correctly published"

The first thing that happens is that TorQ connection management is set up and handles are opened to the STP and WDBs. The initial table counts are captured, then some test updates are sent to the STP. Finally, the test process grabs the latest table counts from the WDBs and checks that they have updated correctly. Note that functions and variables not defined in the test are brought in from the settings and helper function files.
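
The @\: (apply each-left) idiom used here sends the same query over each handle on the left and collects one result per process. An illustrative session, assuming the tables are empty at startup:

q)wdbHandles[`all`sym] @\: "count trade"    / one count per WDB
0 0
q)t1+10 5    / expected after the update: all 10 trades vs the 5 GOOG trades
10 5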

Our test directory now looks like this:

wdbtests
|---- process.csv
|---- run.sh
|---- settings.q
|---- test.csv

Adding functionality to the run script

In order to make our tests more dynamic and useful, there are a few things we can do. The KDBTESTS directory contains a flagparse.sh script with some basic code for parsing command-line flags. We add this line to the top of our run script:

source $KDBTESTS/flagparse.sh

We can now pass flags to our run script, and these are as follows:

Flag          Description
-d            Starts the test process in debug mode
-s            Starts the test process in debug and stop mode (drops to the q prompt on error or test failure)
-w            Writes test results to CSVs on disk
-q            Starts the test process in quiet mode
-r timestamp  Passes a timestamp into the tests for on-disk versioning
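
For reference, a minimal sketch of the kind of parsing flagparse.sh performs. The variable names match those used in the run script below, but the exact values passed through to the test process are assumptions; consult the real script:

#!/bin/bash
# Hypothetical flag parser: maps -d/-s/-w/-q/-r onto the variables
# consumed by the run script ($debug, $stop, $write, $quiet, $run)
while getopts "dswqr:" opt; do
  case $opt in
    d) debug="-debug" ;;
    s) debug="-debug"; stop="-stop" ;;
    w) write="-write" ;;
    q) quiet="-quiet" ;;
    r) run=$OPTARG ;;
  esac
done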

The other addition is a -testresults flag on the TorQ start line in the run script. This passes in a folder where the test process will store its logs and test results in a date-partitioned folder structure. Our run script now looks like the following:

#!/bin/bash

# Handle command-line arguments
source $KDBTESTS/flagparse.sh

# Path to test directory
testpath=${KDBTESTS}/demo

# Start procs
${TORQHOME}/torq.sh start discovery1 stp1 wdball wdbsymfilt wdbtabfilt -csv ${testpath}/process.csv

# Start test proc
/usr/bin/rlwrap q ${TORQHOME}/torq.q \
  -proctype test -procname test1 \
  -test ${testpath} \
  -load ${KDBTESTS}/helperfunctions.q ${testpath}/settings.q \
  -testresults ${KDBTESTS}/demo/results/ \
  -runtime $run \
  -procfile ${testpath}/process.csv \
  $debug $stop $write $quiet

# Shut down procs
${TORQHOME}/torq.sh stop discovery1 stp1 wdball wdbsymfilt wdbtabfilt -csv ${testpath}/process.csv

Near the end there are four variables holding optional flags that are set by the flag parser script. We now have plenty of options when it comes to running the script:

# Run in debug mode
./run.sh -d
# Run in debug and stop mode
./run.sh -s
# Run in quiet mode, pass in a runtime and write results to disk
./run.sh -r 2020.10.12D16:34:44.261143000 -wq

Running Multiple Test Sets

When testing a large piece of functionality, it is very likely that there will be a large number of tests to run. In such cases it is advisable to split the tests into different technical groupings, each with its own folder of test CSVs, process and settings files and a run script. This essentially creates an isolated environment for each group of tests. However, it then becomes cumbersome to run all the tests, so a script has been created which executes the run scripts in each of the directories, saves their results to disk and displays them at the end of the run.

The runall.q script lives in the KDBTESTS directory and takes two arguments: -rundir, the top-level directory where your test sub-folders live, and -resdir, the folder where the test logs and results are kept. The script runs the run.sh script in each sub-folder of the -rundir folder in write and quiet mode, passing in the current timestamp so that all results are saved to disk in an easily versioned manner. An example run looks like the following:

q runall.q -rundir /path/to/top/test/folder/ -resdir /path/to/results/folder/

Debugging Tests

In the scenario where there are a large number of tests spread across several folders, you can use the runall mechanism to test them all. However, say there is an error in one of the test files. Normally this would be fairly difficult to track down, but the features of this framework make it much easier. At the end of the run, three things are shown on the console: a results table, a failures table and a dictionary of error logs. Any tests that fail are displayed in the failures table, and any code errors that occur are logged to disk and displayed in the dictionary; both of these sources contain the file the test was in, the line number it is on and the code itself.

A demo folder has been prepared which contains a code error and a test failure. All the tests in the folder were run as follows:

q runall.q -rundir demo -resdir demo/results

When the tests finish running, the results are displayed on the console. There are various failed tests, and an entry appears in our error logs which can be expanded out:

...
"Logged errors:"
:demo/results/2020.11.11/logs/err_eod.log| ()
:demo/results/2020.11.11/logs/err_wdb.log| ,"2020.11.11D16:37:21.716171000|aquaq-184|test|test1|ERR|KUexecerr|run error in file :/ho.."
...
q) raze value errors
"2020.11.11D16:37:21.716171000|aquaq-184|test|test1|ERR|KUexecerr|run error in file :/home/mpotter/kdbCode/segtp/deploy/tests/demo/wdb/test.csv on line 8 - stHandle. Code: 'stHandle @/: `.u.upd ,/: ((`trade;testtrade);(`quote;testquote))'"

The script has read the file demo/results/2020.11.11/logs/err_wdb.log and from the message within we can see that there is a code error in a 'run' command in the file /home/mpotter/kdbCode/segtp/deploy/tests/demo/wdb/test.csv on line 8, and the offending piece of code is also displayed. This may be enough information for us to solve the issue, but if not we can dig deeper.

We can see that the error is coming from the WDB tests, and so we run those in debug mode in the following way:

./demo/wdb/run.sh -d

This runs the tests and displays the error message we saw earlier before outputting our test results and failures and leaving us in the q session. By examining the error message and being able to access the q session we can see that the variable stHandle should in fact be stpHandle and this typo is causing the error. Once this is fixed, it can be run again and the error doesn't appear any more. There is still one test failing, however:

q) select action,code,csvline from KUerr
action code                                               csvline
-----------------------------------------------------------------
true   (t1+1 5)~t2:wdbHandles[`all`sym] @\: "count trade" 10

The best way to debug this is to break out of the tests as this line is being run and examine the state from there. We can do this by invoking stop mode in our test script:

./demo/wdb/run.sh -s

This throws us out to the q prompt at this point in the tests with the following error:

'failed to load /home/mpotter/kdbCode/segtp/deploy/tests/runtests.q : true test failure in file :/home/mpotter/kdbCode/segtp/deploy/tests/demo/wdb/test.csv on line 10

We can get the code which failed and run it here to see what it returns. From some quick debugging we can see that one of the items being added to t1 is wrong: it should be 10 rather than 1. Once this is fixed we can run the WDB tests again, and now there are no errors and all the tests pass! Running all of our tests again as at the start, no new test failures come up and the latest error logs are empty.
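
For reference, the corrected assertion matches the corresponding line of the test CSV shown earlier:

true,0,0,q,(t1+10 5)~wdbHandles[`all`sym] @\: "count trade",1,,"Check trade update was correctly published"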

What Needs Testing?

There are many areas that need testing in TorQ. There is a directory within TorQ named 'tests' where all the required tests are situated. The directories that contain the tests can be seen here:

.
├── bglaunchprocess
├── dataaccess
├── dqe
├── k4unit
├── merge
├── pcap
├── performance
├── permissions
├── rdb
└── stp

Each of these directories contains its own tests that need to be run, apart from k4unit. In 'bglaunchprocess', for example, there are the following files:

kod.q.acct  process.csv  run.sh  settings.q  test.csv

From this directory, we can run the tests with the following command:

bash run.sh -d

This will present us with two tables: KUT, which lists all the tests that have been run, and KUTR, which holds the results of those tests.

The purpose of testing is to ensure that any changes you have made to TorQ have not broken anything and that TorQ still functions properly. These tests must therefore be run on both your development branch and the master branch, and the results compared, to ensure nothing has been broken. The resulting tables can sometimes be extensive and difficult to compare from the console. I find the best way to compare the results is to apply a q-SQL statement to the KUTR table:

errors:select from KUTR where not ok

This statement returns a table of all the failed tests. The errors table can then be saved as a CSV file on both branches with:

save `:errors.csv

These two CSVs can then easily be compared with a simple text-comparison tool. If there is no difference between the two tables, then nothing has been broken and that particular test can be deemed a pass.
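
Alternatively, the comparison can be done directly in q. A minimal sketch, with hypothetical file names for the two saved result sets:

q)read0[`:errors_master.csv]~read0`:errors_dev.csv    / 1b if the branches agree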

All tests are run in this manner apart from a few, which are noted below.

RDB

The RDB does not contain a run.sh file like the majority of the other directories. To run this test, change into the tests directory and run the aforementioned basic command for running a test, changing the proctype to rdb and the procname to test1, as seen below:

q ${TORQHOME}/torq.q -proctype rdb -procname test1 -test rdb -debug

STP

The STP contains 23 subdirectories, each containing its own test. All of them are run with the standard:

bash run.sh -d

However, when running this in the Stripe directory, you will not receive the KUT and KUTR CSVs. Instead, the test creates a new directory named 'results' in the STP directory. Within results there is a subdirectory named stripe, then one for the present day's date, and in that final directory your test result CSVs will be found.

Notes on Best Practice

Here are some recommendations on making test development more straightforward:

  • Use TorQ connection management when dealing with multiple processes
  • When just unit testing one process in isolation, run the tests from an instance of that process
  • When performing integration tests that examine the interaction of multiple processes, run the tests from a 'blank' process where possible, as this ensures that test code and process code don't get in each other's way
  • Put any variable or function declarations in a settings file so as not to clutter the test code
  • If there are a large number of tests, split into folders of related tests
  • Try to keep test output isolated from the rest of the application, i.e. any logs or HDB data should be output to a separate testing location and cleared afterwards if appropriate