CARLsim
5.0.0
CARLsim: a GPU-accelerated SNN simulator
|
This tutorial will really put your CARLsim skills to the test, as we will combine different concepts and libraries to create a network of direction-selective neurons that are arranged on a spatial grid.
We will first use the VisualStimulus
toolbox to generate a drifting sinusoidal grating. Then we will use the MotionEnergy
library to parse the stimulus and generate direction-selective responses akin to those of neurons in the primary visual cortex (V1). These responses will then be assigned to a spike generator group, which we term V1. A second group, termed middle temporal area (MT) neurons, will then sum over V1 responses using a Gaussian kernel in space (x and y), but not direction (z).
At the end of the tutorial, you will know how to:
VisualStimulusToolbox
MotionEnergy
libraryMotionEnergy
output to SpikeGenerator groupsThe source code of this tutorial can be found in %%CARLSIM_ROOT_DIR%%/doc/source/tutorial/5_motion_energy
.
The MotionEnergy
library provides a CUDA implementation of Simoncelli and Heeger's Motion Energy model (Simoncelli & Heeger, 1998). The code itself is based on Beyeler et al. (2014), and comes with both a Python and a C/C++ interface.
In this tutorial, we will make use of the C/C++ interface. For this, we first have to install the library.
If you followed the CARLsim installation instructions outlined in Chapter 1: Getting Started, you already have the code in external/MotionEnergy
.
However, if your external/MotionEnergy
directory appears to be empty (e.g., because you forgot the –recursive
flag when you cloned CARLsim), you have to initialize the submodule yourself. Simply navigate to the CARLsim root directory and run the following:
Then navigate to the library directory to compile and install the code.
This will install the library to /opt/CARL/ME
. If you wish to install it in a different library, you can export an environment variable ME_LIB_DIR
prior to compiling:
You can verify the installation by running the example project:
In order to create a visual stimulus depicting a drifting sinusoidal grating, we will make use of the VisualStimulusToolbox
.
If this is your first time using the toolbox, have a look at 4.1 VisualStimulusToolbox to make sure it is correctly installed.
The easiest way to create the stimulus is to run the script script/createGratingVideo.m
. What it does is, it creates a 32x32 pixel grating using GratingStim
:
We can then add frames to empty stimulus by specifying the number of frames and the direction of motion in degrees, where 0 stands for rightward motion, 90 deg is upward, 180 deg is leftward, and 270 deg is downward. Here we want to sample the spectrum in 45 deg increments, and add 10 frames each:
This should have added 80 frames to the object.
help GratingStim.add
to find out how to change the spatial frequency of the grating, as well as other parameters.You can always have a look at what you created by using plot
:
Last thing to do is save the stimulus to file:
PlaidStim
, random dot fields using DotStim
, drifting bars using BarStim
, a combination of all of the above—or you can simply load a video using MovieStim
. Check out the documentation for more!Now it is time to set up the CARLsim network.
If you have worked through the previous tutorial, you should already know how to do this. The idea is to assign every pixel in the stimulus to a neuron in the spiking network. In order to preserve spatial relationships, we arrange all neurons on a grid that matches the image dimensions.
The stimulus is imported simply by:
We can create a Grid3D struct from the stim
dimensions. We want to create a population of V1 neurons that is selective for eight directions of motion. For this to work, we therefore need eight neurons per pixel location (each selective for a different direction of motion). We can simply assign motion direction to the third dimension of the grid struct. Hence the right Grid3D dimensions for V1 are:
We also want a grid for a population of MT neurons. We could use the same grid as for V1, but since there are more neurons in V1 than there are in MT, let's downscale the grid a bit:
So all we need to do is create groups with the above grids:
We also want V1 to be input to MT. MT neurons should sum over V1 neurons using a Gaussian kernel in x and y (let's say a kernel the size of 7x7 pixels), but they should not sum over z (which is the direction of motion). In this way, MT neurons will preserve the direction selectivity of their V1 inputs. Hence the correct RadiusRF struct has size 7x7x0:
The MotionEnergy object is initialized with the size of the stimulus we imported above:
Its job is then to parse the stimulus frame-by-frame, and convert it to a float array of filter activation values. We will have exactly one filter value per neuron in the group, so it's possible to convert this activation value into spikes.
For this to work, we have to choose a frame duration: This means that every frame in the video is worth some number of milliseconds of simulation time, let's say 50 ms:
During this time, every neuron in the group will fire spikes according to a Poisson process (using the PoissonRate object), the mean rate of which has yet to be determined. This process will get clearer in a second, but for now let's just initialize the PoissonRate container and assign it to the V1 group:
After all this, don't forget to call CARLsim::setupNetwork and set a bunch of monitors:
Then we are ready to run the network.
This is where the magic happens. We will read a frame of the stimulus movie using VisualStimulus
, pass the frame to the MotionEnergy
object, and assign the output to the PoissonRate object of the V1 group. Bam!
First, we will parse the stimulus frame-by-frame, so we'll wrap all the action in a for loop:
Two commands remain.
The first is how to read the next frame of the stimulus. This logic is all contained within VisualStimulus
, so all you need to do is simply call the readFrameChar
repeatedly to get an array of all grayscale values in an unsigned char* frame
array that we initialized above:
This will automatically access the next frame in the queue.
Then we pass the frame to the MotionEnergy object. This object has methods that expose different intermediate steps of the Motion Energy model, such as the V1 linear filter response, the normalization step, and the V1 complex cell response. It is the latter that we want here:
This method takes a frame
, processes it, and returns an array of filter values that gets written to the rateV1
container. Now, because all computations are performed on the GPU, it would be silly to copy the result from the GPU to the CPU, only to find out that the PoissonRate object also lives on the GPU—so we have to copy all data back. Instead, we want to pass the right pointer (rateV1.getRatePtrGPU()
if the PoissonRate object was allocated on the GPU), and indicate that this is a pointer to device memory (using onGPU
set to true).
And that's it!
You can compile and run the above network as follows:
The result can then be analyzed using the OAT. Visualizing network activity is easiest with the NetworkMonitor. Go back to the scripts
folder in MATLAB, and run:
This script will first plot all network activity as heat maps:
Here you can see group activity for V1 and MT over time. You can see hotspots of activity, which tends to run over the image from left to rigth as time moves on. The 8 "blobs" or subpopulations you see correspond to the 8 subpopulations of neurons we created above. Within these 8 subpopulations, we have all pixels of a stimulus frame represented, each pixel location encoded by a neuron. Thus the 8 subpopulations you see correspond to a 32x32 slice through the 32x32x8 grid of neurons.
As the drifting direction of the grating changes, so do the hotspots of activity. This is easiest to visualize using a flow field: Since we have 8 neurons coding for 8 different directions at each pixel location, we can use population vector decoding to find the net vector of motion that these 8 neurons encode. This is the second plot produced by the script:
This should make things much clearer: As time moves on, the direction of motion judgment of the neurons changes as well. You can also see that V1 activity tends to be noisier, whereas the Gaussian smoothing performed by the MT population makes the motion appear much more coherent.