pynn_spinnaker's People

Contributors

cottonaf, grey-area, neworderofjamie

Forkers

gurgese, kfriesth

pynn_spinnaker's Issues

Spike source array

Aside from plasticity, all that is really required for feature parity with sPyNNaker:

  • Finish SDRAM-bound C version of SSA and port to C++
  • Create SpikeSourceArray region that turns whatever the hell PyNN produces into SDRAM bitmaps (see the sketch below)
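
A minimal host-side sketch of the second item, assuming spike times arrive as one array per neuron, in milliseconds (the function name and bitmap layout are illustrative, not the actual region format):

import numpy as np

def spike_times_to_bitmaps(spike_trains, sim_time_ms, timestep_ms=1.0):
    """Pack per-neuron spike times into one bitfield row per timestep."""
    num_neurons = len(spike_trains)
    num_steps = int(np.ceil(sim_time_ms / timestep_ms))
    words_per_row = (num_neurons + 31) // 32
    bitmaps = np.zeros((num_steps, words_per_row), dtype=np.uint32)
    for neuron, times in enumerate(spike_trains):
        steps = (np.asarray(times) / timestep_ms).astype(int)
        steps = steps[(steps >= 0) & (steps < num_steps)]
        # Set this neuron's bit in every timestep in which it spikes
        bitmaps[steps, neuron // 32] |= np.uint32(1 << (neuron % 32))
    return bitmaps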

Partitioning improvements

  • Delay rows can mean that the row-length estimate is way off
  • Partitioning of Poisson sources is particularly naive - 1000 15 Hz Poisson sources with a 0.1 ms timestep (only ~1.5 expected spikes per timestep in total) don't need splitting into 25 pieces
  • If the initial partitioning doesn't fit on a chip, the width should be reduced and partitioning retried

Integrate profiler

  • Port profiler to C++ so it uses scope-based tags
  • Integrate extraction of profiler data nicely with PyNN

On-chip synaptic-matrix generation

  • Add functionality to determine if a Projection can be generated on-chip - annotate Connector
  • weight_fixed_point estimation
  • Add SynapticMatrix.partition_on_chip_matrix to build SubMatrix structures based on estimated upper bound on row-length
  • Implement NativeRNG for pynn_spinnaker - next() should PROBABLY throw (see the sketch after this list)
  • Implement parameter translation for different distributions supported by NativeRNG
  • Attempt to use existing parameter translation system to write out Connector parameters
  • Implement ConnectionBuilder region to write out parameters etc
  • First-pass C++ implementation - AllToAll and FixedProbability connectors; and Uniform and Constant parameters for static synaptic matrices
  • Logic to load the connection_builder executable first for synapse processor vertices which require it
  • Move the number of rows up next to the key in the connection structure rather than keeping it in the matrix
  • Move constructors into C++
  • Generally a bit too much PyNN creeping down through the stack - attempt to beat it back up!
  • The matrix type rather than the synapse type should be passed to the connection builder
  • Rowlets
  • Plastic synaptic matrices
  • Extended synaptic matrices (extra parameters)
  • OneToOneConnector
  • Seeding of NativeRNG
  • Multiple RNGs
  • 'Fallback' RNG for NativeRNG
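
A minimal sketch of the NativeRNG item above (the class shape is assumed, not the actual pynn_spinnaker API): distribution parameters are recorded on the host while the variates themselves are drawn by the on-chip connection builder, so host-side sampling deliberately throws:

class NativeRNG(object):
    """Marker RNG: variates are generated on-chip, never on the host."""

    def __init__(self, seed=None):
        self.seed = seed

    def next(self, n=1, distribution=None, parameters=None):
        # Drawing on the host would diverge from the on-chip stream
        raise NotImplementedError(
            "NativeRNG variates are generated by the on-chip connection "
            "builder and cannot be sampled on the host")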

Plasticity

  • C++ version of plasticity framework
  • Extend parameter map system to write LUTs (see the sketch after this list)
  • Different row formats for SynapticMatrix region
  • Figure out keys etc for post-spike back-propagation
  • GetWeights implementation
  • Take STDP weight constraints into account in fixed point determination
  • Flush packet sending
  • Separate the last update time from the last pre-spike time in the STDP algorithm so flushing can be a bit more efficient and possible with nearest-neighbour spike pairing
  • Provide template specialisation of PostEventHistory for PostEventHistory<void, NumEntries>
  • Some C++ variant of the runtime STDP logging, i.e. limiting logging to a particular pre+post neuron pair
  • Think about power-law weight dependence (Hermite spline LUT?) to reproduce MAD results
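
A hedged sketch of host-side LUT generation for the parameter map item above; the table size and fixed-point format are assumptions:

import numpy as np

def make_exp_decay_lut(tau_ms, timestep_ms=1.0, num_entries=256, frac_bits=11):
    """Tabulate exp(-t / tau) in signed 16-bit fixed point."""
    times = np.arange(num_entries) * timestep_ms
    decay = np.exp(-times / tau_ms)
    fixed = np.round(decay * (1 << frac_bits)).astype(np.int32)
    # Saturate at the 16-bit signed maximum before narrowing
    return np.clip(fixed, 0, (1 << 15) - 1).astype(np.int16)

tau_plus_lut = make_exp_decay_lut(tau_ms=20.0)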

Current sources

Using SpikeSourcePoisson to provide input to all neurons in a network is bad:

  • Although the SpikeSourcePoisson C++ code can run 1024 neurons on a core, the architectural rate limit is on the order of 10 Hz - well below typical cortical background rates
  • One-to-one connected spike sources are also a worst case for synapse processors

As Pinus Rigida neural populations already receive input via DMA-transferred currents, adding current source populations to implement the http://data.andrewdavison.info/docs/PyNN/reference/electrodes.html API would be relatively trivial.

Parameter changing

  • When population and projection parameters are changed, determine whether these changes require re-placement
  • If re-placement is required, download mutable parameters and run the normal flow
  • Otherwise, mark vertices and regions as dirty and re-upload in place
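
A small sketch of that decision, with illustrative names (nothing here is existing pynn_spinnaker API):

def parameter_update_action(changed_names, layout_changing_params):
    # Changes that alter memory layout (e.g. struct sizes) force re-placement;
    # value-only changes can be rewritten in SDRAM where they already sit
    if any(name in layout_changing_params for name in changed_names):
        return "re-place"    # download mutable state, run the normal flow
    return "re-upload"       # mark affected regions dirty, rewrite in place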

Improved post-synaptic partitioning

Cell types should produce partition sizes that are power-of-two shifts of 1024, i.e. factors of 1, 0.5, 0.25 and 0.125 (1024, 512, 256 and 128 neurons)

  • A constraint scheme that doesn't rely on slices matching between input and output vertices
  • Neuron populations should partition based on firing rate, timestep and cost
  • Current input populations should partition based on timestep and cost

Hypergeometric variate generator fails for large parameter values

The hypergeometric variate generator, used for the FixedTotalNumberConnector(with_replacement=False) connector, fails for large parameter values.

The hypergeometric generator generates a sample from the hypergeometric distribution with parameters ngood, nbad, and nsample. This is the distribution over the number of red balls we find when we sample nsample times without replacement from an urn containing ngood red balls and nbad white balls.

The generator works only if ngood, nbad, and nsample are less than 2^16.
In our case, nbad is the size of the submatrix of the synaptic matrix we are dealing with, minus the number of synapses within the submatrix. This can take values at least as high as 1024^2 = 2^20, well above the limit.

Note that this sampler is only used for the FixedTotalNumberConnector with with_replacement=False. This is not a commonly used connector; it is currently unimplemented in PyNN, which ignores the with_replacement flag.
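
For reference, a small numpy illustration of the distribution's semantics and of how easily the 2^16 bound is exceeded (the bound belongs to the on-chip generator, not to numpy):

import numpy as np

# Number of red balls drawn when sampling 100 times without replacement
# from an urn of 500 red (ngood) and 1500 white (nbad) balls
sample = np.random.hypergeometric(ngood=500, nbad=1500, nsample=100)

# The failing case: nbad derived from a 1024x1024 submatrix
nbad = 1024 * 1024 - 5000   # submatrix size minus an assumed synapse count
print(nbad >= 2 ** 16)      # True - outside the on-chip generator's range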

Improved synapse cluster presynaptic partitioning

Because the actual rows can become so short, the maximum synaptic event rate is potentially a poor metric. SynapseCluster could instead partition pre-synaptically based on CPU cycles, using a more complex 'constant + N * synapse' model of row cost and an estimate of the number of extension rows based on the delay distribution
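
A hedged sketch of such a cost model; the cycle constants are placeholders, not measured values:

ROW_OVERHEAD_CYCLES = 50   # assumed fixed per-row processing cost
CYCLES_PER_SYNAPSE = 10    # assumed incremental cost per synapse

def estimated_row_cycles(num_synapses, expected_extension_rows=0.0):
    """Constant + N * synapse cost model, charging each expected
    extension row the fixed per-row overhead again."""
    return (ROW_OVERHEAD_CYCLES * (1.0 + expected_extension_rows)
            + CYCLES_PER_SYNAPSE * num_synapses)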

Delay extension rows break various metrics

  • Max cols is used to allocate delay buffers - should use the non-delay partitioned row length
  • Because the actual rows can become so short, max synaptic event rate is potentially a poor metric - see 'Improved synapse cluster presynaptic partitioning' above

1024 neurons per core

There is not enough space to duplicate IF_curr_exp neuron and synapse parameters for 1024 neurons. The solution is probably to store immutable parameters once per unique parameter configuration and have a num-neurons-long array of uint16_t indices pointing at each neuron's immutable parameter set, or a neuron-long array of pointers:

  • Figure out if the difference in generated assembly is worth the extra 2 bytes per neuron
  • Generalize the python side into lazy_param_map
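
A minimal sketch of the host-side half of the uint16_t-index scheme described above, assuming immutable parameters arrive as one row per neuron:

import numpy as np

def deduplicate_params(param_rows):
    """Return unique parameter rows plus a uint16 index per neuron."""
    unique_rows, indices = np.unique(param_rows, axis=0, return_inverse=True)
    assert len(unique_rows) < (1 << 16)   # indices must fit in uint16_t
    return unique_rows, indices.astype(np.uint16)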

On-chip parameter clipping behaviour

As discussed with @grey-area, negative weight or delay parameters can come about due to:

  • Inhibitory weights - inhibitory weights are specified negative in PyNN
  • The tail of a distribution - these should be zeroed

Proposal:

  • The absolute value of 'constant' parameters and of appropriate distribution parameters (specified by lazy_param_map) should be taken on the host
  • In on-chip connection builder, delays < 1 timestep should be set to 1
  • In on-chip connection builder, unsigned weights < 0 should be set to 0
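
A minimal sketch of the two on-chip clamps (written in Python for illustration; the real logic would sit in the C++ connection builder):

def clip_synapse(weight_fixed, delay_timesteps):
    # Delays below one timestep cannot be represented - round up to 1
    delay_timesteps = max(delay_timesteps, 1)
    # Unsigned weights that went negative in a distribution tail become 0
    weight_fixed = max(weight_fixed, 0)
    return weight_fixed, delay_timesteps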

Views

Setting parameters within a view definitely doesn't work.
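
For reference, the standard PyNN pattern that fails looks like this (a sketch; the backend module name pynn_spinnaker is assumed):

import pynn_spinnaker as sim

pop = sim.Population(100, sim.IF_curr_exp())
view = pop[10:20]        # a PopulationView over neurons 10-19
view.set(tau_m=15.0)     # setting parameters through the view is broken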

Connection builder stats

It would be very useful if the connection builder had the ability to return stats, e.g. the number of overflowing rows.

Spike Source Array bad size calculation

max_spike_blocks should be the number of non-empty bins in the sum of the bincounts of each spike train in the vertex. In the constructor, spike times could be quantized once so that they can be used both in sizeof and in write_subregion_to_file (see the sketch below).
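
A hedged sketch of that calculation, assuming spike times have already been quantized to integer-timestep numpy arrays:

import numpy as np

def count_spike_blocks(quantized_trains, num_steps):
    """quantized_trains: per-neuron arrays of spike timestep indices."""
    total = np.zeros(num_steps, dtype=np.int64)
    for steps in quantized_trains:
        total += np.bincount(steps[steps < num_steps], minlength=num_steps)
    # A block is needed for every timestep in which anything spikes
    return int(np.count_nonzero(total))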

Addon module support

  • setup.py boilerplate to install a python script in the path which uses __file__ to locate the python model and then get the path to the base makefile
  • Makefiles use the script to find Makefile.depend
  • model_binaries always lives in a folder beneath the python models, so inspect.getfile(C.__class__) should find it (see the sketch after this list)
  • Expose models through native models interface
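
A sketch of the lookup in the third item, assuming binaries sit in a model_binaries folder beside the Python module defining the model:

import inspect
import os

def model_binaries_path(model):
    """Locate the model_binaries folder from a model instance's class."""
    module_file = inspect.getfile(model.__class__)
    return os.path.join(os.path.dirname(module_file), "model_binaries")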

SDRAM back propagation

Back-propagation of spikes via the router for learning seems to result in a lot of dropped packets. Instead:

  • Allocate double-buffered bit field large enough for number of neurons
  • Neuron cores should write bitfield of output every timestep
  • Synapse cores should read bitfield and apply spikes to post history
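
A small sizing sketch for the first item, assuming one bit per neuron packed into 32-bit words, with two buffers so neuron cores write the current timestep while synapse cores read the previous one:

def bitfield_bytes(num_neurons, num_buffers=2):
    words_per_buffer = (num_neurons + 31) // 32   # one bit per neuron
    return words_per_buffer * 4 * num_buffers     # 4 bytes per word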

Automatic calculation of board count to pass to spalloc

Simulator._constrain_cluster could return a chip count estimate. Perhaps the algorithm could be something like:

# List of spare core counts, one entry per chip, in ascending order
spare_cores = []
num_chips = 0

...
# Where constraints are built
num_cluster_cores = 1 + len(n.input_verts)

# Loop through the list of chips with spare cores
# **TODO** could we use bisect here?
spare_cores_found = False
for i in range(len(spare_cores)):
    # If there is enough space on this chip
    if spare_cores[i] >= num_cluster_cores:
        spare_cores[i] -= num_cluster_cores
        spare_cores_found = True
        break

# **TODO** mechanism to remove zeroed entries from spare_cores list

# If no chip was found with sufficient resources to contain this cluster,
# add a chip and a new spare_cores entry containing the cores that remain
# on it after placing this cluster
if not spare_cores_found:
    num_chips += 1
    spare_cores.append(16 - num_cluster_cores)

Assemblies

  • Setting the parameters of multiple populations using an assembly
  • Creating multiple population-to-population connections using an assembly

Code quality

  • PEP8
  • Automated tests for stuff within spinnaker_folder i.e. region level stuff
  • Investigate what higher-level testing PyNN has and integrate with that
  • Python code review
  • C++ code review

Python profile

Figure out where time is being spent in the Python side of Pinus Rigida and attempt to optimise.

Spike recording memory not zeroed

This is generally not an issue but, when the neuron processor runs out of CPU, timer interrupts get skipped and data doesn't get written, leading to garbage being returned.

Renaming

  • NeuronVertex -> OutputVertex
  • SynapseVertex -> InputVertex (encompassing SDRAM current-based inputs)
  • Not happy with the 'Population' in SynapsePopulation, NeuralPopulation and CurrentInputPopulation - Populations are PyNN constructs, each consisting of a number of synapse and neural populations, each of which in turn consists of a number of homogeneous input and output vertices!
  • Pinus Rigida may not actually be the best name :-(

Poisson source is irritatingly complicated

Separating fast and slow sources MAY be optimal but:

  • Changing rate can change size of struct etc etc
  • All the lazy_param_map.apply_indices crud could be removed
  • Special purpose code in regions.SpikeSourcePoisson could be removed
