Git Product home page Git Product logo

hwpe-stream's Introduction

Documentation Status

If you are using these IPs for an academic publication, please cite the following paper:

@article{conti2018xne,
  author={F. {Conti} and P. D. {Schiavone} and L. {Benini}},
  journal={IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems},
  title={XNOR Neural Engine: A Hardware Accelerator IP for 21.6-fJ/op Binary Neural Network Inference},
  year={2018},
  doi={10.1109/TCAD.2018.2857019},
  ISSN={0278-0070},
}

See documentation on https://hwpe-doc.readthedocs.io/en/latest/.

Repository content

The hwpe-stream repository contains the definition of the HWPE-Stream and TCDM interfaces used with HWPEs (HW Processing Engines), as well as the IPs necessary to manage the streams and construct streamers, e.g. for the XNE, HWCE, etc. This repository contains the following IPs:

basic - basic IPs to manage HWPE-Streams:

  • mux_static: multiplexes HWPE-Streams, driven by a static selection signal
  • demux_static: demultiplexes for HWPE-Streams, driven by a static selection signal
  • merge: merges multiple same-sized HWPE-Streams into one with bigger data width
  • split: splits a HWPE-Stream in multiple ones with smaller data width
  • fence: synchronizes handshakes between a set of streams
  • buffer: delays a stream by 1 cycle (for timing purposes)

fifo - FIFO decoupling queues for HWPE-Streams:

  • fifo: parametric FIFO
  • fifo_sidech: parametric FIFO (with side channel)
  • fifo_earlystall: parametric FIFO stalling one cycle before being full
  • fifo_earlystall_sidech: parametric FIFO stalling one cycle before being full (with side channel)
  • fifo_ctrl: standalone FIFO controller
  • fifo_scm: standard cell memory (SCM) usable to implement a FIFO memory
  • fifo_scm_test_wrap: BIST wrapper for SCM-based FIFO

tcdm - IPs to manage TCDM streams based on HWPE-Stream building blocks:

  • tcdm_mux_static: multiplexes TCDM streams, driven by a static selection signal
  • tcdm_reorder_static: reorders TCDM streams, driven by a static order
  • tcdm_mux: multiplexes TCDM streams dynamically, according to a round-robin policy
  • tcdm_reorder: reorders TCDM streams dynamically, according to a round-robin policy
  • tcdm_fifo_store: parametric FIFO decoupling queue for TCDM streams (only memory write operations)
  • tcdm_fifo_load: parametric FIFO decoupling queue for TCDM streams (only memory read operations)
  • tcdm_fifo_load_sidech: parametric FIFO decoupling queue for TCDM streams (only memory read operations, with side channel)

streamer - IPs to transform TCDM streams in HWPE-Streams and viceversa:

  • sink: generates a HWPE-Stream from a TCDM load stream, according to a 3D strided pattern in TCDM
  • source: generates a TCDM store stream from a HWPE-Stream, according to a 3D strided pattern in TCDM
  • addressgen: generates memory addresses according to a 3D strided pattern in TCDM
  • strbgen: generates strobes for non-aligned lines
  • sink_realign: HWPE-Stream realigner for non-aligned TCDM store streams
  • source_realign: HWPE-Stream realigner for non-aligned TCDM load streams

Testbench

The repository contains at the moment only a skeleton infrastructure for verification, with a simple FIFO test (rtl/verif/tb_fifo.sv). To fetch the required IPs using Bender and generate build scripts:

make update-ips

To build the simulation infrastructure:

make hw-all

To perform a test with QuestaSim:

make run

The gui option enables the QuestaSim GUI, while P_STALL_GEN and P_STALL_RECV control the stall probability of the traffic generators and receivers (from 0.0 to 0.99). E.g., to perform a test with 80% receiver stall probability, 15% generator stall probability and GUI, the command is:

make run gui=1 P_STALL_GEN=0.15 P_STALL_RECV=0.8

hwpe-stream's People

Contributors

andreaskurth avatar arpansur avatar florent-gwt avatar francescoconti avatar giannap avatar iislut avatar micprog avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

hwpe-stream's Issues

Single element lines with misaligned base address cause wrong data retrieval

When issuing a streamer request with the following addressgen control parameters half of the data seems to be retrieved, due to a hypothetically anticipated ending of req signals to TCDM memory:

base address = 0x12
transaction size = 0x80
line stride = 0x20
line length = 1
feat stride = 0
feat length = 80
loop outer = 0
realign type = 0
line length remainder = 0

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.