Deep Neural Network Accelerator Design

Project Overview

This project implements an accelerator core that runs multi-layer perceptron (MLP) inference on the MNIST handwritten digit dataset. The system takes 28×28 grayscale images and classifies each one as a digit (0-9).

Implementation:

Hardware Design: RTL implementation of matrix-vector operations in Q16.16 fixed-point arithmetic.
System Integration: Interfacing the accelerator with an embedded soft-core CPU and off-chip SDRAM.
Software Integration: Modifying provided software to utilize the accelerator.

Key Components:

RTL Design: Accelerator core to compute dot products and apply the ReLU activation function.
Nios II System: Embedded soft-core CPU for system control.
Off-chip SDRAM Interface: Handling external memory for weights, activations, and biases.
Clock Management: Generating system clocks with desired properties using PLLs.
Testing Framework: Comprehensive testbenches to validate functionality.

Features

Accelerator Core: Computes matrix-vector dot products with Q16.16 fixed-point precision.
ReLU Activation: Implements a hardware version of the rectified linear unit.
Avalon Interface: Supports both servant and master interfaces for flexible data transfer.
Pre-trained Model: Utilizes a pre-trained MLP with two 1000-neuron hidden layers.
MNIST Inference: Performs inference for digit classification.
Nios II Integration: Communicates with the processor for control and debugging.
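
The classification step can be illustrated with a short C sketch. The layer sizes follow from the description above (28×28 = 784 inputs, two 1000-neuron hidden layers, 10 digit classes). The README does not say how the final class is chosen, so picking the largest output score (argmax) is an assumption here, and the names MNIST_INPUTS, MNIST_HIDDEN, MNIST_CLASSES and argmax_q16 are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

/* Layer sizes implied by the README: 28x28 inputs, two hidden layers, 10 digits. */
#define MNIST_INPUTS  784   /* 28 * 28 pixels */
#define MNIST_HIDDEN  1000  /* neurons in each of the two hidden layers */
#define MNIST_CLASSES 10    /* digits 0-9 */

/*
 * ASSUMPTION: the predicted digit is the index of the largest output score.
 * Scores are Q16.16 values stored as int32_t, so a plain integer comparison
 * orders them correctly. Requires n >= 1.
 */
static size_t argmax_q16(const int32_t *scores, size_t n)
{
    size_t best = 0;
    for (size_t i = 1; i < n; ++i) {
        if (scores[i] > scores[best]) {
            best = i;
        }
    }
    return best;
}
```

Calling argmax_q16(output_scores, MNIST_CLASSES) on the final layer's outputs would return the predicted digit under this assumption.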

Implementation Details

Accelerator Design

Q16.16 Fixed-Point Arithmetic:
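- Each value is a 32-bit signed integer in units of 1/65536 (16 integer bits, 16 fractional bits).
- Arithmetic operations are adjusted to keep results in Q16.16; for example, the product of two Q16.16 values must be shifted right by 16 bits.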
Avalon Interconnect:
- Reads weights and activations from SDRAM using the master interface.
- Handles multiple sequential requests for matrix-vector multiplication.
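
To make the Q16.16 adjustment concrete, here is a minimal C sketch of a fixed-point multiply. It is not taken from the project's RTL. The names q16_16_t, q16_from_double and q16_mul are hypothetical, and the choice to truncate (rather than round) the product is an assumption.

```c
#include <stdint.h>

/* Q16.16: a signed 32-bit integer whose real value is raw / 65536. */
typedef int32_t q16_16_t;

#define Q16_FRAC_BITS 16
#define Q16_ONE ((q16_16_t)1 << Q16_FRAC_BITS)  /* 1.0 is stored as 65536 */

/* Convert a real number to Q16.16 (for building test values only). */
static q16_16_t q16_from_double(double x)
{
    return (q16_16_t)(x * (double)Q16_ONE);
}

/*
 * Multiply two Q16.16 values. The raw product carries 32 fractional bits,
 * so it is computed in 64 bits and shifted right by 16 to return to Q16.16.
 * ASSUMPTION: the low bits are truncated, and >> on a negative value is an
 * arithmetic shift (true on common compilers, implementation-defined in C).
 */
static q16_16_t q16_mul(q16_16_t a, q16_16_t b)
{
    int64_t wide = (int64_t)a * (int64_t)b;
    return (q16_16_t)(wide >> Q16_FRAC_BITS);
}
```

For example, multiplying the encodings of 1.5 and 2.0 gives the encoding of 3.0, which is 3 * 65536. Addition and subtraction need no rescaling because both operands already share the same scale.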

Memory Mapping

Word Offset	Description
0	Start computation / Read result
2	Weight matrix byte address
3	Input activations vector byte address
5	Input activations vector length
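
As a rough illustration of how software might drive these registers, the sketch below programs the three parameters and then uses word 0 to start the computation and fetch the result. The word offsets come from the table above. Everything else is an assumption: the function name dot_accel_run, the base pointer, the idea that any write to word 0 starts the computation, and the idea that a read of word 0 stalls until the result is ready.

```c
#include <stdint.h>

/* Word offsets taken from the register table. */
enum {
    DOT_REG_START_RESULT = 0,  /* start computation / read result */
    DOT_REG_WEIGHT_ADDR  = 2,  /* weight matrix byte address */
    DOT_REG_INPUT_ADDR   = 3,  /* input activations vector byte address */
    DOT_REG_INPUT_LEN    = 5   /* input activations vector length */
};

/*
 * Hypothetical driver routine. `base` points at the accelerator's
 * memory-mapped registers, viewed as an array of 32-bit words.
 */
static int32_t dot_accel_run(volatile uint32_t *base,
                             uint32_t weight_addr,
                             uint32_t input_addr,
                             uint32_t length)
{
    base[DOT_REG_WEIGHT_ADDR] = weight_addr;
    base[DOT_REG_INPUT_ADDR]  = input_addr;
    base[DOT_REG_INPUT_LEN]   = length;

    /* ASSUMPTION: writing any value to word 0 starts the computation. */
    base[DOT_REG_START_RESULT] = 0;

    /* ASSUMPTION: reading word 0 waits for and returns the Q16.16 result. */
    return (int32_t)base[DOT_REG_START_RESULT];
}
```

On the real system, base would be the accelerator's address as assigned in the Nios II system. Passing a plain array instead lets the write sequence be checked on a host machine.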

Testing

Hardware Testing

Unit Tests: Designed tb_rtl_dot.sv to validate individual modules.
System Integration Tests: Mocked the SDRAM and Avalon interfaces to verify end-to-end functionality.

Software Testing

Functional Verification: Used C to create a software-only implementation of the matrix-vector product for comparison.
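
A software-only reference of this kind might look like the following C sketch. It is not the project's code. The function names, the row-major weight layout, the per-product truncation and the optional bias and ReLU steps are all assumptions made for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Q16.16 dot product of two length-n vectors.
 * ASSUMPTION: each product is truncated back to Q16.16 before accumulation,
 * and inputs are small enough that the 32-bit accumulator does not overflow. */
static int32_t ref_dot_q16(const int32_t *w, const int32_t *x, size_t n)
{
    int32_t acc = 0;
    for (size_t i = 0; i < n; ++i) {
        acc += (int32_t)(((int64_t)w[i] * (int64_t)x[i]) >> 16);
    }
    return acc;
}

/* ReLU: negative values become zero, everything else passes through. */
static int32_t ref_relu(int32_t v)
{
    return v < 0 ? 0 : v;
}

/* y = W * x + b, optionally followed by ReLU.
 * ASSUMPTION: W is stored row-major as rows * cols Q16.16 values. */
static void ref_matvec_q16(const int32_t *w, const int32_t *b,
                           const int32_t *x, int32_t *y,
                           size_t rows, size_t cols, int use_relu)
{
    for (size_t r = 0; r < rows; ++r) {
        int32_t sum = b[r] + ref_dot_q16(&w[r * cols], x, cols);
        y[r] = use_relu ? ref_relu(sum) : sum;
    }
}
```

Running the same weights and inputs through this model and through the accelerator, then comparing the outputs word for word, gives a direct functional check.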
On-Hardware Debugging: Used a DE1-SoC board with JTAG and UART for debugging and result verification.

vaibhavambastha/dnn-accelerator-design
