author Nathan Bailey <nathan.bailey@arm.com> 2024-06-03 09:58:31 +0100
committer Nathan Bailey <nathan.bailey@arm.com> 2024-06-13 13:18:44 +0100
commit 09b5122bab771161377321e3f17e05465171ad06
tree 74676eb296c110a925996448d86cc9dcde28b002 /README.md
parent 9896c7e97da38cdaa14953fdce81a29397d1fca3
feat: Unstructured Sparsity Rewrites for Fully Connected and Conv2D Layers
Adds support for unstructured polynomial decay pruning rewrites

Resolves: MLIA-1171
Signed-off-by: Nathan Bailey <nathan.bailey@arm.com>
Change-Id: I9e753f35f8afe53aa24b87d794ff6986a571168f
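
Reviewer note: the new profiles in the diff below set Initial Sparsity 0.25, End Sparsity 0.5 and End Step 48000. The following is a minimal sketch of how a polynomial decay schedule could ramp the target sparsity between those values. It assumes the common form used by pruning schedules such as `PolynomialDecay` in TensorFlow Model Optimization; the function name, the begin step of 0 and the exponent of 3 are illustrative assumptions and are not taken from this commit.

```python
def polynomial_decay_sparsity(
    step: int,
    initial_sparsity: float = 0.25,  # "Initial Sparsity" column in the profile
    end_sparsity: float = 0.5,       # "End Sparsity" column in the profile
    end_step: int = 48000,           # "End Step" column in the profile
    begin_step: int = 0,             # assumption: pruning starts at step 0
    power: float = 3.0,              # assumption: cubic decay exponent
) -> float:
    """Return the target sparsity at a given training step (hypothetical helper)."""
    if end_step <= begin_step:
        raise ValueError("end_step must be greater than begin_step")
    # Fraction of the pruning window completed, clamped to [0, 1].
    progress = (step - begin_step) / (end_step - begin_step)
    progress = min(max(progress, 0.0), 1.0)
    # Sparsity moves quickly away from the initial value and flattens near the end.
    return end_sparsity + (initial_sparsity - end_sparsity) * (1.0 - progress) ** power


if __name__ == "__main__":
    for step in (0, 12000, 24000, 36000, 48000):
        print(step, round(polynomial_decay_sparsity(step), 4))
```

Under these assumptions the schedule starts at the initial sparsity, reaches the end sparsity at the end step, and stays there afterwards. Unlike the 2:4 profiles, no block pattern is implied: only the overall fraction of zeroed weights is targeted.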
Diffstat (limited to 'README.md')
-rw-r--r-- README.md | 10
1 files changed, 10 insertions, 0 deletions
diff --git a/README.md b/README.md
index efe2fbc..a684342 100644
--- a/README.md
+++ b/README.md
@@ -187,8 +187,10 @@ The following rewrites are supported:
* fully-connected - replaces a subgraph with a fully connected layer
* fully-connected-sparsity - replaces a subgraph with a pruned 2:4 sparse fully connected layer
+* fully-connected-unstructured-sparsity - replaces a subgraph with an unstructured pruned fully connected layer
* fully-connected-clustering - replaces a subgraph with a clustered fully connected layer
* conv2d-sparsity - replaces a subgraph with a pruned 2:4 sparse conv2d layer
+* conv2d-unstructured-sparsity - replaces a subgraph with an unstructured pruned conv2d layer
* conv2d-clustering - replaces a subgraph with a clustered conv2d layer
**Note:** A ***Keras model*** (.h5 or SavedModel) is required as input to
@@ -237,6 +239,10 @@ There are a number of predefined profiles for rewrites shown below:
| :-----------------------------------: | :--------: | :--: | :-----------: | :---: | :---------: | :-------: | :---------: | :---------: | :--------: | :--------: |
| optimization-fully-connected-pruning | 32 | 1e-3 | True | 48000 | "cosine" | 1 | 0 | None | 2 | 4 |
+| Name | Batch Size | LR | Show Progress | Steps | LR Schedule | Num Procs | Num Threads | Checkpoints | Initial Sparsity | End Sparsity | End Step |
+| :-----------------------------------: | :--------: | :--: | :-----------: | :---: | :---------: | :-------: | :---------: | :---------: | :--------: | :--------: | :--------: |
+| optimization-fully-connected-unstructured-pruning | 32 | 1e-3 | True | 48000 | "cosine" | 1 | 0 | None | 0.25 | 0.5 | 48000 |
+
| Name | Batch Size | LR | Show Progress | Steps | LR Schedule | Num Procs | Num Threads | Checkpoints | Num Clusters | Cluster Centroids Init | Activation | Kernel Size |
| :-------------------------------------: | :--------: | :--: | :-----------: | :---: | :---------: | :-------: | :---------: | :---------: | :----------: | :--------------------------------: | :--------: | :---------: |
| optimization-conv2d-clustering | 32 | 1e-3 | True | 48000 | "cosine" | 1 | 0 | None | 16 | "CentroidInitialization.LINEAR" | "relu" | 3x3 |
@@ -245,6 +251,10 @@ There are a number of predefined profiles for rewrites shown below:
| :-----------------------------------: | :--------: | :--: | :-----------: | :---: | :---------: | :-------: | :---------: | :---------: | :--------: | :--------: | :--------: | :---------: |
| optimization-conv2d-pruning | 32 | 1e-3 | True | 48000 | "cosine" | 1 | 0 | None | 2 | 4 | "relu" | 3x3 |
+| Name | Batch Size | LR | Show Progress | Steps | LR Schedule | Num Procs | Num Threads | Checkpoints | Initial Sparsity | End Sparsity | End Step | Activation | Kernel Size |
+| :-----------------------------------: | :--------: | :--: | :-----------: | :---: | :---------: | :-------: | :---------: | :---------: | :--------: | :--------: | :--------: | :--------:| :---------: |
+| optimization-conv2d-unstructured-pruning | 32 | 1e-3 | True | 48000 | "cosine" | 1 | 0 | None | 0.25 | 0.5 | 48000 | "relu" | 3x3 |
+
These are summarized below:
* optimization - Provides training parameters for rewrites