I love your tool, I think it's very interesting. Are you planning on adding a support

Is there a way to implement in the future LSTM, RNN, GRUs support?

calculatedcontent commented on July 17, 2024

Is there a way to implement in the future LSTM, RNN, GRUs support?

from weightwatcher.

Comments (21)

charlesmartin14 commented on July 17, 2024

Yes I am starting on this now. We just need some pre-trained models to test on. If you can provide one, that would speed things up

charlesmartin14 commented on July 17, 2024

It would be helpful to have some pretrained LSTMs to test this on. Can you provide some?

arvoelke commented on July 17, 2024

Is there any update on the status of support for RNNCell layers such as LSTM, GRU, etc? This would be quite the breakthrough as inferring the recurrent nonlinear dynamics of a network from the structure of its weight matrices is a very difficult open problem.

We just need some pre-trained models to test on.
It would be helpful to have some pretrained LSTMs to test this on. Can you provide this?

There are quite a few in the collection of Keras examples, e.g.:

They should be fairly fast to run and get a trained model out of. Do you need something much larger w.r.t. the model or the dataset?

charlesmartin14 commented on July 17, 2024

I'll take a look again.
We did try this once, but the internal GRU matrices did not have enough eigenvalues to say anything meaningful. The internal matrices in the cells were very rectangular, whereas this approach works better for matrices that have at least 50 eigenvalues.
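To make the eigenvalue-count argument concrete, here is a minimal sketch. It assumes a Keras-style GRU weight layout (a `kernel` of shape `(input_dim, 3 * units)` and a `recurrent_kernel` of shape `(units, 3 * units)`), which the thread does not confirm. The layer sizes and the helper names (`num_eigenvalues`, `gru_weight_shapes`, `report`) are hypothetical; only the threshold of 50 comes from the comment above.

```python
# Sketch: count the eigenvalues available for a spectral fit on GRU weights.
# Assumed layout (Keras-style, not confirmed by the thread):
#   kernel           has shape (input_dim, 3 * units)
#   recurrent_kernel has shape (units,     3 * units)

MIN_EIGENVALUES = 50  # threshold quoted in the comment above


def num_eigenvalues(shape):
    """A rows x cols matrix W has min(rows, cols) singular values, so its
    correlation matrix has at most that many nonzero eigenvalues."""
    rows, cols = shape
    return min(rows, cols)


def gru_weight_shapes(input_dim, units):
    """Shapes of the two stacked GRU weight matrices under the assumed layout."""
    return {
        "kernel": (input_dim, 3 * units),
        "recurrent_kernel": (units, 3 * units),
    }


def report(input_dim, units):
    """Map each matrix name to (eigenvalue count, meets the threshold?)."""
    result = {}
    for name, shape in gru_weight_shapes(input_dim, units).items():
        n = num_eigenvalues(shape)
        result[name] = (n, n >= MIN_EIGENVALUES)
    return result


# Hypothetical small GRU: 8 input features, 32 hidden units.
small = report(input_dim=8, units=32)
for name, (n, ok) in small.items():
    print(f"{name}: {n} eigenvalues, enough for a fit: {ok}")
```

Under these assumptions, stacking the gates makes the matrices wider but does not add eigenvalues: the count is capped by the smaller dimension, so a cell with few input features or few hidden units stays below the threshold.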

charlesmartin14 commented on July 17, 2024

I have a new idea to apply to this. If there is demand, we can explore it.

dan-jacobson commented on July 17, 2024

I would love love love support for these layer types, and would be willing to contribute to this feature. What's the new idea, and could I be helpful at all in implementing it?

Alamwealthkid commented on July 17, 2024

I'm just playing around with hahahah

charlesmartin14 commented on July 17, 2024

The LSTM approach requires different techniques from RMT.

To start this, I need some sample models with a wide range of accuracies

Any idea where to get these?

dan-jacobson commented on July 17, 2024

Yeah absolutely -- I'll happily train a couple different LSTMs and checkpoint them along the way. I'll try to do two or three different tasks.

I'll do a classic Karpathy-esque char-rnn (using LSTMs) like this https://github.com/JY-Yoon/RNN-Implementation-using-NumPy/blob/master/RNN%20Implementation%20using%20NumPy.ipynb

and then maybe some sort of time series forecasting task. I'll implement them and point you to the repo on GitHub when I have them trained.

Sound good?

dan-jacobson commented on July 17, 2024

I've pushed a repo with what I trained last night, including checkpoints along the way:

https://github.com/dan-jacobson/example_lstms

charlesmartin14 commented on July 17, 2024

Thank you! I'll try to get to it shortly. I sprained my arm this weekend, and it's going to be difficult to type for a couple of weeks. If you want to try it yourself: we can't compute alpha, but we can compute other metrics, like the spectral norm and the MP soft rank.
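For anyone who wants to try the two metrics by hand, here is a rough sketch in plain Python. It makes several assumptions that are not from the thread: the MP soft rank is taken to be the ratio of the Marchenko-Pastur bulk edge to the largest eigenvalue of the correlation matrix, and the bulk edge is approximated from the raw variance of the matrix entries rather than from a fitted MP distribution, so the numbers will not match what WeightWatcher itself reports. The function names are hypothetical.

```python
import math
import random


def spectral_norm(W, iters=200, seed=0):
    """Largest singular value of W (a list of rows), via power iteration on W^T W."""
    rng = random.Random(seed)
    n_rows, n_cols = len(W), len(W[0])
    v = [rng.gauss(0.0, 1.0) for _ in range(n_cols)]
    for _ in range(iters):
        u = [sum(W[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]  # u = W v
        v = [sum(W[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]  # v = W^T u
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0.0:
            return 0.0
        v = [x / norm for x in v]
    # With v normalized, ||W v|| approximates the largest singular value.
    u = [sum(W[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]
    return math.sqrt(sum(x * x for x in u))


def mp_softrank_estimate(W):
    """Crude MP soft rank: (assumed MP bulk edge) / (largest eigenvalue of W^T W / N).

    Assumption: the bulk edge is sigma^2 * (1 + sqrt(M / N))^2, with sigma^2 the
    raw variance of the entries, N the larger dimension and M the smaller one.
    """
    n_rows, n_cols = len(W), len(W[0])
    N, M = max(n_rows, n_cols), min(n_rows, n_cols)
    entries = [x for row in W for x in row]
    mean = sum(entries) / len(entries)
    var = sum((x - mean) ** 2 for x in entries) / len(entries)
    bulk_edge = var * (1.0 + math.sqrt(M / N)) ** 2
    lambda_max = spectral_norm(W) ** 2 / N
    if lambda_max == 0.0:
        raise ValueError("matrix has zero spectral norm")
    return bulk_edge / lambda_max


# Toy 2 x 2 matrix standing in for one LSTM gate matrix from a checkpoint.
W = [[3.0, 0.0], [0.0, 1.0]]
print("spectral norm:", spectral_norm(W))
print("MP soft rank (rough):", mp_softrank_estimate(W))
```

Neither quantity needs a power-law fit, which is why they remain computable on matrices too small for alpha. In practice you would load each checkpoint's weight matrices and track both values across epochs.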

dan-jacobson commented on July 17, 2024

No worries! Hope your arm heals quickly. I'll try those ideas in the next week or so.

charlesmartin14 commented on July 17, 2024

I ran an initial analysis on the LSTMs

The alphas actually increase with increasing epoch

charlesmartin14 commented on July 17, 2024

Including epoch 10, the correlation flow is very different

charlesmartin14 commented on July 17, 2024

Can you upload the LSTM initial state (before epoch 0)?

Also, what stopping criteria did you use?

dan-jacobson commented on July 17, 2024

Ooh not sure I saved the initial random state. I can retrain and save the initial state this time.

Right now I use no stopping criterion: I just let it run for a while, then interrupted training once the samples (I'm printing one out each epoch) started to look reasonably coherent. Happy to introduce one if you'd like.

9527-ly commented on July 17, 2024

like the spectral norm and the MP soft rank

Hi @dan-jacobson. Have you studied the applicability of other metrics, like the spectral norm and the MP soft rank?

charlesmartin14 commented on July 17, 2024

Yes, the MP soft rank is probably the correct metric for these systems. The current implementation may not work well for LSTMs. I can try to add it when I get back from vacation in a couple of weeks. In the meantime, you can join the Discord channel and ping me there.

9527-ly commented on July 17, 2024

Thank you. I will try. Have a nice holiday

bkocis commented on July 17, 2024

Any news on the state of the implementation for LSTM layers?

charlesmartin14 commented on July 17, 2024

WeightWatcher is not really designed for LSTMs. LSTM weight matrices are usually very tall and very thin, and they do not have enough eigenvalues for analysis using random matrix theory or my quantum field theory approach.
