Comments (8)
Hi @Ereebay, thanks for your interest in our work. meta_test() is the adaptation process described in the paper.
"I guess the meta training in the meta_test() is the adaptation process mentioned in the inference process. And the meta test without task knowledge is the task prediction." this is correct.
"But why does the task prediction happens after the adaptation in the code? It seems that you directly use the task_id to do the adaptation and class prediction." - On implementation wise, I do adaptation for all the tasks and store them, and the actual testing happens in the plot_cifar.ipynb file. Mainly because of computational efficiency so that I don't have to use GPUs and gradient updates at test time.
Hope this answers your questions. Feel free to ask if you have any more.
from itaml.
Hi @brjathu, thanks for your reply. I have one more question about the parameters.
The model parameters contain the feature extractor's parameters (theta) and the classifiers' parameters (phi). To keep multiple classifiers' parameters, the code can copy the base model's parameters. But the feature parameters should be updated across all the tasks, and in the code they also seem to be reset to the base:
for task_idx in range(1 + self.args.sess):
    idx = np.where((np_targets >= task_idx * self.args.class_per_task) &
                   (np_targets < (task_idx + 1) * self.args.class_per_task))[0]
    ai = self.args.class_per_task * task_idx
    bi = self.args.class_per_task * (task_idx + 1)
    ii = 0
    if len(idx) > 0:
        sessions.append([task_idx, ii])
        ii += 1
    for i, (p, q) in enumerate(zip(model.parameters(), model_base.parameters())):
        p = copy.deepcopy(q)
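The reset-then-adapt pattern being discussed can be sketched as below. This is a minimal illustration with invented names (plain numpy arrays standing in for the model's theta and phi), not the iTAML implementation: before each task's inner loop the parameters are restored from the base model, and then only theta plus that task's slice of phi receive updates.

```python
import numpy as np

class_per_task = 2
num_tasks = 3

# Stand-ins for the model: shared feature weights (theta) and one
# classifier output per class across all tasks (phi).
base = {
    "theta": np.zeros(4),
    "phi": np.zeros(num_tasks * class_per_task),
}

adapted = []
for task_idx in range(num_tasks):
    # Reset to the base parameters (a real copy, so base stays untouched).
    params = {k: v.copy() for k, v in base.items()}

    # Inner loop: only theta and this task's slice of phi are updated.
    ai = class_per_task * task_idx
    bi = class_per_task * (task_idx + 1)
    params["theta"] += 1.0        # stand-in for a gradient step on theta
    params["phi"][ai:bi] += 1.0   # only entries ai:bi of phi change
    adapted.append(params)
```

After the loop, each entry of `adapted` has a fully updated theta but a phi that differs from the base only in its own task's slice.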
from itaml.
"Hi, at the start of each inner-loop update, do we move from the model_base parameters to task-specific parameters?"
When copying, both theta and phi are copied, but only theta and the part of phi corresponding to that task are updated with gradients in the inner loop.
from itaml.
"When copying, both theta and phi are copied, but only theta and the part of phi corresponding to that task are updated with gradients in the inner loop."
So theta isn't updated across all the tasks? It looks like the model copies the base model's parameters for every task.
from itaml.
No, it will. After finishing a single pass of the loop starting at line 122, model_base is replaced with model; see line 121:
model_base = copy.deepcopy(model)
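The outer-loop handoff described here can be shown with a toy sketch (the numbers and the one-field "model" are hypothetical, not the real training loop): each task's inner loop starts from model_base, and once a task finishes, model_base is replaced by the adapted model, so theta does accumulate knowledge across tasks.

```python
import copy

model_base = {"theta": 0.0}
history = []
for task_idx in range(3):
    model = copy.deepcopy(model_base)  # each task starts from the base
    for batch in range(4):             # inner loop: per-batch updates
        model["theta"] += 1.0          # stand-in for a gradient step
    model_base = copy.deepcopy(model)  # the line-121 step: base <- model
    history.append(model_base["theta"])

print(history)  # theta keeps growing across tasks: [4.0, 8.0, 12.0]
```

Resetting to model_base therefore rewinds at most one task's worth of batches, not the whole training history.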
from itaml.
"θ is updated in the inner loop for all tasks."
My confusion is about this sentence. I thought theta was updated continuously from the first task to the last, but I'm afraid I misunderstood. Does this mean theta is updated from the base model for every task, not from the previous task?
from itaml.
"Does this mean theta updated form the base model for every task right? Not the previous task." yes this is correct. sorry for the confusion. Although theta is updated form the base model for every task right, for each batch would be more meaningfull.
from itaml.
Thanks again for your patient explanation! It resolves my confusion.
from itaml.