Evaluating rate of convergence #1071
-
Hi Everyone, I'm researching how continual learning methods can help improve the speed at which we converge to a solution. The idea is that training a model on Task A and using the same model to train Task B could result in Task B converging to a solution quicker by reusing the parameters from Task A. In order to test this, I've trained a model using the Joint Strategy on Task B i.e. training a model from scratch comparing this to a Naive strategy where I'd like the first experience to train the model on Task A and then the second experience on Task B. For the Naive strategy I've set up the following, task B in this case is training the model to recognize classes 2 and 3:
The problem is this line: This line bounds the targets to 0,1 for the second experience instead of 2,3 because the model only expects 2 classes, so targets>2 results in an index error. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
You can remove that line since there is no need to change the targets. You can either use an |
Beta Was this translation helpful? Give feedback.
You can remove that line since there is no need to change the targets. You can either use an
IncrementalClassifier
, which automatically expands the head with new classes, or use aMultiHeadClassifier
.