Can't train on GPU . Model Quick96 error #132
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Choose one or several GPU idxs (separated by comma).
[CPU] : CPU
[0] : METAL
[0] Which GPU indexes to choose? :
0
Metal device set to: Apple M2 Pro
systemMemory: 16.00 GB
maxCacheSize: 5.33 GB
18 devices <core.leras.device.Devices object at 0x120f93dc0>
GPU COUNT 1
gpu id 0
devices /CPU:0
Initializing models: 0%| | 0/5 [00:00<?, ?it/s]
It shows that tf.reduce_mean is the problem , what can i do to solve his problem.
Pleasehelp
I solved it by using the appropriate versions of tensorflow-macos==2.8.0 , tensorflow-metal==0.5.0 and numpy== 1.23.
Now at-least the training starts but , loss doesn't go down. It just randomly calculates . Even in the preview, columns 2,4,5 which contain the learned outputs don't show up.
So definitely something wrong with the model as even on CPU the same problem of loss not going down persists.