NVIDIA RTX 3060 (12 GB GDDR6) - Inference is slow for DeepSeek-R1 32B (32 billion parameters)
Reason 1 - Possibly system memory (Action: Increase system RAM to 64 GB and check whether inference speed improves.)
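Reason 2 - GPU dedicated memory (Action: Upgrade from the RTX 3060 to a GPU with more VRAM, 16 GB or 24 GB. A 12 GB card would add no dedicated memory over the current one, and the RTX 4060 itself ships with 8 GB.)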
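
A rough way to check both reasons is to estimate how much memory the model weights alone need. The sketch below is only an approximation: the function name `weight_memory_gib` and the chosen bit widths are illustrative assumptions, and it ignores the KV cache, activations, and runtime overhead, so real usage will be higher. Even at 4 bits per weight, 32 billion parameters come to roughly 15 GiB, which is more than the 12 GB on the RTX 3060. In that case part of the model would have to sit in system RAM, which would be consistent with the slow inference noted above.

```python
def weight_memory_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights only, in GiB.

    Assumption: every parameter is stored at the same bit width.
    KV cache, activations, and runtime overhead are not included.
    """
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


VRAM_GIB = 12  # dedicated memory of the RTX 3060 from the note above

# Illustrative bit widths: 16-bit, 8-bit, and 4-bit weights.
for bits in (16, 8, 4):
    need = weight_memory_gib(32, bits)
    verdict = "fits in" if need <= VRAM_GIB else "exceeds"
    print(f"{bits}-bit weights: ~{need:.1f} GiB, {verdict} {VRAM_GIB} GiB of VRAM")
```

By the same estimate, a 16 GB card only barely holds the 4-bit weights before any context is added, and a 24 GB card leaves more headroom.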