The main goal of this research is development of computer-vision-based automated counters that do not require large training datasets, but are adapted to a previously unseen category by using only a few training examples (few-shot), no training examples (zero-shot) or text-based prompts (text-prompt-based).