We explore the feasibility of deploying LLMs on device, a model in which user prompts and LLM outputs never leave the device premises.| Brave