💎 Gemma3 — Desi Chatbot (GGUF / HF fallback)

Gemma3 (quantized GGUF) — Local inference if available, otherwise fallback to Hugging Face Inference API.

તમારો પ્રશ્ન / Prompt

Max tokens

16 1024

Temperature

0 1.5

Runtime: HuggingFace Inference

Tips: Reduce max tokens if you see OOM. Upload a smaller Q4 quantized GGUF for Spaces.

જવાબ (Response)