title: Aidc Llm Laos
emoji: π
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 5.42.0
app_file: app.py
pinned: false
license: other
short_description: aidc-llm--laos
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
AIDC LLM Laos 10k (Gemma-3 4B IT) - Chat Space
This Space runs a Gradio chat UI for:
Phonepadith/aidc-llm-laos-10k-gemma-3-4b-it
Deploy
- Space type: Python
- Hardware: Prefer GPU (T4/A10/A100). CPU works but is slow.
- Add files:
  - app.py
  - requirements.txt
- (Optional) Set the Space secret `USE_4BIT=1` to enable 4-bit loading (requires a GPU and bitsandbytes).
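
As a rough illustration of how the `USE_4BIT` secret could be honored at startup, here is a minimal sketch. It is not the Space's actual `app.py`: the helper names `wants_4bit` and `loader_options` are hypothetical, and the assumption that 4-bit loading is silently skipped when no GPU is present is ours. The returned dictionary only describes the decision; translating it into real loader arguments is left to the app.

```python
import os


def wants_4bit(env=None):
    """Return True when USE_4BIT is set to "1" (hypothetical helper)."""
    env = os.environ if env is None else env
    return env.get("USE_4BIT", "0").strip() == "1"


def loader_options(env=None, gpu_available=False):
    """Describe how the model should be loaded (illustrative only).

    Assumption: 4-bit loading needs both the secret and a GPU, so it is
    turned off on CPU-only hardware even if USE_4BIT=1 is set.
    """
    use_4bit = wants_4bit(env) and gpu_available
    return {
        "load_in_4bit": use_4bit,
        "device": "cuda" if gpu_available else "cpu",
        # Mirrors the low_cpu_mem_usage=True option mentioned in the notes.
        "low_cpu_mem_usage": True,
    }


if __name__ == "__main__":
    # On a real Space, secrets are exposed as environment variables.
    print(loader_options(gpu_available=False))
```

Keeping the decision in a small pure function makes it easy to check the behavior on CPU-only hardware before wiring it into the model loader.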
Notes
- The app uses the tokenizer's chat template via `apply_chat_template`.
- Adjust `max_new_tokens` to control response length.
- If you run out of memory (OOM) on small GPUs, try:
  - Reduce `max_new_tokens`
  - Set `USE_4BIT=1`
  - Enable `low_cpu_mem_usage=True` (can be added in the loader)
  - Reduce