File size: 928 Bytes
c826c84
 
 
 
 
 
 
 
 
 
 
 
 
 
8399504
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
title: Aidc Llm Laos
emoji: πŸ“‰
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 5.42.0
app_file: app.py
pinned: false
license: other
short_description: aidc-llm--laos
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

# AIDC LLM Laos 10k (Gemma-3 4B IT) β€” Chat Space

This Space runs a Gradio chat UI for:
`Phonepadith/aidc-llm-laos-10k-gemma-3-4b-it`

## Deploy
- Space type: Python
- Hardware: Prefer GPU (T4/A10/A100). CPU works but is slow.
- Add files:
  - app.py
  - requirements.txt
- (Optional) Set Space secret `USE_4BIT=1` to enable 4-bit loading (GPU + bitsandbytes).

## Notes
- The app uses the tokenizer's chat template via `apply_chat_template`.
- Adjust `max_new_tokens` to control response length.
- If OOM on small GPUs, try:
  - Reduce `max_new_tokens`
  - Set `USE_4BIT=1`
  - Enable `low_cpu_mem_usage=True` (can be added in the loader)