Instructions to use ziksy/dataset-templater-qwen35-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use ziksy/dataset-templater-qwen35-lora with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
Dataset Templater LoRA β Qwen3.5-27B
A LoRA adapter that teaches Qwen3.5-27B to produce JSON templates mapping arbitrary HuggingFace datasets into ChatML QA training pairs, and to refuse datasets whose rows carry no usable QA content (embeddings, numeric telemetry, bibliographic metadata).
Input / Output
Input: column names + 1β3 sample rows from a HuggingFace dataset.
Output: a single-line JSON object, one of two shapes:
{"question": "{instruction}\n{input}", "answer": "{output}"}
or, for unusable shapes:
{"usable": false, "reason": "rows are embeddings, not QA pairs"}
Benchmarks
197-item held-out test set; 27% hard negatives across varied shapes (five-axis rubric).
| Axis | Base 27B | Base + LoRA | Delta |
|---|---|---|---|
| valid_json | 100.0% | 100.0% | = |
| correct_refuse | 14.7% | 18.8% | +4.1 |
| column_sanity | 68.5% | 91.9% | +23.4 |
| shape_match | 50.8% | 77.2% | +26.4 |
| exact | 38.6% | 69.5% | +30.9 |
Full per-family breakdown and per-example results are in
eval/lora_results.jsonl and adapters/qwen35-27b/scores.json.
Training
- Base model:
Qwen/Qwen3.5-27B - Examples: 2,000 (synthetic corpus, 27% hard negatives with variety)
- Rank: 16, Alpha: 32, Epochs: 3
- LR: 2e-5, Batch: 4 Γ grad_accum 4, BF16
- Hardware: A100 SXM4 80GB (Vast.ai), 2h11m training
- Final train_loss: 0.6338 Β· eval_loss: 0.311
Known Limitations
neg_tabular(weather-style numeric rows with innocuous column names liketemperature_c,humidity_pct,pressure_hpa): 0/10 refuse. LoRA confidently templates these as QA even though rows are numeric telemetry. This failure mode is stylistic β the corpus-v2 rewrite added many other tabular shapes to teach refusal and those succeed (neg_tabular_v29/10), but the specific weather vocabulary wasn't rebalanced enough.legal_structured(Clarus-style adversarial case): 0/17 shape_match. The LoRA identifies theinput_*fields (17/17 column_sanity) but pairs them differently than ground truth. Base 27B also gets 0/17 β neither scale nor this LoRA solves this specific trap yet.
Usage
Hot-load on llama-server
# Start llama-server with Qwen3.5-27B and this adapter
llama-server \
-m Qwen3.5-27B-Q4_K_M.gguf \
--lora dataset-templater-qwen35-lora-f16.gguf \
--host 127.0.0.1 --port 8788 \
--reasoning off --jinja -ngl 99
Files
adapters/qwen35-27b/dataset-templater-qwen35-lora-f16.ggufβ the adapter (152 MB F16)adapters/qwen35-27b/training_stats.jsonβ loss, hyperparametersadapters/qwen35-27b/scores.jsonβ full eval numbersagora-manifest.jsonβ discovery manifest (Agora registry format)eval/questions.jsonlβ 197-item held-out test seteval/lora_results.jsonlβ per-example predictions + scorestrain_lora.pyΒ·eval_remote.pyβ reproducibility scripts
- Downloads last month
- 5
Hardware compatibility
Log In to add your hardware
16-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support
Model tree for ziksy/dataset-templater-qwen35-lora
Base model
Qwen/Qwen3.5-27B