Model Card for Model ID

This bot gives a bitter review fn any paper you submit. See https://hippocampus-garden.com/tiny_llama_dpo_lora/ for full details.

Model Details

Model Description

  • Developed by: Shion Honda
  • Model type: Text Generation
  • Language(s) (NLP): English
  • License: MIT
  • Finetuned from model: TinyLlama/TinyLlama-1.1B-Chat-v1.0

Model Card Contact

[More Information Needed]

Framework versions

  • PEFT 0.10.0
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora

Adapter
(1097)
this model

Dataset used to train shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora