---
license: mit
language:
- en
- ko
- code
library_name: transformers
tags:
- code-llama
- code-review
- fine-tuning
- SFT
- LoRA
pipeline_tag: text-generation
base_model:
- codellama/CodeLlama-7b-hf
---

# Model Card for codellama-7b-code-review

## Model Details
This model is fine-tuned from Meta's codellama/CodeLlama-7b-hf to review and provide feedback on code changes (diffs) from GitHub Pull Requests. It has been primarily trained on JavaScript and React code reviews, aiming to generate constructive feedback from a senior engineer's perspective on topics like code quality, architecture, performance, and conventions.
- Developed by: ken12377
- Model type: Causal Language Model
- Language(s): English, Korean, Diff format
- License: apache-2.0
- Finetuned from model: codellama/CodeLlama-7b-hf
## Uses

### Direct Use
This model can be used directly for code review automation: given code changes in diff format as input, it generates review comments.
**Warning:** The content generated by the model always requires review. The final decision must be made by a human developer.
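The fine-tune expects an instruction-style prompt that wraps the diff. Below is a minimal sketch of the format, mirroring the full example in "How to Get Started" further down; the template itself is inferred from that example.

```python
# Build a review prompt around a diff. The "### Instruction / Input / Response"
# template is inferred from the getting-started example later in this card.
fence = "`" * 3  # avoid literal triple backticks inside this Markdown block
diff = "--- a/app.js\n+++ b/app.js\n@@ -1 +1 @@\n-const x = 1;\n+const x = 2;"
prompt = (
    "### Instruction:\nReview this pull request diff and suggest improvements.\n"
    f"### Input:\n{fence}diff\n{diff}\n{fence}\n"
    "### Response:\n1. "
)
```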
### Downstream Use
This model can be reused as a base for further fine-tuning on a specific project's internal coding conventions or on more specialized review criteria, as in the sketch below.
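A minimal sketch, assuming the merged checkpoint as the starting point (the LoRA settings here are illustrative, not the values used to train this model):

```python
# Attach a fresh LoRA adapter to the merged checkpoint for further fine-tuning
# on project-specific review data. Rank/alpha below are illustrative only.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "ken12377/codellama-7b-code-review", torch_dtype="auto", device_map="auto"
)
peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()  # only the new LoRA weights should be trainable
# ...then continue training, e.g. with trl's SFTTrainer on your own review dataset.
```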
### Out-of-Scope Use
This model is specialized for code review tasks. It may not perform well for other purposes such as general-purpose chat, code generation, or translation. In particular, input that is not in diff format may lead to unexpected results.
## Bias, Risks, and Limitations
- Data Bias: The model was trained on public GitHub Pull Request data, so it may be biased towards specific coding styles or patterns present in that data.
- Inaccuracy (Hallucination): The model may occasionally generate feedback that is factually incorrect or out of context. The generated reviews always need verification.
- Limited Knowledge: The model's knowledge is limited to the data at the time of fine-tuning and may not reflect the latest library or framework updates.
## Recommendations

Users should treat the code reviews generated by the model as a draft or an assistive tool in the development process, not as a final judgment. It is recommended that a human expert review critical changes.

## How to Get Started with the Model
**Note:** This model may be available in two versions, adapter and merged. Use the code path appropriate for your model type.

1. **Using the adapter model** (`ken12377/codellama-7b-code-review-adapter`): load the base model first, then apply the adapter with the `peft` library.
2. **Using the merged model** (`ken12377/codellama-7b-code-review`): if the adapter has been fully merged into the base model, load it directly without `peft`.
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# --- Configuration (choose one) ---
# 1. For the adapter model
use_adapter = True
base_model_name = "codellama/CodeLlama-7b-hf"
adapter_or_model_name = "ken12377/codellama-7b-code-review-adapter"

# 2. For the merged model
# use_adapter = False
# adapter_or_model_name = "ken12377/codellama-7b-code-review"

# --- Load model and tokenizer ---
if use_adapter:
    base_model = AutoModelForCausalLM.from_pretrained(
        base_model_name,
        torch_dtype=torch.float16,
        device_map="auto",
    )
    tokenizer = AutoTokenizer.from_pretrained(adapter_or_model_name)
    model = PeftModel.from_pretrained(base_model, adapter_or_model_name)
else:
    tokenizer = AutoTokenizer.from_pretrained(adapter_or_model_name)
    model = AutoModelForCausalLM.from_pretrained(
        adapter_or_model_name,
        torch_dtype=torch.float16,
        device_map="auto",
    )
model.eval()

# --- Inference ---
diff_code = """
--- a/src/components/LoginForm.js
+++ b/src/components/LoginForm.js
-import React from 'react';
+import React, { useState } from 'react';
-const LoginForm = () => (
-  <form>
-    <label>Email: <input type="email" /></label>
-    <br />
-    <label>Password: <input type="password" /></label>
-    <br />
-    <button type="submit">Log In</button>
-  </form>
-);
+const LoginForm = () => {
+  const [credentials, setCredentials] = useState({ email: '', password: '' });
+  /* ... (rest of the diff code) ... */
+};
 export default LoginForm;
"""

# The code-block delimiter is kept in a variable to avoid confusing Markdown renderers.
diff_block_delimiter = "```"
prompt = f"""### Instruction:
The provided code is the diff of a pull request. Please give detailed, specific feedback on how the code could be improved, broken down into at least three items.
### Input:
{diff_block_delimiter}diff
{diff_code}
{diff_block_delimiter}
### Response:
1. """

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=512,
    do_sample=True,          # required for temperature to take effect
    temperature=0.7,
    repetition_penalty=1.2,
)
# Decode only the newly generated tokens, skipping the prompt.
response = tokenizer.decode(outputs[0][len(inputs.input_ids[0]):], skip_special_tokens=True)
print(response)
```
## Training Details

### Training Data
This model was fine-tuned on the review_dataset.json file, which contains public pull request data collected from GitHub. The dataset is structured in an instruction, input (diff), output (review comment) format, as sketched below.
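For illustration, a single record presumably looks like this; only the three field names come from the description above, and the values are invented:

```python
# Hypothetical record from review_dataset.json (values are illustrative only;
# the schema is instruction / input (diff) / output (review comment)).
example_record = {
    "instruction": "Review the following pull request diff and suggest improvements.",
    "input": "--- a/src/utils.js\n+++ b/src/utils.js\n+export const sum = (a, b) => a + b;",
    "output": "1. Consider validating that both arguments are numbers before adding them...",
}
```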
### Training Procedure

The model was fine-tuned using the QLoRA technique: the SFTTrainer from the trl library was used with 4-bit quantization and LoRA (Low-Rank Adaptation) for memory-efficient training. A reproduction sketch follows the hyperparameter list below.

### Training Hyperparameters
- model: codellama/CodeLlama-7b-hf
- max_seq_length: 4096
- lora_alpha: 128
- lora_dropout: 0.1
- lora_r: 64
- learning_rate: 2e-4
- optimizer: paged_adamw_32bit
- gradient_accumulation_steps: 8
- per_device_train_batch_size: 2
- max_steps: 1900
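The snippet below is a minimal QLoRA reproduction sketch based only on the hyperparameters above, not the original training script. The nf4 quantization type, the LoRA target modules, the pre-formatted `text` column, and the older-style `trl` SFTTrainer API (which accepted `dataset_text_field`, `max_seq_length`, and `tokenizer` directly) are all assumptions; adjust for your `trl` version.

```python
import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          BitsAndBytesConfig, TrainingArguments)
from trl import SFTTrainer

base_model_name = "codellama/CodeLlama-7b-hf"

# 4-bit quantization for QLoRA (nf4 is an assumption; the card does not say).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    base_model_name, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(base_model_name)
tokenizer.pad_token = tokenizer.eos_token

# LoRA settings from the hyperparameter list; target modules are an assumption.
peft_config = LoraConfig(
    r=64,
    lora_alpha=128,
    lora_dropout=0.1,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

# Assumes review_dataset.json has been pre-formatted into a single "text" column.
dataset = load_dataset("json", data_files="review_dataset.json", split="train")

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=peft_config,
    dataset_text_field="text",
    max_seq_length=4096,
    tokenizer=tokenizer,
    args=TrainingArguments(
        output_dir="./codellama-7b-code-review",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        learning_rate=2e-4,
        optim="paged_adamw_32bit",
        max_steps=1900,
        fp16=True,
        logging_steps=10,
    ),
)
trainer.train()
```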
## Compute Infrastructure
- Hardware Type: RunPod Cloud GPU
- Cloud Provider: RunPod