Hwaran Lee

hwaranlee

https://hwaranlee.github.io

AI & ML interests

Safety and Trustworthiness of AI / Language Models

Recent Activity

upvoted a paper 19 days ago

Token Bottleneck: One Token to Remember Dynamics

liked a model 3 months ago

naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-0.5B

liked a model 3 months ago

naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B

authored a paper about 1 year ago

KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge

Paper • 2402.13605 • Published Feb 21, 2024

authored 6 papers over 1 year ago

Critic-Guided Decoding for Controlled Text Generation

Paper • 2212.10938 • Published Dec 21, 2022

KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application

Paper • 2305.17701 • Published May 28, 2023 • 1

ProPILE: Probing Privacy Leakage in Large Language Models

Paper • 2307.01881 • Published Jul 4, 2023 • 1

TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

Paper • 2402.12991 • Published Feb 20, 2024

LifeTox: Unveiling Implicit Toxicity in Life Advice

Paper • 2311.09585 • Published Nov 16, 2023

KoBBQ: Korean Bias Benchmark for Question Answering

Paper • 2307.16778 • Published Jul 31, 2023

authored a paper almost 2 years ago

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 55