Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Cheng
RosyCheng
Follow
0 followers
·
7 following
https://scholar.google.com/citations?user=smUBVOQAAAAJ&hl=en
Rosy0912
AI & ML interests
LLM Alignment&Security
Recent Activity
authored
a paper
26 days ago
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
authored
a paper
26 days ago
PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization
authored
a paper
26 days ago
Gibberish is All You Need for Membership Inference Detection in Contrastive Language-Audio Pretraining
View all activity
Organizations
Papers
6
arxiv:
2503.18991
arxiv:
2412.05892
arxiv:
2410.18371
arxiv:
2409.04340
Expand 6 papers
models
0
None public yet
datasets
0
None public yet