Weijie Xu
xwjzds
AI & ML interests
LLM Evaluation @Amazon
Recent Activity
authored
a paper
about 1 month ago
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded
Dialogue Generation
authored
a paper
about 1 month ago
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large
Language Models
authored
a paper
about 1 month ago
FalseReject: A Resource for Improving Contextual Safety and Mitigating
Over-Refusals in LLMs via Structured Reasoning