Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Michael Chen's picture
2 2

Michael Chen

michaelchen
·

AI & ML interests

None yet

Organizations

Open-Source AI Meetup's profile picture Utility Foundations's profile picture

Collections 2

Evals
  • SciCode: A Research Coding Benchmark Curated by Scientists

    Paper • 2407.13168 • Published Jul 18, 2024 • 14
  • AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

    Paper • 2407.15711 • Published Jul 22, 2024 • 9
  • The Vision of Autonomic Computing: Can LLMs Make It a Reality?

    Paper • 2407.14402 • Published Jul 19, 2024 • 14
Robustness
  • Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

    Paper • 2407.13833 • Published Jul 18, 2024 • 12
Evals
  • SciCode: A Research Coding Benchmark Curated by Scientists

    Paper • 2407.13168 • Published Jul 18, 2024 • 14
  • AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

    Paper • 2407.15711 • Published Jul 22, 2024 • 9
  • The Vision of Autonomic Computing: Can LLMs Make It a Reality?

    Paper • 2407.14402 • Published Jul 19, 2024 • 14
Robustness
  • Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

    Paper • 2407.13833 • Published Jul 18, 2024 • 12

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs