yicui's Collections

Theory

updated Nov 14, 2024

  • KAN: Kolmogorov-Arnold Networks

    Paper • 2404.19756 • Published Apr 30, 2024 • 114

  • The Platonic Representation Hypothesis

    Paper • 2405.07987 • Published May 13, 2024 • 2

  • The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning

    Paper • 2304.05366 • Published Apr 11, 2023 • 1

  • Explaining NonLinear Classification Decisions with Deep Taylor Decomposition

    Paper • 1512.02479 • Published Dec 8, 2015 • 1

  • Large Language Models as Markov Chains

    Paper • 2410.02724 • Published Oct 3, 2024 • 34

  • Neural Machine Translation by Jointly Learning to Align and Translate

    Paper • 1409.0473 • Published Sep 1, 2014 • 6

  • Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models

    Paper • 2310.17086 • Published Oct 26, 2023 • 1

  • Cross-Entropy Loss Functions: Theoretical Analysis and Applications

    Paper • 2304.07288 • Published Apr 14, 2023 • 1

  • The Geometry of Concepts: Sparse Autoencoder Feature Structure

    Paper • 2410.19750 • Published Oct 10, 2024 • 2

  • Scaling Laws for Precision

    Paper • 2411.04330 • Published Nov 7, 2024 • 8