Shivansh Chaudhary
Shivansh000
Recent Activity
posted an update about 16 hours ago
Just did my research on the latest Kimi K2 Thinking launch from Moonshot AI, and I strongly believe this moment is a major inflection point in the open agentic AI ecosystem.
The breakthrough lies in test-time scaling, moving us past constrained generation to long-horizon problem-solving agents. They’ve shown the capacity for 200-300 sequential tool calls, preserving context and reasoning throughout, alongside self-correction over extended computation. And, remember, this is the worst it’s ever going to be.
We have demonstrably entered the phase of deep, structured cognition, and the ability to perform 23 interleaved reasoning steps to solve a PhD-level math problem is a great demonstration of this cognitive depth.
Unsurprisingly, the SOTA benchmarks reinforce this reality. More crucially for the industry, this is an open-weights release.
Thanks to the Moonshot team for providing a new anchor point for the open AI ecosystem.
https://huggingface.co/moonshotai
https://huggingface.co/moonshotai/Kimi-K2-Thinking
posted an update 8 days ago
I am dedicating this weekend to reading and practicing with the latest b(ook)log from Hugging Face. It is meant to be a guide for anyone trying to go from “we have a great dataset and GPUs” to “we built a really strong model.” Will share thoughts upon completion.
Thanks for the treat @eliebak @ThomasWolf and HF team!
https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook
reacted to their post with 🤗 over 1 year ago
My 1st post on 🤗! I would love to discuss topics related to bias in LLMs:
1) Are researchers and enterprises concerned about detecting and addressing social bias in Gen AI applications? If so, what are the existing approaches?
2) Are there trusted and labeled datasets to evaluate bias in LLM generations?