AI & ML interests

None defined yet.

AtAndDev 
posted an update 5 days ago
view post
Post
202
Qwen 3 Coder is a personal attack to k2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isnt necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE
AtAndDev 
posted an update about 2 months ago
view post
Post
2886
deepseek-ai/DeepSeek-R1-0528

This is the end
  • 1 reply
·
AtAndDev 
posted an update 4 months ago
view post
Post
3120
Llama 4 is out...
·
AtAndDev 
posted an update 4 months ago
view post
Post
4345
There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...
  • 6 replies
·
AtAndDev 
posted an update 5 months ago
view post
Post
1660
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
not-lain 
posted an update 5 months ago
AtAndDev 
posted an update 5 months ago
not-lain 
posted an update 6 months ago
AtAndDev 
posted an update 6 months ago
view post
Post
1942
everywhere i go i see his face
AtAndDev 
posted an update 6 months ago
view post
Post
577
Deepseek gang on fire fr fr
AtAndDev 
posted an update 6 months ago
view post
Post
1655
R1 is out! And with a lot of other R1 releated models...
not-lain 
posted an update 6 months ago
view post
Post
1788
we now have more than 2000 public AI models using ModelHubMixin🤗
not-lain 
posted an update 7 months ago
view post
Post
4135
Published a new blogpost 📖
In this blogpost I have gone through the transformers' architecture emphasizing how shapes propagate throughout each layer.
🔗 https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :
AtAndDev 
posted an update 7 months ago
view post
Post
495
@s3nh Hey man check your discord! Got some news.
  • 4 replies
·
lunarflu 
posted an update 8 months ago
not-lain 
posted an update 9 months ago
view post
Post
2447
ever wondered how you can make an API call to a visual-question-answering model without sending an image url 👀

you can do that by converting your local image to base64 and sending it to the API.

recently I made some changes to my library "loadimg" that allows you to make converting images to base64 a breeze.
🔗 https://github.com/not-lain/loadimg

API request example 🛠️:
from loadimg import load_img
from huggingface_hub import InferenceClient

# or load a local image
my_b64_img = load_img(imgPath_url_pillow_or_numpy ,output_type="base64" ) 

client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
	{
		"role": "user",
		"content": [
			{
				"type": "text",
				"text": "Describe this image in one sentence."
			},
			{
				"type": "image_url",
				"image_url": {
					"url": my_b64_img # base64 allows using images without uploading them to the web
				}
			}
		]
	}
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct", 
	messages=messages, 
	max_tokens=500,
	stream=True
)

for chunk in stream:
    print(chunk.choices[0].delta.content, end="")
lunarflu 
posted an update 11 months ago
not-lain 
posted an update 12 months ago