Blazgo ( )

replied to their post 1 day ago

Yes! It has gone from 671B to 2.7T in slightly more than 2 weeks.

671B to 2.7T?!? I can barely run 235B models! (and that's with Q3 or Q2).

Regardless, as long as it performs well and does the job i suppose.

But i wonder if trimming the models or getting them to be more optimized in size vs performance shouldn't be a bigger push. Though i'm new to this scene so i could just be ignorant in how this is all done.

"Ultra" is the keywords here. We are going to release Pro and Mini models. Also because of DynaMoE architecture (publishing soon), you can run parts of the model on your setup. The downside is it isn't very well supported by many apps so everything will be using our code (until support improves).

reacted to their post with ❤️ 2 days ago

Post

2506

🚀 Deca 3 Ultra Alpha is coming in the next 72 hours! 🚀

We're on the verge of something monumental. Right now, we're in the final stages of testing, and we're about to drop a game-changing milestone in the open-source AI community. 🎉

In just two weeks, we've managed to almost 4x the size of the largest open-source LLM at that time (and we are still 2.6x bigger than the largest LLM). This is unprecedented and a testament to the power of collaboration, innovation, and the relentless pursuit of pushing AI to its limits.

The future of open-source AI is now. Stay tuned for the release – we’re just getting started.

- Model testing finishes: 24hrs from now
- Model gets uploaded: 30hrs from now
- Related code/inference stack gets published: 70-90hrs from now

5 replies

·

replied to their post 2 days ago

Yes! It has gone from 671B to 2.7T in slightly more than 2 weeks. This Alpha variant is very crude and proof-of-concept so don't expect insane performance, but it is nice to see that open-source AI is catching up.

posted an update 3 days ago

Post

2506

🚀 Deca 3 Ultra Alpha is coming in the next 72 hours! 🚀

We're on the verge of something monumental. Right now, we're in the final stages of testing, and we're about to drop a game-changing milestone in the open-source AI community. 🎉

In just two weeks, we've managed to almost 4x the size of the largest open-source LLM at that time (and we are still 2.6x bigger than the largest LLM). This is unprecedented and a testament to the power of collaboration, innovation, and the relentless pursuit of pushing AI to its limits.

The future of open-source AI is now. Stay tuned for the release – we’re just getting started.

- Model testing finishes: 24hrs from now
- Model gets uploaded: 30hrs from now
- Related code/inference stack gets published: 70-90hrs from now

5 replies

·

replied to pagezyhf's post 10 days ago

Please add Kimi k2!

replied to retronic's post 15 days ago

Use a gguf?

replied to andito's post 22 days ago

is Mirage opensource?

replied to AtAndDev's post about 2 months ago

😭

replied to omaragent03's post 3 months ago

You can do that here: https://huggingface.co/spaces/autotrain-projects/autotrain-advanced?duplicate=true
You will almost always need a GPU

replied to daavoo's post 4 months ago

Routers of routers...

AI & ML interests

Recent Activity

Organizations

AI & ML interests

Recent Activity

Organizations

Blazgo's activity